160 likes | 303 Vues
e-Business Architecture with Enterprise Search Engine. KOCSEA 2008 Jongwook Woo, PhD jwoo5@calstatela.edu High-Performance Internet Computing Center Computer Information Systems Department California State University, Los Angeles. Abstract.
E N D
e-Business Architecture with Enterprise Search Engine KOCSEA 2008 Jongwook Woo, PhD jwoo5@calstatela.edu High-Performance Internet Computing Center Computer Information Systems Department California State University, Los Angeles
Abstract • e-Business architecture has been updated dramatically for the past 10 years. The architecture moved from the traditional client-server architecture to n-tier architecture since Internet and Web came out to the world. Internet and web even create new product and revenue as companies have invested in Information Technology seriously not as an option. Content is one of the examples in this new era. Content has become intellectual property and a company or organization that has content can make profit by providing the content to the customers and the users. As the content increases high, there is a need for search engine in order to provide the proper content to the user quickly. Thus, search engine has received highlight in the content industry, mostly portal sites. The paper introduces search engine - especially internal search engine not web search engine - and its fundamentals. Then, e-Business architecture with search engine is illustrated.
Search Engine • Information retrieval system • Help users find what they need • In Internet or Intranet • Minimize time required to find information • search engines • Web search engine • Enterprise search engine • Ex: Apache Lucene, Solr, FAST
Search Engine Basics • Indexing • Querying • Ranking • Lemmatization and Phrase search • Geo search • Search Navigation
Querying • Basic of all search engine • The user uses query to search • With search terms AND, OR, NOT, NEAR • The process of specify criteria about an item on interest and have the engine find the matching items. • Syntax of querying • Formalized • Natural Language Search
Indexing • How data is collected and stored • then retrieve once a user do a search. • Optimize the speed and performance of finding relevant documents. • Need to determine ranking by Relevance and similarity • Determines the order in which it is presented to the user
Lemmatized/Phrase Search • Lemmatized • A search which allows the user to enter a lemma and return the inflected forms • Ex: hamburger vs hamburgers • Phrase • A query that matches documents containing a particular sequence of terms. • Ex: “The Great Wall”
Others • Geo-search • Parse text and identify the coordinates of the geographic entities mentioned in the query • Ex: East Palo Alto CA -> Latitude: 37.47 N, Longitude: 122.14 W • Facet Browsing • Search result can be categorized • and displays the counts of each search results categorized
Local (Enterprise) Search • Used by corporation to search in their own intranets to retrieve information. • Either used in public or not • Portal sites provides and sells contents • Contents is digital info: • text, multimedia data etc • Ex: citysearch.com, insiderpages.com, superpages.com etc
Web Search • Used to search the World Wide Web • Ex: • www.Google.com • www.msn.com • www.Yahoo.com • Probably no more market to compete with these dinosaurs
Legacy N-Tier Architecture • Presentation Logic • Only for the user • Business Logic • Data Access Logic • Normally DB • Search query communicates with DB
n-Tier Architecture N-Tier Web Architecture DB Server The user searches the site DB Access Library Business Logic Presentation Logic
Latest N-Tier Architecture • Presentation Logic • Both for the user and the author • Business Logic • Data Access Logic • Both for DB • Search Access Logic • Connects Search Engine that search query communicates with
n-Tier Architecture N-Tier Web Architecture with Enterprise Search Engine Search Engine DB Server Load/index DB to Search DB Search DB The user searches the site Search Access Library DB Access Library Business Logic Presentation Logic
Summary • Search Engine should not be optional for Internet Business • Enterprise Search has been popular by providing contents to the user and the third parties • as contents are product • Don’t need to compete with Web Search Business such as Google, Yahoo, MSN etc