1 / 6

Advancing Web Understanding: Knowledge-Based Search Engines

Learn how Google and Microsoft revolutionized search with knowledge-based techniques, extracting concepts and categorizing information for more accurate results. Explore the "Web of Concepts" and the Memory Cloud model for efficient data processing. Discover the limitations and future prospects of language comprehension in search engines.

amaris
Télécharger la présentation

Advancing Web Understanding: Knowledge-Based Search Engines

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. How Google and Microsoft taught search to “understand” the Web Austin Granger Chris Hesemann

  2. Knowledge of the Web • String searching does not always convey the true meaning of content. • Search by knowledge, not by sub-string matching. • Extracting and categorizing concepts allows for knowledge-based searching.

  3. “Web of Concepts” • Extract raw data (phone numbers, addresses, prices, etc.). • Link related entities together (e.g., link actor to movie). • Categorize information about each entity (what does this store sell, what has this author written, how highly are they reviewed?).

  4. Search engines discover webpages, parse them into objects and data, process them and store the data, updating existing entries as needed. • “Concept web” stored in vast databases. • Not traditional databases. • Based on graph theory, not relational model. • Database consists of nodes and links.

  5. Memory Cloud • To make this efficient we must traverse the entire graph in milliseconds. • One solution – “memory cloud.” • Store entire database within memory at all times. • Example: Google search “blowfish” • Results: Show company, encryption algorithm, sushi • New results: Suggest “pufferfish”

  6. Limitations • Currently only works in English. • Including other languages increases the complexity exponentially, we’ve got a long way to go. • Dissecting language to understand searches written in normal language, not just keywords. The Future of Knowledge Searching

More Related