1 / 18

Facilitating Semantic Web Search with Embedded Grammar Tags (EGTs)

Facilitating Semantic Web Search with Embedded Grammar Tags (EGTs). Gautham K.Dorai Yaser Yacoob Department of Computer Science University of Maryland – College Park. The Future – A Forecast. What is the value of Nasdaq today ?. The value is 10746.2!!. Speech Grammar based Search Engine.

olaf
Télécharger la présentation

Facilitating Semantic Web Search with Embedded Grammar Tags (EGTs)

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Facilitating Semantic Web Search with Embedded Grammar Tags (EGTs) Gautham K.Dorai Yaser Yacoob Department of Computer Science University of Maryland – College Park

  2. The Future – A Forecast What is the value of Nasdaq today ? The value is 10746.2!! Speech Grammar based Search Engine WWW ???

  3. Roadmap • Forecast • Problem Statement and Our Solution • Related Work • Demonstration • Summary • Future Work

  4. Problem Statement(1) • Web Content is represented for human consumption • Software agents do not have interpretive tools for semantic information recovery • Hence agents cannot understand web content

  5. Problem Statement(2) • Web content does not support queries by natural language interaction • e.g. : Query : “What is the weather at College Park” - searches lead to links on related subject content

  6. Our Solution(1) • We embed natural language queries in the web content • Embedded Grammar Tags (EGTs) represent queries in a general (parseable) format • Discovery of relevant response by EGT matching

  7. EGT – The Big Picture HTML Page EGT Annotation Internet QUERY NLP EGT Search Web Search Engines

  8. Our Solution(2) • EGT uses the general BNF grammar format to represent queries • E.g.: <ROBOTGRAM-IN> * [is] the temperature [is] at College Park </ROBOTGRAM-IN> • Captures queries such as - What is the temperature at College Park ? - Tell me what the temperature is at College Park ?

  9. Our Solution(3) • EGT structure : * can be replaced by any word/set of words () mandatory words for EGT match [] optional words • Web Content is annotated with EGTs e.g: <td valign= “bottom”><span class =“redtemps”> <ROBOTGRAM-IN> * [is] the weather [is] at College Park [is]</ROBOTGRAM-IN> mostly sunny

  10. Our Solution(4) • EGTs - More examples : <span class = “FSnPSmDk”> Wind: </span> <ROBOTGRAM-IN> * [is the] [wind] (speed|velocity) [of the] [wind] [is] at <city> [is] </ROBOTGRAM-IN> 3mph <td align = “RIGHT” bgcolor=“#DDDDDD”><font face = “arial , helevetica, sans-serif” size = “2” class = “mkcharttxt”> <ROBOTGRAM-IN> * [is] [Nasdaq *] [the] (value|quote|price) [of Nasdaq] * </ROBOTGRAM-IN> 2222.42 </font></td>

  11. Related Work(1) • Natural Language Processing (NLP) – attempt to uncover meaning in HTML content • DAML, RDF, SHOE, XML – add metadata to describe the web content Facilitate more efficient content search

  12. Related Work(2) • E.g. : <v:Email> gautham@engineer.com </v:Email> <v:Name> Gautham Dorai </v:Name> - special tags (<v:Email>,<v:Name>) to describe content • Fine grained natural language queries on the content require an expandable universally available tag database

  13. Why EGTs ? (1) • RDF Triple can also be used e.g: College Park - [weatherat]  mostly sunny Nasdaq - [quoteof]  2222.42 • But EGTs are naturally expandable and more amenable to change • Simplifies search engine complexity

  14. Why EGTs ?(2) • EGTs describe content in an unconstrained format • EGTs are already present in speech recognition technology • Ease of transition from visualphone browsers

  15. Demonstration • We annotate a given home-page with EGTs • The user can query the content in natural language • Search engine parses the web page for EGT match and responds

  16. EGT Annotator (Preliminary) • Create a template page that is EGT-ready, i.e., EGTs are transparent to the user • The template is for home-pages at CS Dept. • The user can simply copy information from the HTML page onto the annotator

  17. Summary • EGTs enable software agents to respond to natural language queries • An EGT search engine can be implemented on top of conventional content search engines • Responses are constructed based on the extracted information from an EGT match

  18. Future Work • Expandable Universal Grammar - Universally Available query grammar packages • EGT Recognition Metrics - Statistical Analysis to search for EGT matches • EGT Crawler - Crawler that parses through EGT annotated web-pages

More Related