1 / 17

Lexical Markup Framework & LingInfo

Lexical Markup Framework & LingInfo. Paul Buitelaar Competence Center Semantic Web & Language Technology Lab DFKI GmbH - Saarbrücken, Germany contributions by Michael Sintek and others (LingInfo); Nils Reiter (lexical enrichment). Overview. Lexical Markup Framework (LMF)

Télécharger la présentation

Lexical Markup Framework & LingInfo

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Lexical Markup Framework & LingInfo Paul Buitelaar Competence Center Semantic Web & Language Technology Lab DFKI GmbH - Saarbrücken, Germany contributions by Michael Sintek and others (LingInfo); Nils Reiter (lexical enrichment)

  2. Overview • Lexical Markup Framework (LMF) • Motivation, Model, Example • LingInfo • Motivation, Model, Example • Discussion

  3. LMF Motivation • Provide a common model for the creation and use of lexical resources • Manage the exchange of data between and among these resources • Enable the merging of individual electronic resources to form extensive global electronic resources

  4. LMF Model – “Core Package”

  5. LMF Model – “NLP Semantics”

  6. LMF Example – ‘Homonymy’

  7. LingInfo Motivation • Lexicalized Ontologies • Representation of terms instead of ontology class labels • Lexical (morphosyntactic) information for such terms – to handle morphological& syntactic term variations and disambiguation • (Lexical) Semantics is strictly in the (domain) ontology • Lexical Ontology Enrichment • Increasing number of ontologies published on (Semantic) Web • Reuse of knowledge in semantic annotation, ontology-based information extraction, question answering, etc. • Enrichment/integration of ontologies with lexical knowledge – terminology, synonyms, translations, morphosyntactic information

  8. Student studies_at located_at University Campus works_at is_part_of Staff School has_German_term Fakultät has_Dutch_term has_US-English_term Faculteit School Ontologies – Multilingual Labels (RDF)

  9. Student studies_at located_at University Campus works_at is_part_of Staff School has_term Term instance_of instance_of Fakultät faculteit school language language language DE NL EN-US Ontologies – Multilingual Terms (~OMV)

  10. Lexicalized Ontologies - LingInfo SCHOOL hasLingInfo LingInfo instanceOf Term-1 hasMorphSynInfo hasOrthographicForm WordForm-1 hasLang fakulteitsgebouw NL hasPoS hasStem hasStem “department building” “school” Term-2 Term-3 N hasOrthographicForm hasOrthographicForm “department” “school” gebouw fakulteit “building”

  11. Mapping Lexical to Semantic Structure isLocatedAt SCHOOL BUILDING hasLingInfo hasLingInfo LingInfo LingInfo instanceOf instanceOf instanceOf Term-1 hasMorphSynInfo hasOrthographicForm WordForm-1 hasLang fakulteitsgebouw NL hasPoS hasStem hasStem “department building” “school” Term-2 Term-3 N hasOrthographicForm hasOrthographicForm “department” “school” gebouw fakulteit “building”

  12. Lexical Enrichment • Derive synonyms from WordNet • Check if class names are lexical entries in WordNet • Extract synonyms from corresponding synsets • Sense Disambiguation (pick the right synset) • Derive translations from Wikipedia • Check if class names correspond to Wikipedia pages • Extract translations through “Interlanguage links“ • Sense Disambiguation • Derive morphosyntactic information from corpora • Reverse engineer the lexicon behind PoS-tagger / Morph Analyzer (advantage: disambiguation in context)

  13. LingInfo Info http://olp.dfki.de/LingInfo/

  14. Discussion – Homonymy (bank) LingInfo Ontology ... WordForm hasMorphSynInfo ... LingInfo ISO LangCode hasLang Geo Ontology Finance Ontology Z278 Y345 X123 X100 hasLingInfo hasLingInfo hasLingInfo instanceOf instanceOf instanceOf X100.1 X123.1 Z278.1 hasMorphSynInfo hasMorphSynInfo hasMorphSynInfo hasOrthographicForm hasOrthographicForm hasOrthographicForm hasLang hasLang hasLang river EN bank EN money EN

  15. LingInfo Ontology hasMorphSynInfo LingInfo hasLang Discussion – Metonymy (car-engine) “switch off the car” > ‘switch off the engine (process) of the car’ SUMO DOLCE 39890290 ‘artifact’ 23908239 ‘process’ Automotive Ontology hasPart 4657 8393 hasLingInfo hasLingInfo instanceOf instanceOf 4657.1 8393.1 hasMorphSynInfo hasMorphSynInfo hasOrthographicForm hasOrthographicForm hasLang hasLang car EN engine EN

  16. LingInfo Ontology hasMorphSynInfo LingInfo hasLang Discussion – Syst. Polysemy (act/human) “John is an excellent catcher / a fraud / a new arrival / a pushover / ...” http://www.dfki.de/~paulb/corelex.html SUMO DOLCE 39495672 ‘human’ 468934208 ‘act’ Sports Ontology hasAction 223AX 345ZD hasLingInfo hasLingInfo instanceOf instanceOf 223AX.1 345ZD.1 hasMorphSynInfo hasMorphSynInfo hasOrthographicForm hasOrthographicForm hasLang hasLang catcher EN to catch EN

  17. Thanks for Your Attention! http://olp.dfki.de/LingInfo/ http://olp.dfki.de/OntoSelect/ http://www.dfki.de/~paulb/corelex.html

More Related