1 / 15

Extended Named Entity Ontology with Attribute Information

Extended Named Entity Ontology with Attribute Information. LREC 2008 May 28, 2008. Satoshi Sekine New York University. Named Entity. Named Entity is the most important information unit in many Information Access applications (such as IE, Q&A, Summarization, IR, MT) History

adelie
Télécharger la présentation

Extended Named Entity Ontology with Attribute Information

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Extended Named Entity Ontologywith Attribute Information LREC 2008 May 28, 2008 Satoshi Sekine New York University

  2. Named Entity • Named Entity is the most important information unit in many Information Access applications (such as IE, Q&A, Summarization, IR, MT) • History • MUC6First define Named Entity • Person, Location, Organization, Date, Time, Money, Percent • IREX • MUC6 + Artifact • ACE (20 kinds),TIMEX (Standerdized Time Expression) • Problem: Is it enough with 7~20 categories? What is the meaning of names?

  3. Extended Named Entities • Extended to 200 categories (LREC 02,04) • Finer categories • Location →GPE(Country, Province, City…)   → Geographical region (landform, water form …)   → Region(Domestic region, Continental region …)   → Astral body(Star, Planet …) • New categories • Line(Railroad, Road, Waterway, Tunnel Bridge …) • Product (Vehicle, Food, Cloth, Weapon, Award …) • Event (Games, Conference, Natural Phenomena, War …) • Disease, Currency, God … • Era, Age, Color, Unit

  4. Development of ENE • Long time, steady development for years • Capital words in English newspaper (~2000) • Q&A, IE examples • Refer Encyclopedia, WordNet,,, • Refer Related work, Related systems • 100->140->200->210 • Used in IE and Q&A system and refine the definition • http://nlp.cs.nyu.edu/ene

  5. What is Named Entity? • Name is only a label • Properties and Attributes are the essential meaning • “Hudson River” is still “Hudson River” even if people call it “Muh-he-kun-ne-tuk” • Meaning of the entity can discerned from • “the river is in New York State” • “It is 507 km in length” • “It runs Adirondack Mountains to Upper New York Bay” • Name is only a label which can be used to refer to the river

  6. Attributes • “River” has attributes such as “source location”, “outflow”, “length” and so on • “People” has attributes such as “occupation”, ”birth date”, “nationality” and so on • Design those attributes and construct the knowledge will be very useful on the applications of NLP technologies • Q&A, IE, IR, Dialogue, co-reference…

  7. Design of the attributes • We use encyclopedia • Encyclopedia is the knowledge archive of named entities (dictionary for common words) • Description must contain many attributes • We will extract attributes from description of named entities (samples) and compile general attributes for each category

  8. Procedure • Extract (up to 50) sample name entity instances for each categories. We use a famous Japanese Encyclopedia, “Nippon Daihyakka (Nipponica)” published by Shogakkan Inc. • Annotators extract possible attribute values from description of the samples, and name the attribute label (Attribute values must be a noun phrase or equivalent) • Unify the attribute labels and identify the important (essential and mandatory) attributes for each category • Redesign the ENE categories • Construct a set of attributes

  9. Attributes for Person

  10. Attributes for International Organization

  11. Problemswe encountered and/or we haven’t solved yet • Entity dependent attributes ex) Song/Poem of river, “Loreley” on “Rhine River” • Fineness of attribute ex) Bird’s “color of head” or “color of body” • Span of value expression Longer than a noun phrase, ex) definition • Structure in value ex) Museum’s exhibit has own attributes (author, year) • ENE category definition Attributes are useful to define categories, but not always • Distinction of mandatory and optional Distinction of Property and attribute

  12. Inter-annotator Agreement • 2 annotator work on Person, Landform, International Organization and Academy • They agree more often on attributes which have values very often • They disagree the span of values

  13. Summary • Design Attributes on Extended Named Entity • Attributes are important in applications • Created based on Encyclopedia description • Document available (in Japanese, English in progress) • Dictionary / Tagger in development • http://nlp.cs.nyu.edu/ene

  14. Application • Q&A/IR • What is the 15th highest mountain in the world • How many mountains are there which is higher than 6000m • Tell me the major league player from New York • I met Satoshi Sekine from New York • Document understanding • “Yankees came back home!!” • “I visited the Marakech’s main sightseeing places”

More Related