1 / 7

Comprehensive Survey of Annotation Work in Natural Language Processing

This document presents a detailed overview of annotation work conducted in NLP, focusing on various phenomena such as noun senses, verb frames, time expressions, and discourse analysis. The session, chaired by Eduard Hovy, features contributions from experts across institutions, discussing frameworks like PropBank, OntoBank, FrameNet, and more. With a focus on speed, reliability, and accuracy, the findings highlight methodologies in information extraction, discourse structure, and opinion detection, emphasizing collaborative efforts and advancements in semantic annotation.

marlon
Télécharger la présentation

Comprehensive Survey of Annotation Work in Natural Language Processing

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Survey of Annotation Work Joint session Thursday afternoon, April 14 Chair: Eduard Hovy, ISI

  2. Phenomena (from OntoBank)

  3. Notional goal phenomenon annot annot functionality funder speed reliability need • noun senses 25 wph 86/90% IE,MT,QA... high • verb senses 70 wph ~87% MT,QA,WSD high • verb frames 80 w/week 87% MT,QA,IE… high • time exprs 18 wpm 96% QA,IR,Summ med-hi • discourse 100K in 400h ~90/80% Summ,QA med • gazetteers ? ~95/90% QA,IE high • opinions 100K in 400h ~76% QA,Summ med-hi • number exprs ? ? IE,QA,Summ med • hypotheticals ? ? QA,Summ low?

  4. Agenda I • Predicate/verb level: • PropBank I and II: Martha Palmer, UPenn • OntoBank corefs: Lance Ramshaw, BBN • IAMTC consortium: Steve Helmreich, NMSU • FrameNet: Charles Fillmore, UC Berkeley • Extended LCS: Bonnie Dorr, U Maryland • Nominal level: • NomBank: Adam Meyers, NYU • ACE: Ralph Grishman, NYU • Terminology banks: • WordNet: Christiane Fellbaum, Princeton • Omega: Eduard Hovy, USC/ISI to PropBank to OntoBank coref to IAMTC to Framenet to LCS to NomBank and Pie-in-the-Sky to ACE to WordNetPlus to Omega

  5. Agenda II • Discourse level: • RST treebank: Lynn Carlson, DoD • Penn discourse treebank: Aravind Joshi, UPenn • Specific semantic phenomena: • TIMEX: Lisa Ferro, MITRE & Beth Sundheim, SPAWAR • ILIT: Sergei Nirenburg, UMBC • Opinions: Jan Wiebe, U Pitt • Gazetteers: Beth Sundheim, SPAWAR • Inference and reasoning: • WN Entailments: Christiane Fellbaum, Princeton • CYC: Dave Schneider • Scone: Scott Fahlman to RST to Penn discourse to TIMEX to ILIT to opinions to gazetteers to WN entailments to CYC to Scone

  6. Summary of annot work

More Related