1 / 16

Increasing the coverage of answer extraction by applying anaphora resolution IS-LTC

Increasing the coverage of answer extraction by applying anaphora resolution IS-LTC October 10 2006 Jori Mur Humanities Computing University of Groningen. Outline. Background Question Answering (QA) Off-line answer extraction Anaphora resolution for answer extraction

Télécharger la présentation

Increasing the coverage of answer extraction by applying anaphora resolution IS-LTC

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Increasing the coverage of answer extraction by applying anaphora resolution IS-LTC October 10 2006 Jori Mur Humanities Computing University of Groningen

  2. Outline • Background • Question Answering (QA) • Off-line answer extraction • Anaphora resolution for answer extraction • Anaphora resolution technique for definite nouns • Anaphora resolution technique for pronouns • Experiment and Results • Conclusion

  3. Question Anwering (QA) • Task: Find an answer in a text collection to a question posed in a natural language. • Question: How old is John McEnroe?Answer: 35 years • Question: When was Hillary Clinton born?Answer: October 26 1947

  4. Off-line answer extraction • Use dependency parser to parse the corpus • Define dependency patterns • [Location Name] has [Number] inhabitants<have, subj, [Location Name]><have, obj, inhabitants><inhabitants, det, [Number]> • Match dependency relations of sentence from text with dependency pattern • Extract and save facts

  5. Text: McEnroe was injured on his right knee. [...] The problems with his knee kept bothering the 35-year old American for two weeks. Problem

  6. Anaphora resolution for definite nouns • Modify patterns to match definite nouns • [Definite noun] has [Number] inhabitants<have, subj, [Definite noun]><have, obj, inhabitants><inhabitants, det, [Number]> • Create instance list using predicate and apposition relation • Select first preceding name, check if it occurs together with the noun at the instance list • Fall back: select first preceding name

  7. Experiment • 12 question types • Age • Date of Birth • Location of Birth • Capital • Date of Death • Location of Death • Manner/Cause of Death • Age of Death • Founded • Function • Inhabitants • Winner

  8. Experiment • Clef corpus for Dutch: Two newspapers (Algemeen Dagblad and NRC Handelsblad) • 1994 and 1995 • Simple predefined dependency patterns and patterns based on anaphora resolution • 200 Dutch Questions of Clef-2005 • QA system: Joost

  9. Results for extraction • Around 10,900 fact-types extra

  10. Results for QA • 200 questions from Clef-2005 data-set

  11. Discussion of Results • Hypothesis 1: Precision should be increased. • Hypothesis 2: Selection of types was limited. • Hypothesis 3: Answers to questions occur in one sentence

  12. Answer in one sentence • Question 107: Who was the pilot of the mission that repaired the astronomic satelite, the Hubble Space Telescope? • Text AD19940719: Bowersox was the pilot of the mission that repaired the astronomic satelite, the Hubble Space Telescope.

  13. Conclusion • One way to improve the coverage of answer extraction is anaphora resolution • Although precision drops it doesn’t hurt the performance of QA. Result even improved. • It should be investigated what happens if the domain of question types on which anaphora resolution is applied is broadened • It should be investigated what happens if the questions are really independent of the corpus

  14. Questions?

  15. Anaphora resolution for pronouns • Modify patterns to match pronouns • [Pronoun] has [Number] inhabitants<have, subj, [Pronoun]><have, obj, inhabitants><inhabitants, det, [Number]> • Create list of boys and girls names (baby names site at the internet) • Select first preceding name, check if it does not occur on the list of the opposite sexe of the pronoun • Fall back: select first preceding name

  16. Text: NH19941209 35-year old McEnroe ... Question: How old is McEnroe ? Answer: 35 Example

More Related