1 / 25

Correcting Misuse of Verb Forms

Correcting Misuse of Verb Forms. John Lee , Stephanie Seneff Computer Science and Artificial Intelligence Laboratory,  MIT, Cambridge. ACL 2008. Outline. Introduction Background System Baselines Data Evaluation Conclusions. Introduction. Introduction.

mea
Télécharger la présentation

Correcting Misuse of Verb Forms

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Correcting Misuse of Verb Forms John Lee , Stephanie Seneff Computer Science and Artificial Intelligence Laboratory,  MIT, Cambridge ACL 2008

  2. Outline • Introduction • Background • System • Baselines • Data • Evaluation • Conclusions

  3. Introduction

  4. Introduction

  5. Introduction

  6. Outline • Introduction • Background • System • Baselines • Data • Evaluation • Conclusions

  7. Background The goal is to correct confusions among the five forms, as well as the infinitive caused by semantic and syntactic errors. Semantic Errors Suppose one wants to say “I am prepared for the exam”, but writes “I am preparing for the exam”.

  8. Background Syntactic Errors Subject-Verb Agreement He *have been living there since June. Auxiliary Agreement He has been *live there since June. Complementation He wants*live there.

  9. Outline • Introduction • Background • System • Baselines • Data • Evaluation • Conclusions

  10. System Step1 Automatic Parsing “My father is *work in the laboratory.”

  11. System Step2 Replacing the verb forms

  12. System

  13. System Step3 N-gram counts as a filter Using WEB 1T N-GRAM corpus. Prepared by Google Inc.

  14. Outline • Introduction • Background • System • Baselines • Data • Evaluation • Conclusions

  15. Baselines majority baseline No correction. verb-only baseline(Only used in Auxiliary Agreement & Complementation) It attempts corrections only when the word in question is actually tagged as a verb.

  16. Outline • Introduction • Background • System • Baselines • Data • Evaluation • Conclusions

  17. Data Development Data AQUAINT Corpus (English News Text) Evaluation Data JLE (Japanese Learners of English corpus) For 167 of the transcribed interviews, totalling 15,637 sentences. Test Set 477 sentences (3.1%) contain subject-verb agreement errors, and 238 (1.5%) contain auxiliary agreement and complementation errors

  18. Data Evaluation Data HKUST (Hong Kong University of Science and Technology) It contains a total of 2556 sentences.

  19. Data Evaluation Metric Accuracy (true neg + true pos) / total number of sentences Recall true pos / (true pos + false neg + inv pos) Detection Precision (true pos + inv pos) / (true pos + inv pos + false pos) Correction Precision true pos / (true pos + false pos + inv pos)

  20. Outline • Introduction • Background • System • Baselines • Data • Evaluation • Conclusions

  21. Evaluation JLE Results for Subject-Verb Agreement Results for Auxiliary Agreement & Complementation

  22. Evaluation HKUST Results for Auxiliary Agreement & Complementation Two native speakers of English were given the edited sentences, as well as the original input. For each pair, they were asked to select one of four statements: one of the two is better, or both are equally correct, or both are equally incorrect. Kappa: 0.76

  23. Evaluation

  24. Outline • Introduction • Background • System • Baselines • Data • Evaluation • Conclusions

  25. Conclusions • This paper proposes a method to correct English verb form • errors made by non-native speakers. • Investigation of the ways the ways in which verb form errors • affect parse trees.

More Related