1 / 4

Suggestions for Class Projects

Suggestions for Class Projects. Systematic study of phrase table generation heuristics (experimentation) use Moses pipeline, different heuristics, systematic changes of alignment density, alignment quality, corpus size to extract phrase pairs

eliza
Télécharger la présentation

Suggestions for Class Projects

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Suggestions for Class Projects • Systematic study of phrase table generation heuristics (experimentation) • use Moses pipeline, different heuristics, systematic changes of alignment density, alignment quality, corpus size to extract phrase pairs • Comparison of Moses phrase-based and hierarchical decoding (experimentation) • experiment with at least 2 language pairs; analyze the differences • Compare GIZA and Berkeley aligner (experimentation) • compare AER; impact on phrase table; impact on resulting translation • Word alignment for language without word boundaries (experimentation) • can standard work alignment models help to detect word/morpheme boundaries; experiment with simple (e.g. Spanish) and difficult language (e.g Inupiac)

  2. Suggestions for Class Projects • Detect discontinuous phrase pairs (implementation) • analyze dis-contiguous phrases in hand aligned data; implement extension to PESA aligner to detect dis-contiguous phrases • Use Parts-of-Speech features to improve phrase alignment (implementation) • evaluate phrase alignment quality; implement extension to PESA aligner; evaluate impact on translation

  3. Suggestions for Class Projects • Sentence-level Confidence Estimation of MT quality, possibly including identifying poor MT translations • Tuning an MT system to different automatic metrics (including the new METEOR-tuning), and comparing the outcomes • Learning DNT (Do Not Translate) Lists and Incorporating them into Moses • Improving a Baseline Moses MT System by:      - Filtering out bad word-alignments      - Filtering the phrase-table      - Adding decoding features • Building a Hierarchical or Syntax-based MT system for any language-pair

More Related