Approximate Factoring for A* Search
350 likes | 375 Vues
This paper explores the use of approximate factoring to improve A* search in NLP tasks. The authors propose a heuristic design that is both tight and admissible, resulting in efficient computation. The paper presents experiments on bitext parsing and lexicalized parsing to demonstrate the effectiveness of the proposed approach.
Approximate Factoring for A* Search
E N D
Presentation Transcript
Approximate Factoring for A* Search Aria Haghighi, John DeNero, and Dan Klein Computer Science Division University of California Berkeley
Inference for NLP Tasks A* Search
Inference as Search Partial Hypothesis a3 a2 a2 y a1
Bitext Parsing as Search S S S S’ NP VP NP VP Target Source translation is hard , la traducción es dificil Weighted Synchronous Grammar Parsing O(n6) Modified CKY over bi-spans (X[i,j],X’[i’,j’])
A* Search y Score So Far Completion Score
A* Search • Heuristic Design • Tight small • Admissible • Efficient to compute This way hypothesis! Optimal Result A* Heuristic Man
A* Example: Bitext Search Bi-Span Viterbi Inside Score Cost So Far
A* Bitext Search Viterbi Outside Score Completion Score Ideal Heuristic O(n6)
Of Stately Projections ¼ S’ S S S’ S S’ NP’ VP’ NP VP S’ S NP’ VP’ NP VP S S’ S S
A* Bitext Search S S S S’ S S’ NP VP NP VP NP VP NP’ VP’ Suppose, Then,
Projection Heuristic O(n6) O(n3) O(n3) Klein and Manning [2003]
When models don’t factorize c(a) y x Át(a) Ás(a) ¼s(y) ¼t(y) ¼t(x) ¼s(x) Pointwise Admissibility
When models don’t factorize y ¼t(y) ¼s(y) Admissibility
Finding Factored Costs How to find Ás and Át? Pointwise Gap
Finding Factored Costs Small gaps
Finding Factored Costs Pointwise Admissibility
Bitext Experiments Synchronous Tree-to-Tree Transducer • Trained on 40k sentences of English-Spanish Europarl [Galley et. al, 2004] • Rare words replaced with POS tags • Tested on 1,200 sent. max length 5-15 Optimization Problem • Solved only once per grammar • 206K Variables • 160KConstraints • 29 minutes
Bitext Experiments Zhang and Gildea (2006)
Bitext Experiments Zhang and Gildea (2006)
Lexicalized Parsing S-(is,VBZ) VP-(is,VBZ) NP-(translation,NN) (is,VBZ) S (translation, NN) NP VP Klein and Manning [2003]
Lexicalized Parsing Too many constraints to efficiently solve! Over 64e13 possible lexicalized rules
Lexicalized Model Experiments Standard Setup • Train on section 2-21 of the treebank • Test on section 23 (length · 40) Models Tested • Factored model [Klein and Manning, 2003] • Non-Factored Model
Lexicalized Parsing Factored Model [Klein and Manning, 2003]
Lexicalized Parsing Non-Factored Model
Conclusions • Generaltechnique for generating A* estimates • Can explicitly control admissibility tightness trade-off • Future Work: Explore different objectives and applications
Thanks http://nlp.cs.berkeley.edu