A Language Modeling Approach to Tracking
140 likes | 275 Vues
This paper presents a detailed analysis of a language modeling approach applied to tracking in Topic Detection and Tracking (TDT) systems, specifically highlighting filtering versus tracking methods. Key experiments explore the impact of stemming, normalization steps, and the merging of training stories on tracking efficiency. Our findings indicate that simple language models can be highly effective, emphasizing the importance of normalization and stemming, and demonstrating that an unbiased merging procedure aids in handling varying training story lengths. Future research will focus on refining models and applying unsupervised adaptation techniques.
A Language Modeling Approach to Tracking
E N D
Presentation Transcript
A Language Modeling Approach to Tracking Martijn Spitters & Wessel Kraaij TNO - TPD TDT2000 at NIST, Gaithersburg
Outline • Introduction: • Filtering vs. Tracking • Tracking Model • Experiments: • stemming • Influence of normalisation steps • merging training stories • Conclusions
Adaptive filtering vs. tracking • Adaptive filtering: • initial short topic statement, relevance judgements available ‘along the way’ • Individual threshold adaptation • Evaluation: Utility function • Tracking: • 1-4 training stories, no other relevance information • uniform threshold • Evaluation: Normalized detection Cost
Changes in system • Model based on P(S|T) instead of P(Q|D) ==>reversed orientation • Uniform threshold requires careful normalization • Likelihood ratio • Story length normalization • Gaussian normalization
Normalization Story length Gaussian:
Experiment 3: Merging Influence of the unbiased training story merging method on four TDT2 topics
Conclusions • simple language models are effective • Normalisation is important • Stemming is important • Unbiased training story merging procedure seems to help when training story lengths differ substantially
Future plans • Apply EM to learn optimal i • Refine background models • Apply expansion techniques • Unsupervised adaptation