Multilingual Tracking Models: Unsupervised & Supervised Techniques at UMASS-Amherst TDT (2004)

UMASS-Amherst at TDT 2004 Unsupervised and SupervisedTracking Hema Raghavan

Outline • Create a training corpus • Unsupervised tracking • Supervised Tracking • Discussion

Creating a training corpus • For Tracking • 50% topics are English • 50% are multilingual • Created a training corpus (supervised and unsupervised) • 30 topics from TDT4 • 50% stories with primarily English topics. • 50% multilingual stories

Unsupervised Tracking Ideas Ideas • Models • Vector Space • Relevance Models • Adaptation • Native Language comparisons

Unsupervised Tracking Models • Vector Space • TF-IDF • IDF is incremental • Relevance Models • State of the art, high performance system • Adaptation

Native Language Hypothesis • TDT tasks involve comparisons of models: • Story link detection: sim(Si, Sj) • Topic tracking: sim(Si, Tj) • It is more effective to measure similarity between models in the original language of the stories, than after machine translation into English • Quality of translation • Differences in score distributions • Trivially obvious? Hard to demonstrate in tracking

Topic tracking with Native Models [SIGIR 2004]

Unsupervised Tracking Results(training set: nwt+TDT4)

Submitted Runs • TF-IDF (UMASS4) • TF-IDF + adaptation (UMASS1) • TF-IDF + adaptation + native models (UMASS2) • Relevance Models + adaptation (UMASS5) • All submissions for primary evaluation condition.

Unsupervised Tracking Results

Supervised Tracking • Creating a newswire only training corpus. • Ideas • Models • Vector Space • Relevance Models • Native Language comparisons • Incremental Thresholds • Negative Feedback

Incremental Thresholds • Utility • Relevance judgments for both Hits and False-Alarms • Increment the YES/NO threshold by when Utility falls below zero.

Negative Feedback • Relevance judgments for both Hits and False-Alarms • for a hit. • for a false alarm.

From Unsupervised to Supervised

Native Language Comparisons

Submitted Runs • Rel. Models (UMASS-2) • Optimized for TDT cost • Rel. Models + Inc. Thresholds (UMASS-1) • TF-IDF + adaptation + neg. feedback + inc thresholds (UMASS-3) • TF-IDF + adaptation + native models (UMASS-4) • TF-IDF + adaptation + native models + neg feedback + increase thresh. (UMASS-7) Optimized for T11SU

Supervised Tracking Results Cost: 0.0467

Results and Discussion • Supervision clearly helps. • Relevance models – a clear winner. • Negative Feedback helps. • Training set did not reflect test very well. • Min-cost versus T11SU

Future Work • Exploration Exploitation trade-off. • What about feedback that is less on demand? • more realistic • Can add costs for judgments. • What about feedback like in the HARD task – Clarification forms?

Multilingual Tracking Models: Unsupervised & Supervised Techniques at UMASS-Amherst TDT (2004)

Multilingual Tracking Models: Unsupervised & Supervised Techniques at UMASS-Amherst TDT (2004)

Presentation Transcript

Algorithms for Distributed Supervised and Unsupervised Learning

On the Power of Ensemble: Supervised and Unsupervised Methods Reconciled*

Supervised and unsupervised wrapper generation

Unsupervised and Weakly-Supervised Probabilistic Modeling of Text

Supervised learning vs. unsupervised learning

Supervised and Unsupervised learning for Natural language processing

Scalable Methods for Graph-Based Unsupervised and Semi-Supervised Learning

Stochastic k- Neighborhood Selection for Supervised and Unsupervised Learning

Lab 5 Unsupervised and supervised clustering

CMUDIR group: TDT Supervised Tracking

Graph-based Consensus Maximization among Multiple Supervised and Unsupervised Models

Unsupervised and Semi-Supervised Learning of Tone and Pitch Accent

Classification Supervised and unsupervised

Supervised and unsupervised methods for large scale genomic data integration

Unsupervised and weakly-supervised discovery of events in video (and audio)

Unsupervised Word Sense Disambiguation Rivaling Supervised Methods

Objectives: Adaptation Resources: RS: Unsupervised vs. Supervised

Graph-based Consensus Maximization among Multiple Supervised and Unsupervised Models

FAUST Classifiers FAUST = Fast, Analytic, Unsupervised and Supervised Technology

Supervised and Unsupervised MFA learning Self-organization Classification