140 likes | 271 Views
A Language Modeling Approach to Tracking. Martijn Spitters & Wessel Kraaij TNO - TPD. TDT2000 at NIST, Gaithersburg. Outline. Introduction: Filtering vs. Tracking Tracking Model Experiments: stemming Influence of normalisation steps merging training stories Conclusions.
E N D
A Language Modeling Approach to Tracking Martijn Spitters & Wessel Kraaij TNO - TPD TDT2000 at NIST, Gaithersburg
Outline • Introduction: • Filtering vs. Tracking • Tracking Model • Experiments: • stemming • Influence of normalisation steps • merging training stories • Conclusions
Adaptive filtering vs. tracking • Adaptive filtering: • initial short topic statement, relevance judgements available ‘along the way’ • Individual threshold adaptation • Evaluation: Utility function • Tracking: • 1-4 training stories, no other relevance information • uniform threshold • Evaluation: Normalized detection Cost
Changes in system • Model based on P(S|T) instead of P(Q|D) ==>reversed orientation • Uniform threshold requires careful normalization • Likelihood ratio • Story length normalization • Gaussian normalization
Normalization Story length Gaussian:
Experiment 3: Merging Influence of the unbiased training story merging method on four TDT2 topics
Conclusions • simple language models are effective • Normalisation is important • Stemming is important • Unbiased training story merging procedure seems to help when training story lengths differ substantially
Future plans • Apply EM to learn optimal i • Refine background models • Apply expansion techniques • Unsupervised adaptation