CMU TDT Report 12-13 November 2001

CMU TDT Report 12-13 November 2001 The CMU TDT Team: Jaime Carbonell, Yiming Yang, Ralf Brown, Chun Jin, Jian Zhang Language Technologies Institute, CMU

Time Line for TDT Activities • (Re)Start: Summer 2001 • Baseline FSD, Link, Det: Sept 2001 • Evaluation (of baseline): Oct 2001 • New Techniques: Nov 2001 – Onwards • Topic-conditional Novelty • Situated NE’s (all tasks) • Source-conditional interpolated training

Baseline FSD Method • (Unconditional) Dissimilarity with Past • Decision threshold on most-similar story • (Linear) temporal decay • Length-filter (for teasers) • Cosine similarity with standard weights:

FSD Results

Comparative FSD DET Curves

FSD Observations • Cross-site comparable baselines (cost =.7) • Data/labeling issues (from error analysis) • “Events-vs-Topics” issue (e.g. Asia crisis) • A few mislabled stories wreak havoc for FSD • Eager auto-segmentation a problem (misses) • Recommendations for TDT labeling • FSD on true events, or events within topic(s) • Change auto-segmentation optimality criterion ?? • Recommendations for TDT reserachers • Keep working hard on FSD – not cracked yet

New FSD Directions • Topic-conditional models • E.g. “airplane,” “investigation,” “FAA,” “FBI,” “casualties,”  topic, not event • “TWA 800,” “March 12, 1997”  event • First categorize into topic, then use maximally-discriminative terms within topic • Rely on situated named entities • E.g. “Arcan as victim,” “Sharon as peacemaker”

A New Approach to First Story Detection for TDT

Baseline Story-Link Detection • Use same term-weighting and cosine similarity as FSD and detection • Decision Thresholds conditioned on language and source • Lower threshold for cross-language • Lower threshold cross-ASR/newswire • Thresholds trained on development set • 15% improvement over universal threshold

Primary Link

CMU Link

CMU2 Link

CMU Detection Incremental Retrospective Clustering Group-Average in Forward Deferral Window Same cosine similarity and terms weight as FSD

CMU TDT Report 12-13 November 2001

CMU TDT Report 12-13 November 2001

Presentation Transcript

Overview of the TDT 2001 Evaluation and Results

November 1, 2001

12-13 November 2010

Food Safety Training November 12, 13

CMU TEAM-A in TDT 2004 Topic Tracking

CPMT Treasurer’s Report 10 November 2001

FASB 2001 Report FEI CFRI, November 2001

Breakthrough in New Chemical Materials Workshop, 12-13 November, 2001, Italia

CMU

13 June 2001

12 – 13 November 2009

CMU at TDT 2004 — Novelty Detection

CMU TDT Report TIDES PI Meeting 2002

GPM Applications Workshop 12-13.November.2013

CMU - March 1, 2001

Report on Monday Executive Committee Meeting, November 12, 2001

Thesis presentation Yakham NDIAYE November, 13 the 2001

Daily Technical Report:13 November 2018

ESPON SEMINAR Evora, 12-13 november, 2007

GPM Applications Workshop 12-13.November.2013

TDT 4242