Topic Detection and Tracking Pilot Study Final Report

<p>Topic Detection and Tracking (TDT) is a DARPA-sponsored initiative to investigate the state of the art in finding and following new events in a stream of broadcast news stories. The TDT problem consists of three major tasks: (1) segmenting a stream of data, especially recognized speech, into distinct stories; (2) identifying those news stories that are the first to discuss a new event occurring in the news; and (3) given a small number of sample news stories about an event, finding all following stories in the stream. The TDT Pilot Study ran from September 1996 through October 1997. The primary participants were DARPA, Carnegie Mellon University, Dragon Systems, and the University of Massachusetts at Amherst. This report summarizes the findings of the pilot study. The TDT work continues in a new project involving larger training and test corpora, more active participants, and a more broadly defined notion of “topic” than was used in the pilot study. </p> <p>The following individuals participated in the research reported. </p> <p>James Allan, UMass </p> <p>Brian Archibald, CMU </p> <p>Doug Beeferman, CMU </p> <p>Adam Berger, CMU </p> <p>Ralf Brown, CMU </p> <p>Jaime Carbonell, CMU </p> <p>Ira Carp, Dragon </p> <p>Bruce Croft, UMass, </p> <p>George Doddington, DARPA </p> <p>Larry Gillick, Dragon </p> <p>Alex Hauptmann, CMU </p> <p>John Lafferty, CMU </p> <p>Victor Lavrenko, UMass </p> <p>Xin Liu, CMU </p> <p>Steve Lowe, Dragon </p> <p>Paul van Mulbregt, Dragon </p> <p>Ron Papka, UMass </p> <p>Thomas Pierce, CMU </p> <p>Jay Ponte, UMass </p> <p>Mike Scudder, UMass </p> <p>Charles Wayne, DARPA </p> <p>Jon Yamron, Dragon </p> <p>Yiming Yang, CMU</p>