posted on 2008-06-01, 00:00authored byMark Derthik, Michael G Christel, Alexander Hauptmann, Dorbin Ng, Scott Stevens, Howard WactlarHoward Wactlar
CMU’s Informedia project has collected and automatically processed a multi-terabyte video corpus
containing 8 years of CNN broadcasts and other video sources [5]. Previous work has demonstrated
multi-modal querying by text, image, time, and location, and the ability to summarize a single document
or a set of documents matching a query. We now plan to organize the corpus or a subset along multiple
dimensions, or perspectives, adding relevant background material, significantly expanding and
accelerating the viewer’s comprehension and integration of knowledge. A perspective can provide factual
background information, a history of an issue, the view of a biased source, a technical or medical
perspective, or any of dozens of others. This abstract proposes a cityscape metaphor for organizing visual
context in terms of perspectives.