Annotating News Video with Locations

Yang, Jun; Hauptmann, Alexander

doi:10.1184/R1/6603521.v1

file.pdf (1.28 MB)

Annotating News Video with Locations

journal contribution

posted on 2007-10-01, 00:00 authored by Jun Yang, Alexander Hauptmann

The location of video scenes is an important semantic descriptor especially for broadcast news video. In this paper, we propose a learning-based approach to annotate shots of news video with locations extracted from video transcript, based on features from multiple video modalities including syntactic structure of transcript sentences, speaker identity, temporal video structure, and so on. Machine learning algorithms are adopted to combine multi-modal features to solve two sub-problems: (1) whether the location of a video shot is mentioned in the transcript, and if so, (2) among many locations in the transcript, which are correct one(s) for this shot. Experiments on TRECVID dataset demonstrate that our approach achieves approximately 85% accuracy in correctly labeling the location of any shot in news video.

History

Date

2007-10-01

Usage metrics

Keywords

computer sciences

Licence

In Copyright

Exports

RefWorks

BibTeX

Ref. manager

Endnote

DataCite

NLM

DC

Annotating News Video with Locations

History

Date

Usage metrics

Categories

Keywords

Licence

Exports