posted on 2001-06-01, 00:00authored byAlexander Hauptmann, Michael G Christel, Wei-Hao Lin, Bryan Maher, Jun Yang, Robert V. Baron, Guang Xiang
For the first time in 2007, TRECVID considered structured
evaluation of automated video summarization, utilizing BBC
rushes video. This paper discusses in detail our approaches for
producing the submitted summaries to TRECVID, including the
two baseline methods. The cluster method performed well in
terms of coverage, and adequately in terms of user satisfaction,
but did take longer to review. We conducted additional
evaluations using the same TRECVID assessment interface to
judge 2 additional methods for summary generation: 25x (simple
speed-up by 25 times), and pz (emphasizing pans and zooms).
Data from 4 human assessors shows significant differences
between the cluster, pz, and 25x approaches. The best coverage
(text inclusion performance) is obtained by 25x, but at the
expense of taking the most time to evaluate and perceived as the
most redundant. Method pz was easier to use than cluster and had
better performance on pan/zoom recall tasks, leading into
discussions on how summaries can be improved with more
knowledge of the anticipated users and tasks.