Exploring audio semantic concepts for event-based video retrieval

Wang, Yipei; Rawat, Shourabh; Metze, Florian

doi:10.1184/R1/6473366.v1

file.pdf (205.33 kB)

Exploring audio semantic concepts for event-based video retrieval

journal contribution

posted on 2014-05-01, 00:00 authored by Yipei Wang, Shourabh Rawat, Florian MetzeFlorian Metze

The audio semantic concepts (sound events) play important roles in audio-based content analysis. How to capture the semantic information effectively from the complex occurrence pattern of sound events in YouTube quality videos is a challenging problem. This paper presents a novel framework to handle the complex situation for semantic information extraction in real-world videos and evaluate through the NIST multimedia event detection task (MED). We calculate the occurrence confidence matrix of sound events and explore multiple strategies to generate clip-level semantic features from the matrix. We evaluate the performance using TRECVID2011 MED dataset. The proposed method outperforms previous HMM-based system. The late fusion experiment with the low-level features and text feature (ASR) shows that audio semantic concepts capture complementary information in the soundtrack.

History

Publisher Statement

© 2014 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.

Date

2014-05-01

Usage metrics

Keywords

multimedia retrieval audio processing semantic concept

Licence

In Copyright

Exports

RefWorks

BibTeX

Ref. manager

Endnote

DataCite

NLM

DC

Exploring audio semantic concepts for event-based video retrieval

History

Publisher Statement

Date

Usage metrics

Categories

Keywords

Licence

Exports