Query-by-Example Spoken Term Detection on Multilingual Unconstrained Speech

Anguera, Xavier; Rodriguez-Fuentes, Luis J.; Szoke, Igor; Buzo, Andi; Metze, Florian; Penagarikano, Mikel

doi:10.1184/R1/6473642.v1

file.pdf (401.78 kB)

Query-by-Example Spoken Term Detection on Multilingual Unconstrained Speech

journal contribution

posted on 2014-09-01, 00:00 authored by Xavier Anguera, Luis J. Rodriguez-Fuentes, Igor Szoke, Andi Buzo, Florian MetzeFlorian Metze, Mikel Penagarikano

As part of the MediaEval 2013 benchmark evaluation campaign, the objective of the Spoken Web Search (SWS) task was to perform Query-by-Example Spoken Term Detection (QbESTD) using audio queries in a low-resource setting. After two successful editions and a continuously growing interest in the scientific community, a special effort was made in SWS 2013 to prepare a challenging database, including speech in 9 different languages with diverse environment and channel conditions. In this paper, first we describe the database and the performance metrics. Then, we briefly review the algorithmic approaches followed by participants and present and discuss the obtained performances, which demonstrate the feasibility of the proposed task, even under such challenging conditions (multiple languages and unconstrained acoustic conditions). Finally, we analyze the fusion of the top-performing systems, which achieved a 30% relative improvement over the best single system in the evaluation, proving that a variety of approaches can be effectively combined to bring complementary information in the search for queries.

History

Date

2014-09-01

Usage metrics

Keywords

benchmark evaluation low-resource languages query-by-example spoken term detection pattern matching

Licence

In Copyright

Exports

RefWorks

BibTeX

Ref. manager

Endnote

DataCite

NLM

DC

Query-by-Example Spoken Term Detection on Multilingual Unconstrained Speech

History

Date

Usage metrics

Categories

Keywords

Licence

Exports