Integration of Language Identification into a Recognition System for Spoken Conversations Containing Code-Switches

Weiner, Jochen; Vu, Ngoc Thang; Telaar, Dominic; Metze, Florian; Schultz, Tanja; Lyu, Dau-Chen; Chng, Eng-Siong; Li, Haizhou

doi:10.1184/R1/6473456.v1

file.pdf (157.14 kB)

Integration of Language Identification into a Recognition System for Spoken Conversations Containing Code-Switches

journal contribution

posted on 2012-05-01, 00:00 authored by Jochen Weiner, Ngoc Thang Vu, Dominic Telaar, Florian MetzeFlorian Metze, Tanja Schultz, Dau-Chen Lyu, Eng-Siong Chng, Haizhou Li

This paper describes the integration of language identification (LID) into a multilingual automatic speech recognition (ASR) system for spoken conversations containing code-switches between Mandarin and English. We apply a multistream approach to combine at frame level the acoustic model score and the language information, where the latter is provided by an LID component. Furthermore, we advance this multistream approach by a new method called “Language Lookahead”, in which the language information of subsequent frames is used to improve accuracy. Both methods are evaluated using a set of controlled LID results with varying frame accuracies. Our results show that both approaches improve the ASR performance by at least 4% relative if the LID achieves a minimum frame accuracy of 85%.

History

Publisher Statement

Date

2012-05-01

Usage metrics

Keywords

code-switching multi-stream combination language lookahead

Licence

In Copyright

Exports

RefWorks

BibTeX

Ref. manager

Endnote

DataCite

NLM

DC

Integration of Language Identification into a Recognition System for Spoken Conversations Containing Code-Switches

History

Publisher Statement

Date

Usage metrics

Categories

Keywords

Licence

Exports