Improving the Performance of an LVCSR System Through Ensembles of Acoustic Models

Zhang, Rong; Rudnicky, Alexander

doi:10.1184/R1/6606407.v1

file.pdf (49.14 kB)

Improving the Performance of an LVCSR System Through Ensembles of Acoustic Models

journal contribution

posted on 2003-08-01, 00:00 authored by Rong Zhang, Alexander RudnickyAlexander Rudnicky

This paper describes our work on applying ensembles of acoustic models to the problem of large vocabulary continuous speech recognition (LVCSR). We propose three algorithms for constructing ensembles. The first two have their roots in bagging algorithms; however, instead of randomly sampling examples our algorithms construct training sets based on the word error rate. The third one is a boosting style algorithm. Different from other boosting methods which demand large resources for computation and storage, our method present a more efficient solution suitable for acoustic model training. We also investigate a method that seeks optimal combination for models. We report experimental results on a large real world corpus collected from the Carnegie Mellon Communicator dialog system. Significant improvements on system performance are observed in that up to 15.56% relative reduction on word error rate is achieved.

History

Date

2003-08-01

Usage metrics

Keywords

computer sciences

Licence

In Copyright

Exports

RefWorks

BibTeX

Ref. manager

Endnote

DataCite

NLM

DC

Improving the Performance of an LVCSR System Through Ensembles of Acoustic Models

History

Date

Usage metrics

Categories

Keywords

Licence

Exports