Carnegie Mellon University
Browse
file.pdf (63.84 kB)

Apply N-Best List Re-Ranking to Acoustic Model Combinations of Boosting Training

Download (63.84 kB)
journal contribution
posted on 2004-06-01, 00:00 authored by Rong Zhang, Alexander RudnickyAlexander Rudnicky

The object function for Boosting training method in acoustic modeling aims to reduce utterance level error rate. This is different from the most commonly used performance metric in speech recognition, word error rate. This paper proposes that the combination of N-best list re-ranking and ROVER can partly address this problem. In particular, model combination is applied to re-ranked hypotheses rather than to the original top-1 hypotheses and carried on word level. Improvement of system performance is observed in our experiments. In addition, we describe and evaluate a new confidence feature that measures the correctness of frame level decoding result.

History

Date

2004-06-01

Usage metrics

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC