Description of acoustic variations by hidden Markov models with tree structure

Hayamizu, Satoru; Lee, Kai-Fu; Hon, Hsiao-Wuen

doi:10.1184/R1/6604754.v1

file.pdf (1.07 MB)

Description of acoustic variations by hidden Markov models with tree structure

journal contribution

posted on 2007-12-01, 00:00 authored by Satoru Hayamizu, Kai-Fu Lee, Hsiao-Wuen Hon

Abstract: "This paper provides a description of the acoustic variations of speech and its application to a speech recognition system using hidden Markov models. There are many sources of variabilities that affect the realization of a phoneme: phonetic contexts, speakers, stress, speaking rates and so on. Explicit modeling with these sources of variabilities will give more accurate and more detailed phone models, but even with a large amount of speech data, it is necessary to put some structure to the description for robustness. Tree-based HMMs are discussed as one of such structures.Three case studies are presented: HMMs with large VQ codebook sizes, decision tree clustering and speaker-clustering. They are tested on speaker-independent continuous speech recognition experiments with a 1,000 word vocabulary. Trainability and generalizability are discussed based on the experimental results."

History

Date

2007-12-01

Usage metrics

Keywords

Speech processing systems.Automatic speech recognition.

Licence

In Copyright

Exports

RefWorks

BibTeX

Ref. manager

Endnote

DataCite

NLM

DC

Description of acoustic variations by hidden Markov models with tree structure

History

Date

Usage metrics

Categories

Keywords

Licence

Exports