Applying SPHINX-II to the DARPA Wall Street Journal CSR Task

Alleva, F; Hon, H; Huang, M; Rosenfeld, Ronald; Weide, R

doi:10.1184/R1/21709916.v1

Applying SPHINX-II to the DARPA Wall Street Journal CSR Task

conference contribution

posted on 2023-03-01, 18:55 authored by F Alleva, H Hon, M Huang, Ronald RosenfeldRonald Rosenfeld, R Weide

This paper reports recent efforts to apply the speaker-independent SPHINX-II system to the DARPA Wall Street Journal continuous speech recognition task. In SPHINX-II, we incorporated additional dynamic and speaker-normalized features, replaced discrete models with sex-dependent semi-continuous hidden Markov models, augmented within-word triphones with between-word triphones, and extended generalized triphone models to shared-distribution models. The configuration of SPHINX-II being used for this task includes sex-dependent, semi-continuous, shared-distribution hidden Markov models and left context dependent between-word triphones. In applying our technology to this task we addressed issues that were not previously of concern owing to the (relatively) small size of the Resource Management task.

History

Publisher Statement

F. Alleva, H. Hon, X. Huang, M. Hwang, R. Rosenfeld, and R. Weide. 1992. Applying SPHINX-II to the DARPA Wall Street Journal CSR task. In Proceedings of the workshop on Speech and Natural Language (HLT '91). Association for Computational Linguistics, USA, 393–398.

Date

1992-02-01

Usage metrics

Keywords

speech recognition SPHINX-II Computer-Human Interaction

Licence

In Copyright

Applying SPHINX-II to the DARPA Wall Street Journal CSR Task

History

Publisher Statement

Date

Usage metrics

Categories

Keywords

Licence

Exports