Identification and modeling of word fragments in spontaneous speech

Tsvetkov, Yulia; Sheikh, Zaid A.W.; Metze, Florian

doi:10.1184/R1/6473402.v1

file.pdf (109.68 kB)

Identification and modeling of word fragments in spontaneous speech

journal contribution

posted on 2013-05-01, 00:00 authored by Yulia Tsvetkov, Zaid A.W. Sheikh, Florian MetzeFlorian Metze

This paper presents a novel approach to handling disfluencies, word fragments and self-interruption points in Cantonese conversational speech. We train a classifier that exploits lexical and acoustic information to automatically identify disfluencies during training of a speech recognition system on conversational speech, and then use this classifier to augment reference annotations used for acoustic model training. We experiment with approaches to modeling disfluencies in the pronunciation dictionary, and their effect on the polyphonic decision tree clustering. We achieve automatic detection of disfluencies with 88% accuracy, which leads to a reduction in character error rate of 1.9% absolute. While the high baseline error rates are due to the task we are currently working on, we demonstrate that this approach works well on the Switchboard corpus, for which the conversational nature of speech is also a major problem.

History

Publisher Statement

© 2013 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

Date

2013-05-01

Usage metrics

Keywords

speech recognition conversational speech word fragments identification disfluency modeling reference annotation

Licence

In Copyright

Exports

RefWorks

BibTeX

Ref. manager

Endnote

DataCite

NLM

DC

Identification and modeling of word fragments in spontaneous speech

History

Publisher Statement

Date

Usage metrics

Categories

Keywords

Licence

Exports