Posted on 1995-05-01. Authored by Chris Paciorek and Roni Rosenfeld.
Minimum Classification Error (MCE) training is difficult to apply to language modeling due to the inherent scarcity of training data (N-best lists). However, a whole-sentence exponential language model is particularly well suited to MCE training, because it can use a relatively small number of powerful features to capture global sentential phenomena. We review the model, discuss feature induction, find features in both the Broadcast News and Switchboard domains, and build an MCE-trained model for the latter. Our experiments show that even models with relatively few features are prone to overfitting and are sensitive to the initial parameter settings, leading us to examine alternative weight optimization criteria and search algorithms.
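For reference, the model reviewed here takes the whole-sentence exponential form (a sketch of Rosenfeld's formulation; $P_0$ denotes a baseline model, $f_i$ the sentence-level features, $\lambda_i$ the trainable weights, and $Z$ a normalizing constant):

\[
P(s) \;=\; \frac{1}{Z}\, P_0(s)\, \exp\!\Big( \sum_i \lambda_i f_i(s) \Big),
\]

where $s$ ranges over whole sentences and $P_0$ is typically an n-gram model. MCE training then adjusts the weights to minimize a smoothed count of classification errors; a standard formulation (following Juang and Katagiri, and not necessarily the exact variant used in this work) applies a sigmoid loss to a misclassification measure $d$:

\[
\ell(d) \;=\; \frac{1}{1 + e^{-\gamma d}}, \qquad
d \;=\; -\,g_c(x) \;+\; \frac{1}{\eta} \log \Big[ \frac{1}{M-1} \sum_{j \neq c} e^{\eta\, g_j(x)} \Big],
\]

where $g_c$ is the discriminant score of the correct hypothesis, the $g_j$ are the scores of the $M-1$ competing hypotheses (e.g., from an N-best list), and $\gamma, \eta > 0$ are smoothing parameters.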