Posted on 2010-06-24, 00:00 · Authored by Roni Rosenfeld
This paper introduces lattice-based language models, a new language modeling paradigm. These models construct multi-dimensional hierarchies of partitions and select the most promising partitions to generate the estimated distributions. We discuss a specific two-dimensional lattice and propose two primary features for measuring the usefulness of each node: its training-set history count and the smoothed entropy of its prediction. Smoothing techniques are reviewed, and a generalization of the conventional backoff strategy to multiple dimensions is proposed. Preliminary experiments on the SWITCHBOARD corpus yield a 6.5% perplexity reduction over a word trigram model.
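As a rough illustration of the two node-usefulness features named above, the sketch below computes a node's training-set history count and the entropy of its smoothed prediction. The smoothing scheme here (absolute discounting with parameter `delta`) and the function name `node_features` are illustrative assumptions, not the paper's specific choices; the paper itself reviews several smoothing techniques.

```python
import math
from collections import Counter

def node_features(history_counts, vocab_size, delta=0.5):
    """Score a lattice node by (1) its training-set history count and
    (2) the entropy of its smoothed predicted distribution.

    history_counts: Counter mapping next-word -> count observed after
    this node's history in the training set.

    NOTE: absolute discounting is an assumed stand-in for whatever
    smoothing the model actually uses.
    """
    total = sum(history_counts.values())
    if total == 0:
        # Unseen history: fall back to a uniform prediction (max entropy).
        return 0, math.log2(vocab_size)
    seen = len(history_counts)
    # Mass removed from seen words, spread uniformly over unseen words.
    reserved = delta * seen / total
    unseen = vocab_size - seen
    entropy = 0.0
    for c in history_counts.values():
        p = (c - delta) / total
        if p > 0:
            entropy -= p * math.log2(p)
    if unseen > 0 and reserved > 0:
        p_unseen = reserved / unseen
        entropy -= unseen * p_unseen * math.log2(p_unseen)
    return total, entropy

# Example: a node whose history was seen 4 times in training.
count, ent = node_features(Counter({"yes": 3, "no": 1}), vocab_size=10000)
```

A node with a high history count and low prediction entropy would be a promising partition to keep; a rarely seen node with a near-uniform (high-entropy) prediction is a candidate for backing off to a coarser node in the lattice.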