Language Modeling with Power Low Rank Ensembles

P. Parikh, Ankur; Saluja, Avneesh; Dyer, Chris; P Xing, Eric

doi:10.1184/R1/6475838.v1

Language Modeling with Power Low Rank Ensembles

journal contribution

posted on 2014-10-01, 00:00 authored by Ankur P. Parikh, Avneesh Saluja, Chris Dyer, Eric P Xing

We present power low rank ensembles (PLRE), a flexible framework for n-gram language modeling where ensembles of low rank matrices and tensors are used to obtain smoothed probability estimates of words in context. Our method can be understood as a generalization of ngram modeling to non-integer n, and includes standard techniques such as absolute discounting and Kneser-Ney smoothing as special cases. PLRE training is efficient and our approach outperforms state-of-the-art modified Kneser Ney baselines in terms of perplexity on large corpora as well as on BLEU score in a downstream machine translation task.

History

Date

2014-10-01

Usage metrics

Keywords

Machine Learning Knowledge Representation and Machine Learning

Licence

In Copyright

Language Modeling with Power Low Rank Ensembles

History

Date

Usage metrics

Categories

Keywords

Licence

Exports