Posted on 2012-05-01. Authored by Chris Dyer, Noah A. Smith, Graham Morehead, Phil Blunsom, Abby Levenberg.
The core of our system is the hierarchical phrase-based translation model (Chiang, 2007), as implemented by the cdec decoder (Dyer et al., 2010). A 4-gram language model estimated using modified Kneser-Ney smoothing was included (Chen and Goodman, 1999). Translation model features include the log relative frequency log f(e | k), the log counts of k and of the pair (e, k), the log "lexical translation" probabilities in both directions, and an indicator feature for rules with a count of 1. Translation model parameters were tuned using the dynamic programming variant of minimum error rate training for hypergraphs to maximize the BLEU score on a held-out development set with a single reference translation (Kumar et al., 2009; Papineni et al., 2002).
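The count-based translation model features described above can be sketched as follows. This is an illustrative reconstruction, not the paper's actual feature extraction code: the rule table, feature names, and counts are invented for the example, and the feature set is simplified (the lexical translation probabilities, which require word-alignment statistics, are omitted).

```python
import math
from collections import defaultdict

# Hypothetical rule counts (source phrase, target phrase) -> count.
# These numbers are made up for illustration only.
rule_counts = {
    ("maison", "house"): 9,
    ("maison", "home"): 3,
    ("bleu", "blue"): 1,
}

# Marginal count of each source phrase k, summed over its translations.
src_counts = defaultdict(int)
for (src, tgt), c in rule_counts.items():
    src_counts[src] += c

def rule_features(src, tgt):
    """Count-derived features for one rule, in log space."""
    c = rule_counts[(src, tgt)]
    return {
        # log relative frequency log f(e | k) = log(count(e, k) / count(k))
        "LogRelFreq": math.log(c / src_counts[src]),
        # log counts of the source phrase k and of the pair (e, k)
        "LogSrcCount": math.log(src_counts[src]),
        "LogRuleCount": math.log(c),
        # indicator feature for rules seen exactly once
        "Singleton": 1.0 if c == 1 else 0.0,
    }
```

In a linear model such as cdec's, the decoder scores each derivation as the dot product of these feature values with weights, which MERT then tunes against BLEU on the development set.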