file.pdf (305.86 kB)

The CMU-Oxford Translation System for the NIST Open Machine Translation 2012 Evaluation

Download (305.86 kB)
journal contribution
posted on 01.05.2012 by Chris Dyer, Noah A. Smith, Graham Morehead, Phil Blunsom, Abby Levenberg

The core of our system is the hierarchical phrase-based translation model (Chiang, 2007), as implemented by the cdec decoder (Dyer et al., 2010).1 A 4-gram language model estimated using modi- fied Kneser-Ney smoothing was included (Chen and Goodman, 1999). Translation model features include the log relative frequency, log f(e | k), the log counts of k and e, k, the log “lexical translation” probabilities in both directions, indicator features for rule counts of 1. Translation model parameters were tuned using the dynamic programming variant of minimum error rate training for hypergraphs to maximize the bleu score on a held-out development set with a single reference translation (Kumar et al., 2009; Papineni et al., 2002).

History

Date

01/05/2012

Exports

Exports