Carnegie Mellon University

Voting on N-grams for Machine Translation System Combination

journal contribution
posted on 2010-10-01, 00:00 authored by Kenneth Heafield, Alon Lavie

System combination exploits differences between machine translation systems to form a combined translation from several system outputs. Core to this process are features that reward n-gram matches between a candidate combination and each system output. Systems differ in performance at the n-gram level despite similar overall scores. We therefore advocate a new feature formulation: for each system and each small n, a feature counts n-gram matches between the system and candidate. We show post-evaluation improvement of 6.67 BLEU over the best system on NIST MT09 Arabic-English test data. Compared to a baseline system combination scheme from WMT 2009, we show improvement in the range of 1 BLEU point.
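The feature formulation described in the abstract (one match count per system and per small n) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `ngrams` and `match_features`, the whitespace tokenization, the `max_n` parameter, and the choice to clip counts (so a repeated n-gram is credited at most as many times as it appears in the system output) are all assumptions made for illustration.

```python
from collections import Counter


def ngrams(tokens, n):
    """Return a multiset of the n-grams (as tuples) in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def match_features(candidate, system_outputs, max_n=2):
    """Hypothetical helper: one feature per (system index, n) pair.

    Each feature counts the n-grams shared by the candidate combination
    and that system's output. Clipping via multiset intersection is an
    assumption, not something the abstract specifies.
    """
    cand_tokens = candidate.split()  # assumption: whitespace tokenization
    features = {}
    for sys_idx, output in enumerate(system_outputs):
        sys_tokens = output.split()
        for n in range(1, max_n + 1):
            # Counter & Counter keeps the minimum count of each shared n-gram.
            overlap = ngrams(cand_tokens, n) & ngrams(sys_tokens, n)
            features[(sys_idx, n)] = sum(overlap.values())
    return features


if __name__ == "__main__":
    systems = ["the cat sat on a mat", "a cat is on the mat"]
    print(match_features("the cat sat on the mat", systems, max_n=2))
    # prints {(0, 1): 5, (0, 2): 3, (1, 1): 4, (1, 2): 2}
```

Keeping a separate count for every (system, n) pair is what would let a tuned combination weight one system's unigrams and another system's bigrams differently, which is the motivation the abstract gives for the formulation.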

History

Publisher Statement

Copyright 2010 AMTA

Date

2010-10-01
