file.pdf (624.01 kB)
Download file

Chunk-Based EBMT

Download (624.01 kB)
journal contribution
posted on 01.05.2010, 00:00 authored by Jae Dong Kim, Ralf D Brown, Jaime G. Carbonell

Corpus driven machine translation approaches such as Phrase-Based Statistical Machine Translation and Example-Based Machine Translation have been successful by using word alignment to find translation fragments for matched source parts in a bilingual training corpus. However, they still cannot properly deal with systematic translation for insertion or deletion words between two distant languages. In this work, we used syntactic chunks as translation units to alleviate this problem, improve alignments and show improvement in BLEU for Korean to English and Chinese to English translation tasks.

History

Publisher Statement

Copyright 2010 European Association for Machine Translation.

Date

01/05/2010

Usage metrics

Exports