Active Semi-Supervised Learning for Improving Word Alignment

Ambati, Vamshi; Vogel, Stephan; Carbonell, Jaime G.

doi:10.1184/R1/6620918.v1

File(s) stored somewhere else

http://www.cs.cmu.edu/~jgc/publications.html

Please note: Linked content is NOT stored on Carnegie Mellon University and we can't guarantee its availability, quality, security or accept any liability.

Active Semi-Supervised Learning for Improving Word Alignment

journal contribution

posted on 2010-01-01, 00:00 authored by Vamshi Ambati, Stephan Vogel, Jaime G. Carbonell

Word alignment models form an important part of building statistical machine translation systems. Semi-supervised word alignment aims to improve the accuracy of automatic word alignment by incorporating full or partial alignments acquired from humans. Such dedicated elicitation effort is often expensive and depends on availability of bilingual speakers for the language-pair. In this paper we study active learning query strategies to carefully identify highly uncertain or most informative alignment links that are proposed under an unsupervised word alignment model. Manual correction of such informative links can then be applied to create a labeled dataset used by a semi-supervised word alignment model. Our experiments show that using active learning leads to maximal reduction of alignment error rates with reduced human effort.

History

Publisher Statement

Date

2010-01-01

Usage metrics

Keywords

Software Research

Licence

In Copyright

Exports

RefWorks

BibTeX

Ref. manager

Endnote

DataCite

NLM

DC

File(s) stored somewhere else

Active Semi-Supervised Learning for Improving Word Alignment

History

Publisher Statement

Date

Usage metrics

Categories

Keywords

Licence

Exports