Extending memory-based machine translation to phrases

M. Van Gompel, A. Van den Bosch, P. Berck

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

Abstract

We present a phrase-based extension to memory-based machine translation. This form of examplebased machine translation employs lazy-learning classifiers to translate fragments of the source sentence to fragments of the target sentence. Source-side fragments consist of variable-length phrases
in a local context of neighboring words, translated by the classifier to a target-language phrase. We
compare three methods of phrase extraction, and present a new decoder that reassembles the translated fragments into one final translation. Results show that one of the proposed phrase-extraction
methods—the one used in Moses—leads to a translation system that outperforms context-sensitive
word-based approaches. The differences, however, are small, arguably because the word-based approaches already capture phrasal context implicitly due to their source-side and target-side context
sensitivity.
Original languageEnglish
Title of host publicationComputational Linguistics in the Netherlands 2010: Selected Papers from the Twentieth CLIN Meeting
PublisherAssociation for Computational Linguistics
Publication statusPublished - 2011
Externally publishedYes

Fingerprint

Dive into the research topics of 'Extending memory-based machine translation to phrases'. Together they form a unique fingerprint.

Cite this