IJCoL (Jun 2019)

Multi-source Transformer for Automatic Post-Editing of Machine Translation Output

  • Amirhossein Tebbifakhr,
  • Matteo Negri,
  • Marco Turchi

DOI: https://doi.org/10.4000/ijcol.464
Journal volume & issue: Vol. 5, no. 1, pp. 89–103

Abstract

Automatic post-editing (APE) of machine translation (MT) is the task of automatically fixing errors in a machine-translated text by learning from human corrections. Recent APE approaches have shown that the best results are obtained by neural multi-source models that correct the raw MT output by also considering information from the corresponding source sentence. In this paper, we pursue this direction by exploiting the Transformer (Vaswani et al. 2017), the state-of-the-art architecture in MT. Our approach presents several advantages over previous APE solutions, both from a performance perspective and from an industrial deployment standpoint. Indeed, besides competitive results, our Transformer-based architecture is faster to train (thanks to parallelization) and easier to maintain (thanks to its reliance on a single model rather than a complex, multi-component architecture). These advantages make our approach particularly appealing for the industrial sector, where scalability and cost-efficiency are important factors, complementary to pure performance. Besides introducing a novel architecture, we also test the common assumption that training neural APE systems with more data always results in stronger models. In this respect, we show that the assumption does not always hold, and that fine-tuning the system only on small in-domain data can yield higher performance. Furthermore, we explore different strategies to better exploit the in-domain data. In particular, we adapt reinforcement learning (RL) techniques to optimize our models against task-specific metrics (i.e. BLEU and TER) in addition to maximum likelihood. Our experiments show that, on its own, the multi-source approach achieves slight improvements over a competitive APE system based on a recurrent neural network architecture. Further gains are obtained by the full-fledged system, fine-tuned on in-domain data and enhanced with RL optimization techniques. Our best results (obtained with a single multi-source model) significantly improve on the performance of the best (and much more complex) system submitted to the WMT 2017 APE shared task.
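
To make the multi-source idea concrete, the sketch below (in PyTorch, assumed here; it is not the authors' implementation) encodes the source sentence and the raw MT output with two separate Transformer encoders and lets the decoder attend to both when generating the post-edited translation. The module name, hyper-parameters, and the simple concatenation of encoder memories are illustrative assumptions; the exact multi-source attention scheme described in the paper may differ.

    # Illustrative multi-source Transformer for APE (PyTorch; not the paper's code).
    # Two encoders (source sentence, raw MT output); the decoder attends to the
    # concatenation of their outputs to produce the post-edited sentence.
    # Positional encodings are omitted for brevity.
    import torch
    import torch.nn as nn

    class MultiSourceTransformerAPE(nn.Module):
        def __init__(self, vocab_size, d_model=512, nhead=8, num_layers=6):
            super().__init__()
            self.embed = nn.Embedding(vocab_size, d_model)
            self.src_encoder = nn.TransformerEncoder(
                nn.TransformerEncoderLayer(d_model, nhead, batch_first=True), num_layers)
            self.mt_encoder = nn.TransformerEncoder(
                nn.TransformerEncoderLayer(d_model, nhead, batch_first=True), num_layers)
            self.decoder = nn.TransformerDecoder(
                nn.TransformerDecoderLayer(d_model, nhead, batch_first=True), num_layers)
            self.out_proj = nn.Linear(d_model, vocab_size)

        def forward(self, src_ids, mt_ids, pe_ids):
            # Encode the source sentence and the machine-translated sentence separately.
            src_mem = self.src_encoder(self.embed(src_ids))
            mt_mem = self.mt_encoder(self.embed(mt_ids))
            # Concatenate the two memories along the time axis so the decoder's
            # cross-attention can use both when predicting the post-edited tokens.
            memory = torch.cat([src_mem, mt_mem], dim=1)
            # Causal mask so each target position only attends to earlier positions.
            seq_len = pe_ids.size(1)
            tgt_mask = torch.triu(
                torch.full((seq_len, seq_len), float("-inf")), diagonal=1)
            dec_out = self.decoder(self.embed(pe_ids), memory, tgt_mask=tgt_mask)
            return self.out_proj(dec_out)

In the fine-tuning step described in the abstract, a model of this kind would be further optimized with RL techniques using sentence-level BLEU/TER rewards on in-domain data, rather than with maximum likelihood alone.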