Computational Linguistics (Jun 2022)

Survey of Low-Resource Machine Translation

  • Barry Haddow,
  • Rachel Bawden,
  • Antonio Valerio Miceli Barone,
  • Jindřich Helcl,
  • Alexandra Birch

DOI
https://doi.org/10.1162/coli_a_00446
Journal volume & issue
Vol. 48, no. 3

Abstract

Read online

We present a survey covering the state of the art in low-resource machine translation (MT) research. There are currently around 7,000 languages spoken in the world and almost all language pairs lack significant resources for training machine translation models. There has been increasing interest in research addressing the challenge of producing useful translation models when very little translated training data is available. We present a summary of this topical research field and provide a description of the techniques evaluated by researchers in several recent shared tasks in low-resource MT.