Similarity-based Curriculum Learning for Multilingual Neural Machine Translation

YU Dong, XIE Wan-ying, GU Shu-hao, FENG Yang

doi:10.11896/jsjkx.210800254

Jisuanji kexue (Jan 2022)

Similarity-based Curriculum Learning for Multilingual Neural Machine Translation

YU Dong, XIE Wan-ying, GU Shu-hao, FENG Yang

Affiliations

YU Dong, XIE Wan-ying, GU Shu-hao, FENG Yang: 1 College of Information Sciences,Beijing Language and Culture University,Beijing 100083,China<br/>2 Institute of Computing Technology,Chinese Academy of Sciences,Beijing 100190,China<br/>3 University of Chinese Academy of Sciences,Beijing 100049,China

DOI: https://doi.org/10.11896/jsjkx.210800254
Journal volume & issue: Vol. 49, no. 1
pp. 24 – 30

Abstract

Read online

Multilingual neural machine translation (MNMT) with a single model has drawn more attention due to its capability to deal with multiple languages.However,the current multilingual translation paradigm does not make use of the similar features embodied in different languages,which has already been proven useful for improving the multilingual translation.Besides,the training of multilingual model is usually very time-consuming due to the huge amount of training data.To address these problems,we propose a similarity-based curriculum learning method to improve the overall performance and convergence speed.We propose two hierarchical criteria for measuring the similarity,one is for ranking different languages (inter-language) with singular vector canonical correlation analysis,and the other is for ranking different sentences in a particular language (intra-language) with cosine similarity.At the same time,the paper proposes a curriculum learning strategy that takes the loss of validation set as the curriculum replacement standard.We conduct experiments on balanced and unbalanced IWSLT multilingual data sets and Europarl corpus datasets.The results demonstrate that the proposed method outperforms strong multilingual translation systems and can achieve up to a 64% decrease in training time.

machine translation|multilingual|curriculum learning|similarity evaluation|language ranking|sentence ranking

Published in Jisuanji kexue

ISSN: 1002-137X (Print)
Publisher: Editorial office of Computer Science
Country of publisher: China
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science: Computer software; Technology: Technology (General)
Website: http://www.jsjkx.com/CN/1002-137X/home.shtml

About the journal

Abstract

Keywords