Improving Non-Autoregressive Machine Translation Using Sentence-Level Semantic Agreement

Shuheng Wang; Heyan Huang; Shumin Shi

doi:10.3390/app12105003

Applied Sciences (May 2022)

Affiliations

Shuheng Wang: School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing 210094, China
Heyan Huang: School of Computer Science and Technology, Beijing Institute of Technology, Beijing 100811, China
Shumin Shi: School of Computer Science and Technology, Beijing Institute of Technology, Beijing 100811, China

Journal volume & issue: Vol. 12, no. 10, p. 5003

Abstract

The inference stage can be accelerated significantly using a Non-Autoregressive Transformer (NAT). However, the training objective used in the NAT model also aims to minimize the loss between the generated words and the golden words in the reference. Because the dependencies between the target words are not modeled, this word-level training objective can easily cause semantic inconsistency between the generated and source sentences. To alleviate this issue, we propose a new method, Sentence-Level Semantic Agreement (SLSA), to obtain consistency between the source and generated sentences. Specifically, we utilize contrastive learning to pull the sentence representations of the source and generated sentences closer together. In addition, to strengthen the capability of the encoder, we also integrate an agreement module into the encoder to obtain a better representation of the source sentence. The experiments are conducted on three translation datasets: the WMT 2014 EN → DE task, the WMT 2016 EN → RO task, and the IWSLT 2014 DE → EN task. The improvement in the NAT model's performance demonstrates the effectiveness of the proposed method.
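
The idea of pulling source and generated sentence representations together with contrastive learning can be illustrated with a minimal sketch. The abstract does not give the exact loss, so the code below assumes a generic InfoNCE-style objective with mean-pooled sentence vectors, cosine similarity, and in-batch negatives; the function names (`mean_pool`, `cosine`, `contrastive_loss`) and the temperature value are hypothetical and are not taken from the paper.

```python
import math


def mean_pool(token_vectors):
    """Average a list of token vectors into one sentence vector (assumed pooling)."""
    n = len(token_vectors)
    dim = len(token_vectors[0])
    return [sum(tok[d] for tok in token_vectors) / n for d in range(dim)]


def cosine(u, v):
    """Cosine similarity between two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def contrastive_loss(src_vecs, gen_vecs, temperature=0.1):
    """InfoNCE-style loss (an assumption, not the paper's stated formula).

    src_vecs[i] and gen_vecs[i] form the positive pair; every other
    generated sentence in the batch serves as a negative for source i.
    """
    total = 0.0
    for i, src in enumerate(src_vecs):
        logits = [cosine(src, gen) / temperature for gen in gen_vecs]
        # Numerically stable log-sum-exp over all candidates.
        m = max(logits)
        log_denominator = m + math.log(sum(math.exp(x - m) for x in logits))
        # Negative log-probability of picking the matching sentence.
        total += log_denominator - logits[i]
    return total / len(src_vecs)
```

Under these assumptions, a batch in which each generated sentence vector lies close to its own source vector yields a lower loss than a batch in which the pairs are mismatched, which is the behavior a sentence-level agreement term is meant to reward.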

Published in Applied Sciences

ISSN: 2076-3417 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Engineering (General). Civil engineering (General); Science: Biology (General); Science: Physics; Science: Chemistry
Website: http://www.mdpi.com/journal/applsci
