Investigating the Relationship between Classification Quality and SMT Performance in Discriminative Reordering Models

Arefeh Kazemi; Antonio Toral; Andy Way; Amirhassan Monadjemi; Mohammadali Nematbakhsh

doi:10.3390/e19090340

Entropy (Aug 2017)

Affiliations

Arefeh Kazemi: Department of Computer Engineering, University of Isfahan, Isfahan 81746-73441, Iran
Antonio Toral: Center for Language and Cognition, University of Groningen, Groningen 9712 EK, The Netherlands
Andy Way: ADAPT Centre, School of Computing, Dublin City University, Dublin 9, Ireland
Amirhassan Monadjemi: Department of Computer Engineering, University of Isfahan, Isfahan 81746-73441, Iran
Mohammadali Nematbakhsh: Department of Computer Engineering, University of Isfahan, Isfahan 81746-73441, Iran

DOI: https://doi.org/10.3390/e19090340
Journal volume & issue: Vol. 19, no. 9, p. 340

Abstract

Reordering is one of the most important factors affecting the quality of the output in statistical machine translation (SMT). Many of the approaches proposed to address the reordering problem are discriminative reordering models (DRMs). The core component of a DRM is a classifier that tries to predict the correct word order of the sentence. Unfortunately, the relationship between classification quality and ultimate SMT performance has not been investigated to date. Understanding this relationship will allow researchers to select the classifier that results in the best possible MT quality. It might be assumed that there is a monotonic relationship between classification quality and SMT performance, i.e., that any improvement in classification performance will be reflected in overall SMT quality. In this paper, we show experimentally that this assumption does not always hold: an improvement in classification performance might actually degrade the quality of an SMT system, as measured by automatic MT evaluation metrics. However, we show that if the improvement in classification performance is large enough, we can expect the SMT quality to improve as well. In addition, we show that there is a negative relationship between classification accuracy and SMT performance in imbalanced parallel corpora. For these types of corpora, we provide evidence that, for the evaluation of the classifier, macro-averaged metrics such as macro-averaged F-measure are better suited than accuracy, the metric commonly used to date.
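
The contrast between accuracy and macro-averaged F-measure on imbalanced data can be illustrated with a small sketch. The orientation labels ("monotone", "swap"), the 90/10 class split, and both sets of predictions below are hypothetical toy values chosen for illustration. They are not data or results from the paper, and the sketch says nothing about the resulting SMT quality.

```python
def accuracy(gold, pred):
    """Fraction of instances whose predicted label matches the gold label."""
    return sum(g == p for g, p in zip(gold, pred)) / len(gold)


def macro_f1(gold, pred):
    """Unweighted mean of per-class F1, so every class counts equally."""
    labels = sorted(set(gold) | set(pred))
    scores = []
    for label in labels:
        tp = sum(g == label and p == label for g, p in zip(gold, pred))
        fp = sum(g != label and p == label for g, p in zip(gold, pred))
        fn = sum(g == label and p != label for g, p in zip(gold, pred))
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        if precision + recall:
            scores.append(2 * precision * recall / (precision + recall))
        else:
            scores.append(0.0)
    return sum(scores) / len(scores)


# Hypothetical imbalanced test set: 90 "monotone" and 10 "swap" instances.
gold = ["monotone"] * 90 + ["swap"] * 10

# Classifier A (hypothetical): always predicts the majority class.
majority = ["monotone"] * 100

# Classifier B (hypothetical): finds every "swap" instance,
# at the cost of mislabelling 10 "monotone" instances as "swap".
balanced = ["monotone"] * 80 + ["swap"] * 10 + ["swap"] * 10

for name, pred in [("majority", majority), ("balanced", balanced)]:
    print(name, round(accuracy(gold, pred), 3), round(macro_f1(gold, pred), 3))
```

In this toy setting both classifiers reach the same accuracy, yet only the macro-averaged F-measure separates the one that never predicts the minority class from the one that does.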

Published in Entropy

ISSN: 1099-4300 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Science: Astronomy: Astrophysics; Science: Physics
Website: http://www.mdpi.com/journal/entropy
