Good, but not always Fair:  An Evaluation of Gender Bias for three Commercial Machine Translation Systems

Silvia Alma Piazzolla; Beatrice Savoldi; Luisa Bentivogli

doi:10.7146/hjlcb.vi63.137553

Hermes (Jan 2024)

Good, but not always Fair: An Evaluation of Gender Bias for three Commercial Machine Translation Systems

Silvia Alma Piazzolla,
Beatrice Savoldi,
Luisa Bentivogli

Affiliations

Silvia Alma Piazzolla: University of Trento
Beatrice Savoldi: Fondazione Bruno Kessler
Luisa Bentivogli: Fondazione Bruno Kessler

DOI: https://doi.org/10.7146/hjlcb.vi63.137553
Journal volume & issue: no. 63

Abstract

Read online

Machine Translation (MT) continues to make significant strides in quality and is increasingly adopted on a larger scale. Consequently, analyses have been redirected to more nuanced aspects, intricate phenomena, as well as potential risks that may arise from the widespread use of MT tools. Along this line, this paper offers a meticulous assessment of three commercial MT systems - Google Translate, DeepL, and Modern MT - with a specific focus on gender translation and bias. For three language pairs (English-Spanish, English-Italian, and English-French), we scrutinize the behavior of such systems at several levels of granularity and on a variety of naturally occurring gender phenomena in translation. Our study takes stock of the current state of online MT tools, by revealing significant discrepancies in the gender translation of the three systems, with each system displaying varying degrees of bias despite their overall translation quality.

Published in Hermes

ISSN: 0904-1699 (Print); 1903-1785 (Online)
Publisher: Aarhus University
Country of publisher: Denmark
LCC subjects: Social Sciences: Commerce: Business: Business communication. Including business report writing, business correspondence
Website: https://tidsskrift.dk/her/index

About the journal

Abstract

Keywords