Quantifying Social Biases in NLP: A Generalization and Empirical Comparison of Extrinsic Fairness Metrics

Paula Czarnowska; Yogarshi Vyas; Kashif Shah

doi:10.1162/tacl_a_00425

Transactions of the Association for Computational Linguistics (Jan 2021)

Affiliations

Paula Czarnowska: University of Cambridge, UK
Yogarshi Vyas: Amazon AI, USA
Kashif Shah: Amazon AI, USA

Journal volume & issue: Vol. 9, pp. 1249–1267

Abstract

Measuring bias is key for better understanding and addressing unfairness in NLP/ML models. This is often done via fairness metrics, which quantify the differences in a model’s behaviour across a range of demographic groups. In this work, we shed more light on the differences and similarities between the fairness metrics used in NLP. First, we unify a broad range of existing metrics under three generalized fairness metrics, revealing the connections between them. Next, we carry out an extensive empirical comparison of existing metrics and demonstrate that the observed differences in bias measurement can be systematically explained via differences in parameter choices for our generalized metrics.

Published in Transactions of the Association for Computational Linguistics

ISSN: 2307-387X (Online)
Publisher: The MIT Press
Country of publisher: United States
LCC subjects: Language and Literature: Philology. Linguistics: Computational linguistics. Natural language processing
Website: https://direct.mit.edu/tacl
