Scaling and Disagreements: Bias, Noise, and Ambiguity

Alexandra Uma; Dina Almanea; Massimo Poesio

doi:10.3389/frai.2022.818451

Frontiers in Artificial Intelligence (Apr 2022)

Affiliations

Alexandra Uma: Computational Linguistics Lab, School of Electronic Engineering and Computer Science, Queen Mary University of London, London, United Kingdom
Dina Almanea: Computational Linguistics Lab, School of Electronic Engineering and Computer Science, Queen Mary University of London, London, United Kingdom
Massimo Poesio: Computational Linguistics Lab, School of Electronic Engineering and Computer Science, Queen Mary University of London, London, United Kingdom
Massimo Poesio: Digital Environment Research Institute, Queen Mary University of London, London, United Kingdom
Massimo Poesio: Turing Institute, London, United Kingdom

DOI: https://doi.org/10.3389/frai.2022.818451
Journal volume & issue: Vol. 5

Abstract

Crowdsourced data are often rife with disagreement, whether because of genuine item ambiguity, overlapping labels, subjectivity, or annotator error. Hence, a variety of methods have been developed for learning from data containing disagreement. One of the observations emerging from this work is that different methods appear to work best depending on characteristics of the dataset such as the level of noise. In this paper, we investigate the use of temperature scaling, an approach developed to estimate noise, in learning from data containing disagreements. We find that temperature scaling works with data in which the disagreements are the result of label overlap, but not with data in which the disagreements are due to annotator bias, as in subjective tasks such as labeling an item as offensive or not. We also find that disagreements due to ambiguity do not fit perfectly into either category.
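
For readers unfamiliar with the term, temperature scaling in its generic form divides a classifier's logits by a positive scalar T before the softmax, so that T > 1 flattens the predicted distribution and T < 1 sharpens it. The sketch below illustrates only that generic operation. The function name, the example logits, and the temperature values are assumptions made for illustration; this is not the training setup or code used in the paper.

```python
import math


def softmax_with_temperature(logits, temperature=1.0):
    """Return softmax(logits / temperature) as a list of probabilities.

    Hypothetical helper for illustration; not taken from the paper.
    """
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [z / temperature for z in logits]
    # Subtract the maximum before exponentiating for numerical stability.
    peak = max(scaled)
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


# Illustrative logits for a three-class item (assumed values).
logits = [2.0, 1.0, 0.1]
sharp = softmax_with_temperature(logits, temperature=1.0)
soft = softmax_with_temperature(logits, temperature=5.0)
```

With the higher temperature, probability mass moves away from the top class toward the others while the ranking of the classes stays the same, which is why a scaled output can sit closer to a distribution of annotator labels than a sharply peaked prediction would.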

Published in Frontiers in Artificial Intelligence

ISSN: 2624-8212 (Online)
Publisher: Frontiers Media S.A.
Country of publisher: Switzerland
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://www.frontiersin.org/journals/artificial-intelligence#
