Context-Based Patterns in Machine Learning Bias and Fairness Metrics: A Sensitive Attributes-Based Approach

Tiago P. Pagano; Rafael B. Loureiro; Fernanda V. N. Lisboa; Gustavo O. R. Cruz; Rodrigo M. Peixoto; Guilherme A. de Sousa Guimarães; Ewerton L. S. Oliveira; Ingrid Winkler; Erick G. Sperandio Nascimento

doi:10.3390/bdcc7010027

Big Data and Cognitive Computing (Jan 2023)

Context-Based Patterns in Machine Learning Bias and Fairness Metrics: A Sensitive Attributes-Based Approach

Tiago P. Pagano,
Rafael B. Loureiro,
Fernanda V. N. Lisboa,
Gustavo O. R. Cruz,
Rodrigo M. Peixoto,
Guilherme A. de Sousa Guimarães,
Ewerton L. S. Oliveira,
Ingrid Winkler,
Erick G. Sperandio Nascimento

Affiliations

Tiago P. Pagano: Computational Modeling Department, SENAI CIMATEC University Center, Salvador 41650-010, BA, Brazil
Rafael B. Loureiro: Computational Modeling Department, SENAI CIMATEC University Center, Salvador 41650-010, BA, Brazil
Fernanda V. N. Lisboa: Computer Engineering, SENAI CIMATEC University Center, Salvador 41650-010, BA, Brazil
Gustavo O. R. Cruz: Computer Engineering, SENAI CIMATEC University Center, Salvador 41650-010, BA, Brazil
Rodrigo M. Peixoto: Software Development Department, Salvador 41650-010, BA, Brazil
Guilherme A. de Sousa Guimarães: Software Development Department, Salvador 41650-010, BA, Brazil
Ewerton L. S. Oliveira: HP Inc. Brazil R&D, Porto Alegre 90619-900, RS, Brazil
Ingrid Winkler: Management and Industrial Technology Department, SENAI CIMATEC University Center, Salvador 41650-010, BA, Brazil
Erick G. Sperandio Nascimento: Computational Modeling Department, SENAI CIMATEC University Center, Salvador 41650-010, BA, Brazil

DOI: https://doi.org/10.3390/bdcc7010027
Journal volume & issue: Vol. 7, no. 1
p. 27

Abstract

Read online

The majority of current approaches for bias and fairness identification or mitigation in machine learning models are applications for a particular issue that fails to account for the connection between the application context and its associated sensitive attributes, which contributes to the recognition of consistent patterns in the application of bias and fairness metrics. This can be used to drive the development of future models, with the sensitive attribute acting as a connecting element to these metrics. Hence, this study aims to analyze patterns in several metrics for identifying bias and fairness, applying the gender-sensitive attribute as a case study, for three different areas of applications in machine learning models: computer vision, natural language processing, and recommendation systems. The gender attribute case study has been used in computer vision, natural language processing, and recommendation systems. The method entailed creating use cases for facial recognition in the FairFace dataset, message toxicity in the Jigsaw dataset, and movie recommendations in the MovieLens100K dataset, then developing models based on the VGG19, BERT, and Wide Deep architectures and evaluating them using the accuracy, precision, recall, and F1-score classification metrics, as well as assessing their outcomes using fourteen fairness metrics. Certain metrics disclosed bias and fairness, while others did not, revealing a consistent pattern for the same sensitive attribute across different application domains, and similarities for the statistical parity, PPR disparity, and error disparity metrics across domains, indicating fairness related to the studied sensitive attribute. Some attributes, on the other hand, did not follow this pattern. As a result, we conclude that the sensitive attribute may play a crucial role in defining the fairness metrics for a specific context.

Published in Big Data and Cognitive Computing

ISSN: 2504-2289 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology
Website: http://www.mdpi.com/journal/BDCC

About the journal

Abstract

Keywords