Fairness and bias correction in machine learning for depression prediction across four study populations

Vien Ngoc Dang; Anna Cascarano; Rosa H. Mulder; Charlotte Cecil; Maria A. Zuluaga; Jerónimo Hernández-González; Karim Lekadir

doi:10.1038/s41598-024-58427-7

Scientific Reports (Apr 2024)

Fairness and bias correction in machine learning for depression prediction across four study populations

Vien Ngoc Dang,
Anna Cascarano,
Rosa H. Mulder,
Charlotte Cecil,
Maria A. Zuluaga,
Jerónimo Hernández-González,
Karim Lekadir

Affiliations

Vien Ngoc Dang: Departament de Matemàtiques i Informàtica, Facultat de Matemàtiques i Informàtica, Universitat de Barcelona
Anna Cascarano: Departament de Matemàtiques i Informàtica, Facultat de Matemàtiques i Informàtica, Universitat de Barcelona
Rosa H. Mulder: Department of Child and Adolescent Psychiatry/Psychology, Erasmus MC, University Medical Center Rotterdam
Charlotte Cecil: Department of Child and Adolescent Psychiatry/Psychology, Erasmus MC, University Medical Center Rotterdam
Maria A. Zuluaga: Data Science Department, EURECOM
Jerónimo Hernández-González: Departament d’Informàtica, Matemàtica Aplicada i Estadística, Universitat de Girona
Karim Lekadir: Departament de Matemàtiques i Informàtica, Facultat de Matemàtiques i Informàtica, Universitat de Barcelona

DOI: https://doi.org/10.1038/s41598-024-58427-7
Journal volume & issue: Vol. 14, no. 1
pp. 1 – 12

Abstract

Read online

Abstract A significant level of stigma and inequality exists in mental healthcare, especially in under-served populations. Inequalities are reflected in the data collected for scientific purposes. When not properly accounted for, machine learning (ML) models learned from data can reinforce these structural inequalities or biases. Here, we present a systematic study of bias in ML models designed to predict depression in four different case studies covering different countries and populations. We find that standard ML approaches regularly present biased behaviors. We also show that mitigation techniques, both standard and our own post-hoc method, can be effective in reducing the level of unfair bias. There is no one best ML model for depression prediction that provides equality of outcomes. This emphasizes the importance of analyzing fairness during model selection and transparent reporting about the impact of debiasing interventions. Finally, we also identify positive habits and open challenges that practitioners could follow to enhance fairness in their models.

Published in Scientific Reports

ISSN: 2045-2322 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine; Science
Website: https://www.nature.com/srep/

About the journal

Abstract

Keywords