Measuring algorithmic bias to analyze the reliability of AI tools that predict depression risk using smartphone sensed-behavioral data

Daniel A. Adler; Caitlin A. Stamatis; Jonah Meyerhoff; David C. Mohr; Fei Wang; Gabriel J. Aranovich; Srijan Sen; Tanzeem Choudhury

doi:10.1038/s44184-024-00057-y

npj Mental Health Research (Apr 2024)

Measuring algorithmic bias to analyze the reliability of AI tools that predict depression risk using smartphone sensed-behavioral data

Daniel A. Adler,
Caitlin A. Stamatis,
Jonah Meyerhoff,
David C. Mohr,
Fei Wang,
Gabriel J. Aranovich,
Srijan Sen,
Tanzeem Choudhury

Affiliations

Daniel A. Adler: Cornell Tech, Information Science
Caitlin A. Stamatis: Northwestern University Feinberg School of Medicine, Center for Behavioral Intervention Technologies
Jonah Meyerhoff: Northwestern University Feinberg School of Medicine, Center for Behavioral Intervention Technologies
David C. Mohr: Northwestern University Feinberg School of Medicine, Center for Behavioral Intervention Technologies
Fei Wang: Weill Cornell Medicine, Population Health Sciences
Gabriel J. Aranovich: Cornell Tech, Information Science
Srijan Sen: Michigan Medicine, Department of Psychiatry
Tanzeem Choudhury: Cornell Tech, Information Science

DOI: https://doi.org/10.1038/s44184-024-00057-y
Journal volume & issue: Vol. 3, no. 1
pp. 1 – 11

Abstract

Read online

Abstract AI tools intend to transform mental healthcare by providing remote estimates of depression risk using behavioral data collected by sensors embedded in smartphones. While these tools accurately predict elevated depression symptoms in small, homogenous populations, recent studies show that these tools are less accurate in larger, more diverse populations. In this work, we show that accuracy is reduced because sensed-behaviors are unreliable predictors of depression across individuals: sensed-behaviors that predict depression risk are inconsistent across demographic and socioeconomic subgroups. We first identified subgroups where a developed AI tool underperformed by measuring algorithmic bias, where subgroups with depression were incorrectly predicted to be at lower risk than healthier subgroups. We then found inconsistencies between sensed-behaviors predictive of depression across these subgroups. Our findings suggest that researchers developing AI tools predicting mental health from sensed-behaviors should think critically about the generalizability of these tools, and consider tailored solutions for targeted populations.

Published in npj Mental Health Research

ISSN: 2731-4251 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine: Internal medicine: Neurosciences. Biological psychiatry. Neuropsychiatry: Neurology. Diseases of the nervous system: Psychiatry: Therapeutics. Psychotherapy
Website: https://www.nature.com/npjmentalhealth/

About the journal