Communications Biology (Feb 2021)

Systematic auditing is essential to debiasing machine learning in biology

  • Fatma-Elzahraa Eid,
  • Haitham A. Elmarakeby,
  • Yujia Alina Chan,
  • Nadine Fornelos,
  • Mahmoud ElHefnawi,
  • Eliezer M. Van Allen,
  • Lenwood S. Heath,
  • Kasper Lage

DOI
https://doi.org/10.1038/s42003-021-01674-5
Journal volume & issue
Vol. 4, no. 1
pp. 1 – 9

Abstract

Read online

Fatma-Elzahraa Eid et al. illustrate a principled approach for identifying biases that can inflate the performance of biological machine learning models. When applied to three biomedical prediction problems, they identify previously unrecognized biases and ultimately show that models are likely to learn primarily from data biases when there is insufficient learnable signal in the data.