Scientific Reports (Jun 2024)

Mitigating machine learning bias between high income and low–middle income countries for enhanced model fairness and generalizability

  • Jenny Yang,
  • Lei Clifton,
  • Nguyen Thanh Dung,
  • Nguyen Thanh Phong,
  • Lam Minh Yen,
  • Doan Bui Xuan Thy,
  • Andrew A. S. Soltan,
  • Louise Thwaites,
  • David A. Clifton

DOI
https://doi.org/10.1038/s41598-024-64210-5
Journal volume & issue
Vol. 14, no. 1
pp. 1 – 12

Abstract

Read online

Abstract Collaborative efforts in artificial intelligence (AI) are increasingly common between high-income countries (HICs) and low- to middle-income countries (LMICs). Given the resource limitations often encountered by LMICs, collaboration becomes crucial for pooling resources, expertise, and knowledge. Despite the apparent advantages, ensuring the fairness and equity of these collaborative models is essential, especially considering the distinct differences between LMIC and HIC hospitals. In this study, we show that collaborative AI approaches can lead to divergent performance outcomes across HIC and LMIC settings, particularly in the presence of data imbalances. Through a real-world COVID-19 screening case study, we demonstrate that implementing algorithmic-level bias mitigation methods significantly improves outcome fairness between HIC and LMIC sites while maintaining high diagnostic sensitivity. We compare our results against previous benchmarks, utilizing datasets from four independent United Kingdom Hospitals and one Vietnamese hospital, representing HIC and LMIC settings, respectively.