Fairness and generalisability in deep learning of retinopathy of prematurity screening algorithms: a literature review

Leo Anthony Celi; Alvina Pauline Dy Santiago; Luis Filipe Nakayama; Lucas Zago Ribeiro; Caio Vinicius Saito Regatieri; Khumbo Kalua; William Greig Mitchell; Warachaya Phanphruk; Robyn Gayle Dychiao; Nilva Simeren Bueno Moraes

doi:10.1136/bmjophth-2022-001216

BMJ Open Ophthalmology (Dec 2023)

Affiliations

Leo Anthony Celi: Division of Pulmonary, Critical Care and Sleep Medicine, Beth Israel Deaconess Medical Center, Boston, MA 02215, USA
Alvina Pauline Dy Santiago: Department of Ophthalmology and Visual Sciences, Philippine General Hospital, Manila, Philippines
Luis Filipe Nakayama: Laboratory for Computational Physiology, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA
Lucas Zago Ribeiro: Department of Ophthalmology, Sao Paulo Federal University, Sao Paulo, Brazil
Caio Vinicius Saito Regatieri: Department of Ophthalmology, Sao Paulo Federal University, Sao Paulo, Brazil
Khumbo Kalua: Department of Ophthalmology, Blantyre Institute for Community Ophthalmology, BICO, Blantyre, Malawi
William Greig Mitchell: Department of Ophthalmology, The Royal Victorian Eye and Ear Hospital, East Melbourne, Victoria, Australia
Warachaya Phanphruk: Department of Ophthalmology, Khon Kaen University, Nai Mueang, Thailand
Robyn Gayle Dychiao: University of the Philippines Manila College of Medicine, Manila, Philippines
Nilva Simeren Bueno Moraes: Department of Ophthalmology, Sao Paulo Federal University, Sao Paulo, Brazil

Journal volume & issue: Vol. 8, no. 1

Abstract

Background: Retinopathy of prematurity (ROP) is a vasoproliferative disease responsible for more than 30 000 blind children worldwide. Its diagnosis and treatment are challenging because of a lack of specialists, poor diagnostic concordance and variation in classification standards. While artificial intelligence (AI) can address the shortage of professionals and provide more cost-effective management, its development requires fairness, generalisability and bias controls before deployment to avoid harmful and unpredictable results. This review aims to compare the characteristics, fairness and generalisability efforts of AI studies in ROP.

Methods: Our review yielded 220 articles, of which 18 were included after full-text assessment. The articles were classified into ROP severity grading, plus disease detection, detection of treatment-requiring ROP, ROP prediction and detection of retinal zones.

Results: All authors and included patients were from middle-income and high-income countries; low-income countries, South America, Australia and Africa were not represented. Code was available in two articles and on request in one, while data were not available in any article. The same retinal camera was used in 88.9% of the studies. Patients' sex was described in two articles, but no article applied bias control in its models.

Conclusion: The reviewed articles included 180 228 images and reported good metrics, but fairness, generalisability and bias control remained limited. Reproducibility is also a critical limitation, with few articles sharing code and none sharing data. Fair and generalisable AI studies in ROP are needed that include diverse datasets, data and code sharing, collaborative research and bias control to avoid unpredictable and harmful deployments.

Published in BMJ Open Ophthalmology

ISSN: 2397-3269 (Online)
Publisher: BMJ Publishing Group
Country of publisher: United Kingdom
LCC subjects: Medicine: Ophthalmology
Website: http://bmjophth.bmj.com/
