Reproducible and clinically translatable deep neural networks for cervical screening

Syed Rakin Ahmed; Brian Befano; Andreanne Lemay; Didem Egemen; Ana Cecilia Rodriguez; Sandeep Angara; Kanan Desai; Jose Jeronimo; Sameer Antani; Nicole Campos; Federica Inturrisi; Rebecca Perkins; Aimee Kreimer; Nicolas Wentzensen; Rolando Herrero; Marta del Pino; Wim Quint; Silvia de Sanjose; Mark Schiffman; Jayashree Kalpathy-Cramer

doi:10.1038/s41598-023-48721-1

Scientific Reports (Dec 2023)

Reproducible and clinically translatable deep neural networks for cervical screening

Syed Rakin Ahmed,
Brian Befano,
Andreanne Lemay,
Didem Egemen,
Ana Cecilia Rodriguez,
Sandeep Angara,
Kanan Desai,
Jose Jeronimo,
Sameer Antani,
Nicole Campos,
Federica Inturrisi,
Rebecca Perkins,
Aimee Kreimer,
Nicolas Wentzensen,
Rolando Herrero,
Marta del Pino,
Wim Quint,
Silvia de Sanjose,
Mark Schiffman,
Jayashree Kalpathy-Cramer

Affiliations

Syed Rakin Ahmed: Athinoula A. Martinos Center for Biomedical Imaging, Department of Radiology, Massachusetts General Hospital
Brian Befano: Information Management Services
Andreanne Lemay: Athinoula A. Martinos Center for Biomedical Imaging, Department of Radiology, Massachusetts General Hospital
Didem Egemen: Clinical Epidemiology Unit, Clinical Genetics Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health
Ana Cecilia Rodriguez: Clinical Epidemiology Unit, Clinical Genetics Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health
Sandeep Angara: Computational Health Research Branch, National Library of Medicine, Lister Hill Center
Kanan Desai: Clinical Epidemiology Unit, Clinical Genetics Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health
Jose Jeronimo: Clinical Epidemiology Unit, Clinical Genetics Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health
Sameer Antani: Computational Health Research Branch, National Library of Medicine, Lister Hill Center
Nicole Campos: Department of Health Policy and Management, Harvard T.H. Chan School of Public Health
Federica Inturrisi: Clinical Epidemiology Unit, Clinical Genetics Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health
Rebecca Perkins: Department of Obstetrics & Gynecology, Boston University Chobanian & Avedisian School of Medicine
Aimee Kreimer: Clinical Epidemiology Unit, Clinical Genetics Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health
Nicolas Wentzensen: Clinical Epidemiology Unit, Clinical Genetics Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health
Rolando Herrero: Agencia Costarricense de Investigaciones Biomedicas (ACIB), Fundacion INCIENSA
Marta del Pino: Hospital Clinic
Wim Quint: DDL Diagnostic Laboratory
Silvia de Sanjose: Clinical Epidemiology Unit, Clinical Genetics Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health
Mark Schiffman: Clinical Epidemiology Unit, Clinical Genetics Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health
Jayashree Kalpathy-Cramer: Athinoula A. Martinos Center for Biomedical Imaging, Department of Radiology, Massachusetts General Hospital

DOI: https://doi.org/10.1038/s41598-023-48721-1
Journal volume & issue: Vol. 13, no. 1
pp. 1 – 18

Abstract

Read online

Abstract Cervical cancer is a leading cause of cancer mortality, with approximately 90% of the 250,000 deaths per year occurring in low- and middle-income countries (LMIC). Secondary prevention with cervical screening involves detecting and treating precursor lesions; however, scaling screening efforts in LMIC has been hampered by infrastructure and cost constraints. Recent work has supported the development of an artificial intelligence (AI) pipeline on digital images of the cervix to achieve an accurate and reliable diagnosis of treatable precancerous lesions. In particular, WHO guidelines emphasize visual triage of women testing positive for human papillomavirus (HPV) as the primary screen, and AI could assist in this triage task. In this work, we implemented a comprehensive deep-learning model selection and optimization study on a large, collated, multi-geography, multi-institution, and multi-device dataset of 9462 women (17,013 images). We evaluated relative portability, repeatability, and classification performance. The top performing model, when combined with HPV type, achieved an area under the Receiver Operating Characteristics (ROC) curve (AUC) of 0.89 within our study population of interest, and a limited total extreme misclassification rate of 3.4%, on held-aside test sets. Our model also produced reliable and consistent predictions, achieving a strong quadratic weighted kappa (QWK) of 0.86 and a minimal %2-class disagreement (% 2-Cl. D.) of 0.69%, between image pairs across women. Our work is among the first efforts at designing a robust, repeatable, accurate and clinically translatable deep-learning model for cervical screening.

Published in Scientific Reports

ISSN: 2045-2322 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine; Science
Website: https://www.nature.com/srep/

About the journal