External validation of machine learning models including newborn metabolomic markers for postnatal gestational age estimation in East and South-East Asian infants [version 2; peer review: 1 approved, 3 approved with reservations]

Steven Hawken; Malia S. Q. Murphy; Robin Ducharme; A. Brianne Bota; Lindsay A. Wilson; Wei Cheng; Ma-Am Joy Tumulak; Maria Melanie Liberty Alcausin; Ma Elouisa Reyes; Wenjuan Qiu; Beth K. Potter; Julian Little; Mark Walker; Lin Zhang; Carmencita Padilla; Pranesh Chakraborty; Kumanan Wilson

doi:10.12688/gatesopenres.13131.2

Gates Open Research (Jun 2021)

Affiliations

Steven Hawken: Clinical Epidemiology Program, Ottawa Hospital Research Institute, Ottawa, ON, Canada
Malia S. Q. Murphy: Clinical Epidemiology Program, Ottawa Hospital Research Institute, Ottawa, ON, Canada
Robin Ducharme: Clinical Epidemiology Program, Ottawa Hospital Research Institute, Ottawa, ON, Canada
A. Brianne Bota: Clinical Epidemiology Program, Ottawa Hospital Research Institute, Ottawa, ON, Canada
Lindsay A. Wilson: Clinical Epidemiology Program, Ottawa Hospital Research Institute, Ottawa, ON, Canada
Wei Cheng: Clinical Epidemiology Program, Ottawa Hospital Research Institute, Ottawa, ON, Canada
Ma-Am Joy Tumulak: Newborn Screening Reference Centre, University of the Philippines Manila, Manila, Philippines
Maria Melanie Liberty Alcausin: Newborn Screening Reference Centre, University of the Philippines Manila, Manila, Philippines
Ma Elouisa Reyes: Newborn Screening Reference Centre, University of the Philippines Manila, Manila, Philippines
Wenjuan Qiu: Pediatric Endocrinology and Genetic Metabolism, XinHua Hospital, Shanghai, Shanghai, China
Beth K. Potter: School of Epidemiology and Public Health, University of Ottawa, Ottawa, ON, Canada
Julian Little: School of Epidemiology and Public Health, University of Ottawa, Ottawa, ON, Canada
Mark Walker: Clinical Epidemiology Program, Ottawa Hospital Research Institute, Ottawa, ON, Canada
Lin Zhang: Department of Gynecology and Obstetrics, XinHua Hospital, Shanghai, Shanghai, China
Carmencita Padilla: Institute of Human Genetics, National Institutes of Health, University of the Philippines Manila, Manila, Philippines
Pranesh Chakraborty: Newborn Screening Ontario, Children's Hospital of Eastern Ontario, Ottawa, ON, Canada
Kumanan Wilson: Department of Medicine, University of Ottawa, Ottawa, ON, Canada

DOI: https://doi.org/10.12688/gatesopenres.13131.2
Journal volume & issue: Vol. 4

Abstract

Background: Postnatal gestational age (GA) algorithms derived from newborn metabolic profiles have emerged as a novel method of obtaining population-level preterm birth estimates in low-resource settings. To date, model development and validation have been carried out in North American settings; validation outside of these settings is warranted.

Methods: This was a retrospective database study using data from newborn screening programs in Canada, the Philippines and China. Elastic net machine learning models were developed to estimate GA in a cohort of infants from Canada using sex, birth weight and metabolomic markers from newborn heel prick blood samples. Final models were internally validated in an independent sample of Canadian infants, and externally validated in infant cohorts from the Philippines and China.

Results: Cohorts included 39,666 infants from Canada, 82,909 from the Philippines and 4,448 from China. For the full model including sex, birth weight and metabolomic markers, GA estimates were within ±5 days of ultrasound values in the Canadian internal validation (mean absolute error (MAE) 0.71 weeks, 95% CI: 0.71, 0.72), and within ±6 days of ultrasound GA in both the Filipino (MAE 0.90, 95% CI: 0.90, 0.91) and Chinese (MAE 0.89, 95% CI: 0.86, 0.92) cohorts. Despite the decreased accuracy in external settings, models incorporating metabolomic markers performed better than the baseline model, which relied on sex and birth weight alone. In preterm and growth-restricted infants, the accuracy of metabolomic models was markedly higher than that of the baseline model.

Conclusions: Accuracy of metabolic GA algorithms was attenuated when applied in external settings. Models including metabolomic markers demonstrated higher accuracy than models using sex and birth weight alone. As innovators look to take this work to scale, further investigation of modeling and data normalization techniques will be needed to improve the robustness and generalizability of metabolomic GA estimates in low-resource settings, where this could have the most clinical utility.
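
The accuracy figures in the abstract can be read more easily with a small worked example. The sketch below is illustrative only: it assumes the MAE values are expressed in weeks (0.71 × 7 ≈ 5.0 days, which matches the ±5 days quoted), it uses made-up toy GA values rather than study data, and the percentile bootstrap interval is an assumed method that may differ from how the authors derived their confidence intervals. The function names are hypothetical.

```python
import random
from statistics import mean

DAYS_PER_WEEK = 7


def mean_absolute_error(reference_ga, estimated_ga):
    """MAE between reference (e.g. ultrasound) GA and model-estimated GA.

    Both inputs must use the same unit (weeks in this sketch).
    """
    if len(reference_ga) != len(estimated_ga) or not reference_ga:
        raise ValueError("inputs must be non-empty and of equal length")
    return mean(abs(r - e) for r, e in zip(reference_ga, estimated_ga))


def bootstrap_mae_ci(reference_ga, estimated_ga, n_resamples=2000,
                     alpha=0.05, seed=0):
    """Percentile bootstrap CI for the MAE.

    Assumption: this is one common way to get an interval; the source
    does not state which method the authors used.
    """
    rng = random.Random(seed)
    errors = [abs(r - e) for r, e in zip(reference_ga, estimated_ga)]
    n = len(errors)
    # Resample absolute errors with replacement and recompute the mean.
    resampled = sorted(
        mean(rng.choices(errors, k=n)) for _ in range(n_resamples)
    )
    lower = resampled[int((alpha / 2) * n_resamples)]
    upper = resampled[int((1 - alpha / 2) * n_resamples) - 1]
    return lower, upper


def weeks_to_days(weeks):
    """Convert a duration in weeks to days."""
    return weeks * DAYS_PER_WEEK


# Toy values in weeks, invented for illustration (not study data).
reference = [39.0, 40.1, 36.4, 38.6, 41.0, 34.9]
estimated = [38.5, 40.9, 37.6, 38.0, 40.2, 36.1]

mae_weeks = mean_absolute_error(reference, estimated)
ci_low, ci_high = bootstrap_mae_ci(reference, estimated)
print(f"MAE: {mae_weeks:.2f} weeks ({weeks_to_days(mae_weeks):.1f} days)")
print(f"95% bootstrap CI: {ci_low:.2f} to {ci_high:.2f} weeks")
```

Under the same weeks assumption, the external validation values of 0.90 and 0.89 correspond to roughly 6.3 and 6.2 days, in line with the ±6 days reported for the Filipino and Chinese cohorts.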

Published in Gates Open Research

ISSN: 2572-4754 (Online)
Publisher: F1000 Research Ltd
Country of publisher: United Kingdom
LCC subjects: Medicine
Website: https://gatesopenresearch.org/
