Frontiers in Digital Health (Aug 2022)

Machine learning for maternal health: Predicting delivery location in a community health worker program in Zanzibar

  • Alma Fredriksson,
  • Isabel R. Fulcher,
  • Isabel R. Fulcher,
  • Allyson L. Russell,
  • Tracey Li,
  • Yi-Ting Tsai,
  • Samira S. Seif,
  • Rose N. Mpembeni,
  • Bethany Hedt-Gauthier,
  • Bethany Hedt-Gauthier

DOI
https://doi.org/10.3389/fdgth.2022.855236
Journal volume & issue
Vol. 4

Abstract

Read online

BackgroundMaternal and neonatal health outcomes in low- and middle-income countries (LMICs) have improved over the last two decades. However, many pregnant women still deliver at home, which increases the health risks for both the mother and the child. Community health worker programs have been broadly employed in LMICs to connect women to antenatal care and delivery locations. More recently, employment of digital tools in maternal health programs have resulted in better care delivery and served as a routine mode of data collection. Despite the availability of rich, patient-level data within these digital tools, there has been limited utilization of this type of data to inform program delivery in LMICs.MethodsWe use program data from 38,787 women enrolled in Safer Deliveries, a community health worker program in Zanzibar, to build a generalizable prediction model that accurately predicts whether a newly enrolled pregnant woman will deliver in a health facility. We use information collected during the enrollment visit, including demographic data, health characteristics and current pregnancy information. We apply four machine learning methods: logistic regression, LASSO regularized logistic regression, random forest and an artificial neural network; and three sampling techniques to address the imbalanced data: undersampling of facility deliveries, oversampling of home deliveries and addition of synthetic home deliveries using SMOTE.ResultsOur models correctly predicted the delivery location for 68%–77% of the women in the test set, with slightly higher accuracy when predicting facility delivery versus home delivery. A random forest model with a balanced training set created using undersampling of existing facility deliveries accurately identified 74.4% of women delivering at home.ConclusionsThis model can provide a “real-time” prediction of the delivery location for new maternal health program enrollees and may enable early provision of extra support for individuals at risk of not delivering in a health facility, which has potential to improve health outcomes for both mothers and their newborns. The framework presented here is applicable in other contexts and the selection of input features can easily be adapted to match data availability and other outcomes, both within and beyond maternal health.

Keywords