BMC Public Health (Mar 2025)
Application of the random forest algorithm to predict skilled birth attendance and identify determinants among reproductive-age women in 27 Sub-Saharan African countries; machine learning analysis
Abstract
Abstract Introduction Maternal mortality refers to a mother’s death owing to complications arising from childbirth or pregnancy. This issue is a forefront public health challenge around the globe which is pronounced in low- and middle-income countries, particularly in the sub-Saharan African regions where the burdens remain significantly high. Moreover, this problem is further complicated in developing countries due to limited access to antenatal care and the shortage of skilled birth attendants. So far, considerable improvements in the health status of many populations have been reported in developing countries. Nonetheless, the MDGs to reduce maternal and newborn mortality unmet in many SSA nations. Leveraging machine learning approaches allows us to better understand these constraints and predict skilled birth attendance among reproductive age women, providing actionable insights for policy and intervention. Objective This study aimed to predict skill birth attendance and identify its determinants among reproductive age women in 27 SSA countries using machine learning algorithm. Methods Using data from the Demographic and Health Surveys (2016–2024) across 27 SSA countries, we analyzed responses from 198,707 reproductive age women. The Random Forest classifier, complemented by SHAP for feature interpretability, was employed for prediction and analysis. Data preprocessing included K-nearest neighbor imputation for missing values, SMOTE for handling class imbalance, and Recursive Feature Elimination for feature selection. Model performance was evaluated using metrics such as accuracy, recall, F1 score, and AUC-ROC. Results The Random Forest model demonstrated robust performance, achieving an AUC-ROC of 92%, recall of 96%, accuracy of 92%, precision of 93 and F1 score of 93%. The SHAP analysis identifies key predictors of skilled birth attendance, including facility delivery, maternal education, higher wealth index, urban residence, reduced distance to healthcare facilities, media exposure, and internet use. Conclusion and recommendations The findings highlight the potential of machine learning to identify critical predictors of skilled birth attendance to inform targeted interventions. Addressing socioeconomic and educational disparities, enhancing healthcare access, and implementing tailored cessation programs are crucial to enhance skilled birth attendance in this vulnerable population.
Keywords