Prediction of Emergency Cesarean Section Using Machine Learning Methods: Development and External Validation of a Nationwide Multicenter Dataset in Republic of Korea
Jeong Ha Wie,
Se Jin Lee,
Sae Kyung Choi,
Yun Sung Jo,
Han Sung Hwang,
Mi Hye Park,
Yeon Hee Kim,
Jae Eun Shin,
Ki Cheol Kil,
Su Mi Kim,
Bong Suk Choi,
Hanul Hong,
Hyun-Joo Seol,
Hye-Sung Won,
Hyun Sun Ko,
Sunghun Na
Affiliations
Jeong Ha Wie
Department of Obstetrics and Gynecology, Eunpyeong St. Mary’s Hospital, College of Medicine, The Catholic University of Korea, Seoul 03312, Korea
Se Jin Lee
Department of Obstetrics and Gynecology, Kangwon National University Hospital, Kangwon National University School of Medicine, Chuncheon 24289, Korea
Sae Kyung Choi
Department of Obstetrics and Gynecology, Incheon St. Mary’s Hospital, College of Medicine, The Catholic University of Korea, Seoul 21431, Korea
Yun Sung Jo
Department of Obstetrics and Gynecology, St. Vincent’s Hospital, College of Medicine, The Catholic University of Korea, Seoul 16247, Korea
Han Sung Hwang
Department of Obstetrics and Gynecology, Research Institute of Medical Science, Konkuk University School of Medicine, Seoul 05030, Korea
Mi Hye Park
Department of Obstetrics and Gynecology, Ewha Medical Center, Ewha Medical Institute, Ewha Womans University College of Medicine, Seoul 07804, Korea
Yeon Hee Kim
Department of Obstetrics and Gynecology, Uijeongbu St. Mary’s Hospital, College of Medicine, The Catholic University of Korea, Seoul 11765, Korea
Jae Eun Shin
Department of Obstetrics and Gynecology, Bucheon St. Mary’s Hospital, College of Medicine, The Catholic University of Korea, Seoul 14647, Korea
Ki Cheol Kil
Department of Obstetrics and Gynecology, Yeouido St. Mary’s Hospital, College of Medicine, The Catholic University of Korea, Seoul 07345, Korea
Su Mi Kim
Department of Obstetrics and Gynecology, Daejeon St. Mary’s Hospital, College of Medicine, The Catholic University of Korea, Seoul 34943, Korea
Bong Suk Choi
Innerwave Co., Ltd., Seoul 08510, Korea
Hanul Hong
Innerwave Co., Ltd., Seoul 08510, Korea
Hyun-Joo Seol
Department of Obstetrics and Gynecology, School of Medicine, Kyung Hee University, Seoul 05278, Korea
Hye-Sung Won
Department of Obstetrics and Gynecology, Asan Medical Center, University of Ulsan College of Medicine, Seoul 05505, Korea
Hyun Sun Ko
Department of Obstetrics and Gynecology, Seoul St. Mary’s Hospital, College of Medicine, The Catholic University of Korea, Seoul 06591, Korea
Sunghun Na
Department of Obstetrics and Gynecology, Kangwon National University Hospital, Kangwon National University School of Medicine, Chuncheon 24289, Korea
This study was a multicenter retrospective cohort study of term nulliparous women who underwent labor, and was conducted to develop an automated machine learning model for prediction of emergent cesarean section (CS) before onset of labor. Nine machine learning methods of logistic regression, random forest, Support Vector Machine (SVM), gradient boosting, extreme gradient boosting (XGBoost), light gradient boosting machine (LGBM), k-nearest neighbors (KNN), Voting, and Stacking were applied and compared for prediction of emergent CS during active labor. External validation was performed using a nationwide multicenter dataset for Korean fetal growth. A total of 6549 term nulliparous women was included in the analysis, and the emergent CS rate was 16.1%. The C-statistics values for KNN, Voting, XGBoost, Stacking, gradient boosting, random forest, LGBM, logistic regression, and SVM were 0.6, 0.69, 0.64, 0.59, 0.66, 0.68, 0.68, 0.7, and 0.69, respectively. The logistic regression model showed the best predictive performance with an accuracy of 0.78. The machine learning model identified nine significant variables of maternal age, height, weight at pre-pregnancy, pregnancy-associated hypertension, gestational age, and fetal sonographic findings. The C-statistic value for the logistic regression machine learning model in the external validation set (1391 term nulliparous women) was 0.69, with an overall accuracy of 0.68, a specificity of 0.83, and a sensitivity of 0.41. Machine learning algorithms with clinical and sonographic parameters at near term could be useful tools to predict individual risk of emergent CS during active labor in nulliparous women.