Prediction of gestational diabetes mellitus in Asian women using machine learning algorithms

Byung Soo Kang; Seon Ui Lee; Subeen Hong; Sae Kyung Choi; Jae Eun Shin; Jeong Ha Wie; Yun Sung Jo; Yeon Hee Kim; Kicheol Kil; Yoo Hyun Chung; Kyunghoon Jung; Hanul Hong; In Yang Park; Hyun Sun Ko

doi:10.1038/s41598-023-39680-8

Scientific Reports (Aug 2023)

Affiliations

Byung Soo Kang: Department of Obstetrics and Gynecology, Seoul St. Mary’s Hospital, College of Medicine, The Catholic University of Korea
Seon Ui Lee: Department of Obstetrics and Gynecology, St. Vincent’s Hospital, College of Medicine, The Catholic University of Korea
Subeen Hong: Department of Obstetrics and Gynecology, Seoul St. Mary’s Hospital, College of Medicine, The Catholic University of Korea
Sae Kyung Choi: Department of Obstetrics and Gynecology, Incheon St. Mary’s Hospital, College of Medicine, The Catholic University of Korea
Jae Eun Shin: Department of Obstetrics and Gynecology, Bucheon St. Mary’s Hospital, College of Medicine, The Catholic University of Korea
Jeong Ha Wie: Department of Obstetrics and Gynecology, Eunpyeong St. Mary’s Hospital, College of Medicine, The Catholic University of Korea
Yun Sung Jo: Department of Obstetrics and Gynecology, St. Vincent’s Hospital, College of Medicine, The Catholic University of Korea
Yeon Hee Kim: Department of Obstetrics and Gynecology, Uijeongbu St. Mary’s Hospital, College of Medicine, The Catholic University of Korea
Kicheol Kil: Department of Obstetrics and Gynecology, Yeouido St. Mary’s Hospital, College of Medicine, The Catholic University of Korea
Yoo Hyun Chung: Department of Obstetrics and Gynecology, Daejeon St. Mary’s Hospital, College of Medicine, The Catholic University of Korea
Kyunghoon Jung: Innerwave Co., Ltd
Hanul Hong: Innerwave Co., Ltd
In Yang Park: Department of Obstetrics and Gynecology, Seoul St. Mary’s Hospital, College of Medicine, The Catholic University of Korea
Hyun Sun Ko: Department of Obstetrics and Gynecology, Seoul St. Mary’s Hospital, College of Medicine, The Catholic University of Korea

Journal volume & issue: Vol. 13, no. 1, pp. 1–10

Abstract

This study developed a machine learning algorithm to predict gestational diabetes mellitus (GDM) using retrospective data from 34,387 pregnancies at multiple centers in South Korea. Variables were collected at four time points: baseline, E0 (up to 10 weeks’ gestation), E1 (11–13 weeks’ gestation) and M1 (14–24 weeks’ gestation). The data set was randomly divided into training and test sets (7:3 ratio) to compare the performance of the light gradient boosting machine (LGBM) and extreme gradient boosting (XGBoost) algorithms using the full set of variables (original). A prediction model built on the whole cohort achieved area under the receiver operating characteristic curve (AUC) and area under the precision-recall curve (AUPR) values of 0.711 and 0.246 at baseline, 0.720 and 0.256 at E0, 0.721 and 0.262 at E1, and 0.804 and 0.442 at M1, respectively. Three models with different variable sets were then compared: [a] variables from clinical guidelines; [b] variables selected by Shapley additive explanations (SHAP) values; and [c] variables selected by the Boruta algorithm. Based on model [c], which used the fewest variables and performed similarly to or better than the other models, simple questionnaires were developed. The combined use of maternal factors and laboratory data could effectively predict individual risk of GDM using a machine learning model.
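For readers unfamiliar with the evaluation terms in the abstract, the following is a minimal sketch of a random 7:3 split and of the two reported metrics, AUC and AUPR. It is not the authors' code: the function names (`split_7_3`, `roc_auc`, `average_precision`) are hypothetical, AUPR is approximated here by step-wise average precision, and the sketch assumes binary labels (1 = GDM, 0 = no GDM) with no tied scores.

```python
import random


def split_7_3(rows, seed=0):
    """Randomly split rows into a 70% training set and a 30% test set."""
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)
    cut = int(round(0.7 * len(shuffled)))
    return shuffled[:cut], shuffled[cut:]


def roc_auc(labels, scores):
    """AUC: probability that a random positive outscores a random negative.

    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def average_precision(labels, scores):
    """Step-wise area under the precision-recall curve.

    Assumes no tied scores: each positive contributes the precision
    observed at its rank when cases are sorted by descending score.
    """
    ranked = sorted(zip(scores, labels), key=lambda pair: -pair[0])
    n_pos = sum(labels)
    true_pos = 0
    total = 0.0
    for rank, (_, label) in enumerate(ranked, start=1):
        if label == 1:
            true_pos += 1
            total += true_pos / rank
    return total / n_pos
```

With the toy inputs `labels = [1, 0, 1, 0]` and `scores = [0.9, 0.8, 0.7, 0.1]`, `roc_auc` returns 0.75 and `average_precision` returns about 0.833. A classifier that ranks at random scores about 0.5 on AUC, whereas its AUPR is roughly the proportion of positive cases, so AUPR values are best read against the prevalence of the outcome in the test set.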

Published in Scientific Reports

ISSN: 2045-2322 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine; Science
Website: https://www.nature.com/srep/
