Machine Learning Prediction Models for Gestational Diabetes Mellitus: Meta-analysis

Zheqing Zhang; Luqian Yang; Wentao Han; Yaoyu Wu; Linhui Zhang; Chun Gao; Kui Jiang; Yun Liu; Huiqun Wu

doi:10.2196/26634

Journal of Medical Internet Research (Mar 2022)

Machine Learning Prediction Models for Gestational Diabetes Mellitus: Meta-analysis

Zheqing Zhang,
Luqian Yang,
Wentao Han,
Yaoyu Wu,
Linhui Zhang,
Chun Gao,
Kui Jiang,
Yun Liu,
Huiqun Wu

Affiliations

Zheqing Zhang: ORCiD
Luqian Yang: ORCiD
Wentao Han: ORCiD
Yaoyu Wu: ORCiD
Linhui Zhang: ORCiD
Chun Gao: ORCiD
Kui Jiang: ORCiD
Yun Liu: ORCiD
Huiqun Wu: ORCiD

DOI: https://doi.org/10.2196/26634
Journal volume & issue: Vol. 24, no. 3
p. e26634

Abstract

Read online

BackgroundGestational diabetes mellitus (GDM) is a common endocrine metabolic disease, involving a carbohydrate intolerance of variable severity during pregnancy. The incidence of GDM-related complications and adverse pregnancy outcomes has declined, in part, due to early screening. Machine learning (ML) models are increasingly used to identify risk factors and enable the early prediction of GDM. ObjectiveThe aim of this study was to perform a meta-analysis and comparison of published prognostic models for predicting the risk of GDM and identify predictors applicable to the models. MethodsFour reliable electronic databases were searched for studies that developed ML prediction models for GDM in the general population instead of among high-risk groups only. The novel Prediction Model Risk of Bias Assessment Tool (PROBAST) was used to assess the risk of bias of the ML models. The Meta-DiSc software program (version 1.4) was used to perform the meta-analysis and determination of heterogeneity. To limit the influence of heterogeneity, we also performed sensitivity analyses, a meta-regression, and subgroup analysis. ResultsA total of 25 studies that included women older than 18 years without a history of vital disease were analyzed. The pooled area under the receiver operating characteristic curve (AUROC) for ML models predicting GDM was 0.8492; the pooled sensitivity was 0.69 (95% CI 0.68-0.69; P<.001; I2=99.6%) and the pooled specificity was 0.75 (95% CI 0.75-0.75; P<.001; I2=100%). As one of the most commonly employed ML methods, logistic regression achieved an overall pooled AUROC of 0.8151, while non–logistic regression models performed better, with an overall pooled AUROC of 0.8891. Additionally, maternal age, family history of diabetes, BMI, and fasting blood glucose were the four most commonly used features of models established by the various feature selection methods. ConclusionsCompared to current screening strategies, ML methods are attractive for predicting GDM. To expand their use, the importance of quality assessments and unified diagnostic criteria should be further emphasized.

Published in Journal of Medical Internet Research

ISSN: 1438-8871 (Online)
Publisher: JMIR Publications
Country of publisher: Canada
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics; Medicine: Public aspects of medicine
Website: https://www.jmir.org

About the journal