Predictive modeling the probability of suffering from metabolic syndrome using machine learning: A population-based study
Xiang Hu,
Xue-Ke Li,
Shiping Wen,
Xingyu Li,
Tian-Shu Zeng,
Jiao-Yue Zhang,
Weiqing Wang,
Yufang Bi,
Qiao Zhang,
Sheng-Hua Tian,
Jie Min,
Ying Wang,
Geng Liu,
Hantao Huang,
Miaomiao Peng,
Jun Zhang,
Chaodong Wu,
Yu-Ming Li,
Hui Sun,
Guang Ning,
Lu-Lu Chen
Affiliations
Xiang Hu
Department of Endocrinology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China; Hubei Provincial Clinical Research Center for Diabetes and Metabolic Disorders, Wuhan, China
Xue-Ke Li
Department of Endocrinology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China; Hubei Provincial Clinical Research Center for Diabetes and Metabolic Disorders, Wuhan, China
Shiping Wen
Centre for Artificial Intelligence, Faculty of Engineering Information Technology, University of Technology Sydney, Ultimo, NSW, 2007, Australia
Xingyu Li
School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu, China
Tian-Shu Zeng
Department of Endocrinology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China; Hubei Provincial Clinical Research Center for Diabetes and Metabolic Disorders, Wuhan, China
Jiao-Yue Zhang
Department of Endocrinology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China; Hubei Provincial Clinical Research Center for Diabetes and Metabolic Disorders, Wuhan, China
Weiqing Wang
Department of Endocrinology and Metabolism, State Key Laboratory of Medical Genomes, National Clinical Research Center for Metabolic Diseases, Shanghai Clinical Center for Endocrine and Metabolic Diseases, Shanghai Institute of Endocrine and Metabolic Diseases, Ruijin Hospital, Shanghai Jiao-Tong University School of Medicine, Shanghai, China
Yufang Bi
Department of Endocrinology and Metabolism, State Key Laboratory of Medical Genomes, National Clinical Research Center for Metabolic Diseases, Shanghai Clinical Center for Endocrine and Metabolic Diseases, Shanghai Institute of Endocrine and Metabolic Diseases, Ruijin Hospital, Shanghai Jiao-Tong University School of Medicine, Shanghai, China
Qiao Zhang
Department of Cardiovascular Surgery, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
Sheng-Hua Tian
Department of Endocrinology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China; Hubei Provincial Clinical Research Center for Diabetes and Metabolic Disorders, Wuhan, China
Jie Min
Department of Endocrinology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China; Hubei Provincial Clinical Research Center for Diabetes and Metabolic Disorders, Wuhan, China
Ying Wang
Department of Endocrinology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China; Hubei Provincial Clinical Research Center for Diabetes and Metabolic Disorders, Wuhan, China
Geng Liu
Department of Endocrinology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China; Hubei Provincial Clinical Research Center for Diabetes and Metabolic Disorders, Wuhan, China
Hantao Huang
Yiling Hospital, Yichang, China
Miaomiao Peng
Department of Endocrinology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China; Hubei Provincial Clinical Research Center for Diabetes and Metabolic Disorders, Wuhan, China
Jun Zhang
Yiling Hospital, Yichang, China
Chaodong Wu
Department of Nutrition and Food Science, Texas A&M University, College Station, TX, USA
Yu-Ming Li
Department of Endocrinology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China; Hubei Provincial Clinical Research Center for Diabetes and Metabolic Disorders, Wuhan, China
Hui Sun
Department of Endocrinology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China; Hubei Provincial Clinical Research Center for Diabetes and Metabolic Disorders, Wuhan, China
Guang Ning
Department of Endocrinology and Metabolism, State Key Laboratory of Medical Genomes, National Clinical Research Center for Metabolic Diseases, Shanghai Clinical Center for Endocrine and Metabolic Diseases, Shanghai Institute of Endocrine and Metabolic Diseases, Ruijin Hospital, Shanghai Jiao-Tong University School of Medicine, Shanghai, China
Lu-Lu Chen
Department of Endocrinology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China; Hubei Provincial Clinical Research Center for Diabetes and Metabolic Disorders, Wuhan, China; Corresponding author.
Background: There is an increasing trend of Metabolic syndrome (MetS) prevalence, which has been considered as an important contributor for cardiovascular disease (CVD), cancers and diabetes. However, there is often a long asymptomatic phase of MetS, resulting in not diagnosed and intervened so timely as needed. It would be very helpful to explore tools to predict the probability of suffering from MetS in daily life or routinely clinical practice. Objective: To develop models that predict individuals’ probability of suffering from MetS timely with high efficacy in general population. Methods: The present study enrolled 8964 individuals aged 40–75 years without severe diseases, which was a part of the REACTION study from October 2011 to February 2012. We developed three prediction models for different scenarios in hospital (Model 1, 2) or at home (Model 3) based on LightGBM (LGBM) technique and corresponding logistic regression (LR) models were also constructed for comparison. Model 1 included variables of laboratory tests, lifestyles and anthropometric measurements while model 2 was built with components of MetS excluded based on model 1, and model 3 was constructed with blood biochemical indexes removed based on model 2. Additionally, we also investigated the strength of association between the predictive factors and MetS, as well as that between the predictors and each component of MetS. Results: In this study, 2714 (30.3%) participants suffer from MetS accordingly. The performances of the LGBM models in predicting the probability of suffering from MetS produced good results and were presented as follows: model 1 had an area under the curve (AUC) value of 0.993 while model 2 indicated an AUC value of 0.885. Model 3 had an AUC value of 0.859, which is close to that of model 2. The AUC values of LR model 1 and 2 for the scenario in hospital and model 3 at home were 0.938, 0.839 and 0.820 respectively, which seemed lower than that of their corresponding machine learning models, respectively. In both LGBM and logistic models, gender, height and resting pulse rate (RPR) were predictors for MetS. Women had higher risk of MetS than men (OR 8.84, CI: 6.70–11.66), and each 1-cm increase in height indicated 3.8% higher risk of suffering from MetS in people over 58 years, whereas each 1- Beat Per Minute (bpm) increase in RPR showed 1.0% higher risk in individuals younger than 62 years. Conclusion: The present study showed that the prediction models developed by machine learning demonstrated effective in evaluating the probability of suffering from MetS, and presented prominent predicting efficacies and accuracies. Additionally, we found that women showed a higher risk of MetS than men, and height in individuals over 58 years was important factor in predicting the probability of suffering from MetS while RPR was of vital importance in people aged 40–62 years.