Explainable machine learning model for predicting the risk of significant liver fibrosis in patients with diabetic retinopathy

Gangfeng Zhu; Na Yang; Qiang Yi; Rui Xu; Liangjian Zheng; Yunlong Zhu; Junyan Li; Jie Che; Cixiang Chen; Zenghong Lu; Li Huang; Yi Xiang; Tianlei Zheng

doi:10.1186/s12911-024-02749-z

BMC Medical Informatics and Decision Making (Nov 2024)

Affiliations

Gangfeng Zhu: The First Clinical Medical College, Gannan Medical University
Na Yang: The Engineering Research Center of Intelligent Theranostics Technology and Instruments, Ministry of Education, School of Biomedical Engineering and Informatics, Nanjing Medical University
Qiang Yi: The First Clinical Medical College, Gannan Medical University
Rui Xu: Department of Rehabilitation Medicine, Affiliated Jinhua Hospital, Zhejiang University School of Medicine
Liangjian Zheng: The First Clinical Medical College, Gannan Medical University
Yunlong Zhu: The First Clinical Medical College, Gannan Medical University
Junyan Li: The First Clinical Medical College, Gannan Medical University
Jie Che: The First Clinical Medical College, Gannan Medical University
Cixiang Chen: The First Clinical Medical College, Gannan Medical University
Zenghong Lu: Department of Oncology, The First Affiliated Hospital, Gannan Medical University
Li Huang: Department of Oncology, The First Affiliated Hospital, Gannan Medical University
Yi Xiang: Department of Oncology, The First Affiliated Hospital, Gannan Medical University
Tianlei Zheng: Artificial Intelligence Unit, Department of Medical Equipment Management, Affiliated Hospital of Xuzhou Medical University

DOI: https://doi.org/10.1186/s12911-024-02749-z
Journal volume & issue: Vol. 24, no. 1
pp. 1 – 14

Abstract

Background: Diabetic retinopathy (DR), a prevalent complication in patients with type 2 diabetes, has attracted increasing attention. Recent studies have explored a plausible association between retinopathy and significant liver fibrosis. This study aimed to develop a machine learning (ML) model, based on comprehensive clinical data, to predict the likelihood of significant liver fibrosis in patients with retinopathy, and to interpret the model using the SHapley Additive exPlanations (SHAP) method.

Methods: The study used data from the National Health and Nutrition Examination Survey 2005–2008 cohort. Liver fibrosis was graded (F0-F4) using the Fibrosis-4 index (FIB-4). The severity of retinopathy was determined from retinal imaging and classified into four grades. A ten-fold cross-validation approach was used when modeling the risk of liver fibrosis. Eight ML methods were applied: Extreme Gradient Boosting, Random Forest, multilayer perceptron, Support Vector Machines, Logistic Regression (LR), Naive Bayes, Decision Tree, and k-nearest neighbors. Model performance was evaluated using metrics such as the area under the curve (AUC). The SHAP method was used to rank feature importance and explain the model's predictions.

Results: The analysis included 5,364 participants, of whom 2,116 (39.45%) had significant liver fibrosis. Following random allocation, 3,754 individuals were assigned to the training set and 1,610 to the validation set. Nine variables were selected for inclusion in the ML model. Among the eight ML models evaluated, the LR model achieved the highest AUC (0.867, 95% CI: 0.855–0.878) and F1 score (0.749, 95% CI: 0.732–0.767). In internal validation, this model remained the best performer, with an AUC of 0.850 and an F1 score of 0.736, surpassing all other ML models. The SHAP method identified the most influential factors through importance ranking.

Conclusion: ML models built from clinical data can identify the risk of significant liver fibrosis in patients with retinopathy and support early intervention.

Practice implications: Improved early detection of liver fibrosis risk in patients with retinopathy enhances clinical intervention outcomes.
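The abstract names the Fibrosis-4 index (FIB-4) as the basis for grading liver fibrosis but does not spell out the formula or the cutoff used. As a minimal sketch, the widely used definition is FIB-4 = (age × AST) / (platelet count × √ALT), with age in years, AST and ALT in U/L, and platelets in 10^9/L. The function and parameter names below are illustrative, not taken from the paper, and the cutoff is left as a caller-supplied argument because the abstract does not state one.

```python
import math


def fib4_index(age_years, ast_u_per_l, alt_u_per_l, platelets_1e9_per_l):
    """Return the Fibrosis-4 index using the standard published formula.

    FIB-4 = (age [years] * AST [U/L]) / (platelets [10^9/L] * sqrt(ALT [U/L]))
    """
    if alt_u_per_l <= 0 or platelets_1e9_per_l <= 0:
        raise ValueError("ALT and platelet count must be positive")
    return (age_years * ast_u_per_l) / (
        platelets_1e9_per_l * math.sqrt(alt_u_per_l)
    )


def meets_cutoff(score, cutoff):
    """Flag a score at or above a caller-supplied cutoff.

    The cutoff is deliberately not hard-coded: the abstract does not say
    which threshold the authors used to define significant fibrosis.
    """
    return score >= cutoff


# Hypothetical patient values, for illustration only.
score = fib4_index(age_years=60, ast_u_per_l=40, alt_u_per_l=25,
                   platelets_1e9_per_l=200)
print(round(score, 2))
```

Keeping the cutoff as an argument makes it easy to compare labelings under different thresholds without changing the scoring function.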

Published in BMC Medical Informatics and Decision Making

ISSN: 1472-6947 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics
Website: http://bmcmedinformdecismak.biomedcentral.com
