Reviews in Cardiovascular Medicine (Aug 2023)
Development, External Validation, and Visualization of Machine Learning Models for Predicting Occurrence of Acute Kidney Injury after Cardiac Surgery
Abstract
Background: Cardiac surgery-associated acute kidney injury (CSA-AKI) is a major complication that results in short- and long-term mortality among patients. Here, we adopted machine learning algorithms to build prediction models with the overarching goal of identifying patients who are at a high risk of such unfavorable kidney outcomes. Methods: A total of 1686 patients (development cohort) and 422 patients (validation cohort), with 126 pre- and intra-operative variables, were recruited from the First Medical Centre and the Sixth Medical Centre of Chinese PLA General Hospital in Beijing, China, respectively. Analyses were performed using six machine learning techniques, namely K-nearest neighbor, logistic regression, decision tree, random forest (RF), support vector machine, and neural network, and the APPROACH score, a previously established risk score for CSA-AKI. For model tuning, optimal hyperparameter was achieved by using GridSearch with 5-fold cross-validation from the scikit-learn library. Model performance was externally assessed via the receiver operating characteristic (ROC) and decision curve analysis (DCA). Explainable machine learning was performed using the Python SHapley Additive exPlanation (SHAP) package and Seaborn library, which allow the calculation of marginal contributory SHAP value. Results: 637 patients (30.2%) developed CSA-AKI within seven days after surgery. In the external validation, the RF classifier exhibited the best performance among the six machine learning techniques, as shown by the ROC curve and DCA, while the traditional APPROACH risk score showed a relatively poor performance. Further analysis found no specific causative factor contributing to the development of CSA-AKI; rather, the development of CSA-AKI appeared to be a complex process resulting from a complex interplay of multiple risk factors. The SHAP summary plot illustrated the positive or negative contribution of RF-top 20 variables and extrapolated risk of developing CSA-AKI at individual levels. The Seaborn library showed the effect of each single feature on the model output of the RF prediction. Conclusions: Efficient machine learning approaches were successfully established to predict patients with a high probability of developing acute kidney injury after cardiac surgery. These findings are expected to help clinicians to optimize treatment strategies and minimize postoperative complications. Clinical Trial Registration: The study protocol was registered at the ClinicalTrials Registration System (https://www.clinicaltrials.gov/, #NCT04966598) on July 26, 2021.
Keywords