PeerJ (Nov 2024)
Applying stacking ensemble method to predict chronic kidney disease progression in Chinese population based on laboratory information system: a retrospective study
Abstract
Background and Objective Chronic kidney disease (CKD) is a major public health issue, and accurate prediction of the progression of kidney failure is critical for clinical decision-making and helps improve patient outcomes. As such, we aimed to develop and externally validate a machine-learned model to predict the progression of CKD using common laboratory variables, demographic characteristics, and an electronic health records database. Methods We developed a predictive model using longitudinal clinical data from a single center for Chinese CKD patients. The cohort included 987 patients who were followed up for more than 24 months. Fifty-three laboratory features were considered for inclusion in the model. The primary outcome in our study was an estimated glomerular filtration rate ≤15 mL/min/1.73 m2 or kidney failure. Machine learning algorithms were applied to the modeling dataset (n = 296), and an external dataset (n = 71) was used for model validation. We assessed model discrimination via area under the curve (AUC) values, accuracy, sensitivity, specificity, positive predictive value, negative predictive value, and F1 score. Results Over a median follow-up period of 3.75 years, 148 patients experienced kidney failure. The optimal model was based on stacking different classifier algorithms with six laboratory features, including 24-h urine protein, potassium, glucose, urea, prealbumin and total protein. The model had considerable predictive power, with AUC values of 0.896 and 0.771 in the validation and external datasets, respectively. This model also accurately predicted the progression of renal function in patients over different follow-up periods after their initial assessment. Conclusions A prediction model that leverages routinely collected laboratory features in the Chinese population can accurately identify patients with CKD at high risk of progressing to kidney failure. An online version of the model can be easily and quickly applied in clinical management and treatment.
Keywords