Applied Sciences (Feb 2023)

Efficient Data Preprocessing with Ensemble Machine Learning Technique for the Early Detection of Chronic Kidney Disease

  • Vinoth Kumar Venkatesan,
  • Mahesh Thyluru Ramakrishna,
  • Ivan Izonin,
  • Roman Tkachenko,
  • Myroslav Havryliuk

DOI
https://doi.org/10.3390/app13052885
Journal volume & issue
Vol. 13, no. 5
p. 2885

Abstract

Read online

It is a serious global health concern that chronic kidney disease (CKD) kills millions of people each year as a result of poor lifestyle choices and inherited factors. Effective prediction tools for prior detection are essential due to the growing number of patients with this disease. By utilizing machine learning (ML) approaches, this study aids specialists in studying precautionary measures for CKD through prior detection. The main objective of this paper is to predict and classify chronic kidney disease using ML approaches on a publicly available dataset. The dataset of CKD has been taken from the publicly available and accessible dataset Irvine ML Repository, which included 400 instances. ML methods (Support Vector Machine (SVM), K-Nearest Neighbors (KNN), random forest (RF), Logistic Regression (LR), and Decision Tree (DT) Classifier) are used as base learners and their performance has been compared with eXtreme Gradient Boosting (XGBoost). All ML algorithms are evaluated against different performance parameters: accuracy, recall, precision, and F1-measure. The results indicated that XGBoost outperformed with 98.00% accuracy as compared to other ML algorithms. For policymakers to forecast patterns of CKD in the population, the model put forth in this paper may be helpful. The model may enable careful monitoring of individuals who are at risk, early CKD detection, better resource allocation, and management that is patient-centered.

Keywords