Journal of Engineering and Applied Science (Oct 2023)

Early prediction of heart disease with data analysis using supervised learning with stochastic gradient boosting

  • Anil Pandurang Jawalkar,
  • Pandla Swetcha,
  • Nuka Manasvi,
  • Pakki Sreekala,
  • Samudrala Aishwarya,
  • Potru Kanaka Durga Bhavani,
  • Pendem Anjani

DOI
https://doi.org/10.1186/s44147-023-00280-y
Journal volume & issue
Vol. 70, no. 1
pp. 1 – 18

Abstract

Read online

Abstract Heart diseases are consistently ranked among the top causes of mortality on a global scale. Early detection and accurate heart disease prediction can help effectively manage and prevent the disease. However, the traditional methods have failed to improve heart disease classification performance. So, this article proposes a machine learning approach for heart disease prediction (HDP) using a decision tree-based random forest (DTRF) classifier with loss optimization. Initially, preprocessing of the dataset with patient records with known labels is performed for the presence or absence of heart disease records. Then, train a DTRF classifier on the dataset using stochastic gradient boosting (SGB) loss optimization technique and evaluate the classifier’s performance using a separate test dataset. The results demonstrate that the proposed HDP-DTRF approach resulted in 86% of precision, 86% of recall, 85% of F1-score, and 96% of accuracy on publicly available real-world datasets, which are higher than traditional methods.

Keywords