Heart Disease Risk Prediction Using Machine Learning Classifiers with Attribute Evaluators

Karna Vishnu Vardhana Reddy; Irraivan Elamvazuthi; Azrina Abd Aziz; Sivajothi Paramasivam; Hui Na Chua; S. Pranavanand

doi:10.3390/app11188352

Applied Sciences (Sep 2021)

Affiliations

Karna Vishnu Vardhana Reddy: Department of Electrical and Electronics Engineering, Universiti Teknologi PETRONAS, Seri Iskandar 32610, Malaysia
Irraivan Elamvazuthi: Department of Electrical and Electronics Engineering, Universiti Teknologi PETRONAS, Seri Iskandar 32610, Malaysia
Azrina Abd Aziz: Department of Electrical and Electronics Engineering, Universiti Teknologi PETRONAS, Seri Iskandar 32610, Malaysia
Sivajothi Paramasivam: School of Engineering, UOWM KDU University College, Shah Alam 40150, Malaysia
Hui Na Chua: Department of Computing and Information Systems, School of Engineering and Technology, Sunway University, Petaling Jaya 47500, Malaysia
S. Pranavanand: Department of E.I.E, VNR Vignana Jyothi Institute of Engineering and Technology, Hyderabad 500090, India

DOI: https://doi.org/10.3390/app11188352
Journal volume & issue: Vol. 11, no. 18, p. 8352

Abstract

Cardiovascular diseases (CVDs) kill about 20.5 million people every year. Early prediction can help people change their lifestyles and receive proper medical treatment if necessary. In this research, ten machine learning (ML) classifiers from different categories, such as Bayes, functions, lazy, meta, rules, and trees, were trained for efficient heart disease risk prediction using the full set of attributes of the Cleveland heart dataset and the optimal attribute sets obtained from three attribute evaluators. The performance of the algorithms was appraised using 10-fold cross-validation. Finally, we tuned the hyperparameter ‘k’, the number of nearest neighbors, in the instance-based (IBk) classifier. The sequential minimal optimization (SMO) classifier achieved an accuracy of 85.148% using the full set of attributes and the highest accuracy, 86.468%, using the optimal attribute set obtained from the chi-squared attribute evaluator. Meanwhile, the meta classifier bagging with logistic regression (LR) provided the highest ROC area of 0.91 using both the full attribute set and the optimal attribute set obtained from the ReliefF attribute evaluator. Overall, the SMO classifier stood as the best prediction method compared to other techniques, and IBk achieved an 8.25% accuracy improvement by tuning the hyperparameter ‘k’ to 9 with the chi-squared attribute set.
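The evaluation protocol named in the abstract (10-fold cross-validation, then a search over the number of nearest neighbors ‘k’) can be illustrated with a minimal sketch. This is not the authors' pipeline: the data are synthetic two-feature points rather than the Cleveland heart dataset, the folds are plain rather than stratified, and the names `make_synthetic`, `knn_predict`, and `cross_val_accuracy` are hypothetical helpers introduced only for this example.

```python
import random
from collections import Counter

def make_synthetic(n=200, seed=0):
    """Toy two-class data (assumption: stands in for real patient records)."""
    rng = random.Random(seed)
    data = []
    for i in range(n):
        label = i % 2
        centre = 0.0 if label == 0 else 2.0
        features = [rng.gauss(centre, 1.0), rng.gauss(centre, 1.0)]
        data.append((features, label))
    rng.shuffle(data)
    return data

def knn_predict(train, x, k):
    """Majority vote among the k training points closest to x."""
    def sq_dist(row):
        return sum((a - b) ** 2 for a, b in zip(row[0], x))
    nearest = sorted(train, key=sq_dist)[:k]
    return Counter(label for _, label in nearest).most_common(1)[0][0]

def cross_val_accuracy(data, k, folds=10):
    """Mean accuracy over `folds` disjoint held-out folds."""
    scores = []
    for f in range(folds):
        test = data[f::folds]  # every folds-th row, starting at f
        train = [row for i, row in enumerate(data) if i % folds != f]
        correct = sum(knn_predict(train, x, k) == y for x, y in test)
        scores.append(correct / len(test))
    return sum(scores) / folds

data = make_synthetic()
# Odd values of k avoid tied votes in a two-class problem.
results = {k: cross_val_accuracy(data, k) for k in (1, 3, 5, 7, 9)}
best_k = max(results, key=results.get)
print(results, best_k)
```

Each candidate ‘k’ is scored on the same ten folds, so the accuracies are directly comparable; the value selected on synthetic data says nothing about the best ‘k’ for the real dataset.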

Published in Applied Sciences

ISSN: 2076-3417 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Engineering (General). Civil engineering (General); Science: Biology (General); Science: Physics; Science: Chemistry
Website: http://www.mdpi.com/journal/applsci
