Heart Disease Prediction Using Novel Ensemble and Blending Based Cardiovascular Disease Detection Networks: EnsCVDD-Net and BlCVDD-Net

Hira Khan; Nadeem Javaid; Tariq Bashir; Mariam Akbar; Nabil Alrajeh; Sheraz Aslam

doi:10.1109/ACCESS.2024.3421241

IEEE Access (Jan 2024)

Heart Disease Prediction Using Novel Ensemble and Blending Based Cardiovascular Disease Detection Networks: EnsCVDD-Net and BlCVDD-Net

Hira Khan,
Nadeem Javaid,
Tariq Bashir,
Mariam Akbar,
Nabil Alrajeh,
Sheraz Aslam

Affiliations

Hira Khan: Department of Electrical and Computer Engineering, COMSATS University Islamabad, Islamabad, Pakistan
Nadeem Javaid: ORCiD; Department of Computer Science, COMSATS University Islamabad, Islamabad, Pakistan
Tariq Bashir: ORCiD; Department of Electrical and Computer Engineering, COMSATS University Islamabad, Islamabad, Pakistan
Mariam Akbar: Department of Computer Science, COMSATS University Islamabad, Islamabad, Pakistan
Nabil Alrajeh: ORCiD; Department of Biomedical Technology, College of Applied Medical Sciences, King Saud University, Riyadh, Saudi Arabia
Sheraz Aslam: ORCiD; Department of Electrical Engineering, Computer Engineering and Informatics (EECEI), Cyprus University of Technology, Limassol, Cyprus

DOI: https://doi.org/10.1109/ACCESS.2024.3421241
Journal volume & issue: Vol. 12
pp. 109230 – 109254

Abstract

Read online

Cardiovascular Diseases (CVDs) have emerged as a significant physiological condition, being a primary contributor to mortality. Timely and precise diagnosis of heart disease is crucial to safeguard patients from additional harm. Recent studies show that the usage of data driven approaches, such as Deep Learning (DL) and Machine Learning (ML) techniques, in the field of medical science is highly useful in accurately diagnosing heart disease in less time. However, statistical learning and traditional ML approaches require feature engineering to generate robust and effective features from data, which are then used in the prediction models. In the case of large complex data, both processes pose many challenges. Whereas, DL techniques are capable of learning features automatically from the data and are effective at handling large and intricate datasets while outperforming the ML models. This study focuses on the accurate prediction of CVDs, considering the patient’s health and socio-economic conditions while mitigating the challenges presented by imbalanced data. The Adaptive Synthetic Sampling Technique is used for data balancing, while the Point Biserial Correlation Coefficient is used as a feature selection technique. In this study, two DL models, Ensemble based Cardiovascular Disease Detection Network (EnsCVDD-Net) and Blending based Cardiovascular Disease Detection Network (BlCVDD-Net), are proposed for accurate prediction and classification of CVDs. EnsCVDD-Net is made by applying an ensemble technique to LeNet and Gated Recurrent Unit (GRU), and BlCVDD-Net is made by blending LeNet, GRU and Multilayer Perceptron. SHapley Additive exPlanations is used to provide a clear understanding of the influence different factors have on CVD diagnosis. The network’s performance is evaluated on the basis of various performance metrics. The results indicate that the EnsCVDD-Net outperforms all base models with 88% accuracy, 88% F1-score, 91% precision, 85% recall, and 777s execution time. Similarly, with 91% accuracy, 91% F1-score, 96% precision, 86% recall, and 247s execution time, BlCVDD-Net outperforms the state-of-the-art DL models. To validate the model’s results, 10-Fold Cross Validation is employed. An eXplainable Artificial Intelligence technique, SHapley Additive exPlanation is employed to know the features contribution in model’s predictions.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords