International Journal of General Medicine (Dec 2024)

Construction of Clinical Predictive Models for Heart Failure Detection Using Six Different Machine Learning Algorithms: Identification of Key Clinical Prognostic Features

  • Qu FZ,
  • Ding J,
  • An XF,
  • Peng R,
  • He N,
  • Liu S,
  • Jiang X

Journal volume & issue
Vol. Volume 17
pp. 6523 – 6534

Abstract

Read online

Fang Zhou Qu,1 Jiang Ding,2 Xi Feng An,3 Rui Peng,4 Ni He,5 Sheng Liu,1 Xin Jiang5 1Medical School, Xizang Minzu University, Xianyang, People’s Republic of China; 2Institute of Electrical Power Systems, Graz University of Technology, Graz, Austria; 3The First Affiliated Hospital of Jinan University, Guangzhou, People’s Republic of China; 4Affiliated Nanhua Hospital, University of South China, Hengyang, People’s Republic of China; 5Department of Cardiology, Shaanxi Provincial People’s Hospital, Xi’an, People’s Republic of ChinaCorrespondence: Xin Jiang, Department of Cardiology, Shaanxi Provincial People’s Hospital, Xi’an, Shanxi Province, People’s Republic of China, Email [email protected]: Heart failure (HF) is a clinical syndrome in which structural or functional abnormalities of the heart result in impaired ventricular filling or ejection capacity. In order to improve the adaptability of models to different patient populations and data situations. This study aims to develop predictive models for HF risk using six machine learning algorithms, providing valuable insights into the early assessment and recognition of HF by clinical features.Patients and Methods: The present study focused on clinical characteristics that significantly differed between groups with left ventricular ejection fractions (LVEF) [≤ 40% and > 40%]. Following the elimination of features with significant missing values, the remaining features were utilized to construct predictive models employing six machine learning algorithms. The optimal model was selected based on various performance metrics, including the area under the curve (AUC), accuracy, precision, recall, and F1 score. Utilizing the optimal model, the significance of clinical features was assessed, and those with importance values exceeding 0.8 were identified as crucial to the study. Finally, a correlation analysis was conducted to examine the relationships between these features and other significant clinical features.Results: The logistic regression (LR) model was determined to be the optimal machine learning algorithm in this study, achieving an accuracy of 0.64, a precision of 0.45, a recall of 0.72, an F1 score of 0.51, and an AUC of 0.81 in the training set and 0.91 in the testing set. In addition, the analysis of feature importance indicated that blood calcium, angiotensin-converting enzyme inhibitors (ACEI) dosage, mean hemoglobin concentration, and survival duration were critical to the study, each possessing importance values exceeding 0.8. Furthermore, correlation analysis revealed a strong relationship between blood calcium and ionized calcium (|cor|=0.99), as well as a significant association between ACEI dosage (|cor|=0.68) and left ventricular metrics (|cor|=0.58); on the other hand, no correlations were observed between mean hemoglobin levels and other clinical characteristics.Conclusion: The present study identified LR as the most effective risk prediction model for patients with HF, highlighting blood calcium, ACEI dosage, and mean hemoglobin level as significant predictors. These findings provide significant insights for the clinical prevention and early intervention of HF.Keywords: left ventricular ejection fractions, logistic regression, blood calcium, area under the curve, correlation analysis

Keywords