Pre-existing and machine learning-based models for cardiovascular risk prediction

Sang-Yeong Cho; Sun-Hwa Kim; Si-Hyuck Kang; Kyong Joon Lee; Dongjun Choi; Seungjin Kang; Sang Jun Park; Tackeun Kim; Chang-Hwan Yoon; Tae-Jin Youn; In-Ho Chae

doi:10.1038/s41598-021-88257-w

Scientific Reports (Apr 2021)

Affiliations

Sang-Yeong Cho: Department of Cardiology, Gyeongsang National University School of Medicine and Gyeongsang National University Changwon Hospital
Sun-Hwa Kim: Cardiovascular Center, Internal Medicine, Seoul National University Bundang Hospital
Si-Hyuck Kang: Cardiovascular Center, Internal Medicine, Seoul National University Bundang Hospital
Kyong Joon Lee: Department of Radiology, Seoul National University Bundang Hospital, Seoul National University College of Medicine
Dongjun Choi: Department of Radiology, Seoul National University Bundang Hospital, Seoul National University College of Medicine
Seungjin Kang: Office of eHealth Research and Businesses, Seoul National University Bundang Hospital
Sang Jun Park: Department of Ophthalmology, Seoul National University Bundang Hospital, Seoul National University College of Medicine
Tackeun Kim: Department of Neurosurgery, Seoul National University Bundang Hospital, Seoul National University College of Medicine
Chang-Hwan Yoon: Cardiovascular Center, Internal Medicine, Seoul National University Bundang Hospital
Tae-Jin Youn: Cardiovascular Center, Internal Medicine, Seoul National University Bundang Hospital
In-Ho Chae: Cardiovascular Center, Internal Medicine, Seoul National University Bundang Hospital

Journal volume & issue: Vol. 11, no. 1, pp. 1–10

Abstract

Predicting the risk of cardiovascular disease is key to primary prevention. Machine learning has attracted attention as a way to analyze increasingly large and complex healthcare data. We assessed the discrimination and calibration of pre-existing cardiovascular risk prediction models and developed machine learning-based prediction algorithms. This study included 222,998 Korean adults aged 40–79 years who were naïve to lipid-lowering therapy and had no history of cardiovascular disease. Pre-existing models showed moderate to good discrimination in predicting future cardiovascular events (C-statistics 0.70–0.80). The pooled cohort equation (PCE) specifically showed a C-statistic of 0.738. Among the machine learning models evaluated, which included logistic regression, treebag, random forest, and adaboost, the neural network model showed the greatest C-statistic (0.751), which was significantly higher than that for the PCE. The neural network model also showed better agreement between predicted risk and observed outcomes (Hosmer–Lemeshow χ² = 86.1, P < 0.001) than the PCE for whites (Hosmer–Lemeshow χ² = 171.1, P < 0.001). Similar improvements were observed relative to the Framingham risk score, Systematic Coronary Risk Evaluation, and QRISK3. This study demonstrated that machine learning-based algorithms could improve cardiovascular risk prediction over contemporary cardiovascular risk models in statin-naïve, healthy Korean adults without cardiovascular disease. The model can be easily adopted for risk assessment and clinical decision making.
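The two metrics reported in the abstract can be illustrated with a minimal sketch. The C-statistic measures discrimination: the probability that a randomly chosen person who had an event received a higher predicted risk than one who did not. The Hosmer–Lemeshow χ² measures calibration, with lower values indicating closer agreement between predicted and observed event counts. The function names, the equal-size risk grouping, and the default of 10 groups are assumptions made for illustration; the abstract does not state how the study computed these statistics.

```python
def c_statistic(y_true, y_score):
    """C-statistic (ROC AUC) from the rank-sum of the event cases.

    y_true: list of 0/1 outcomes; y_score: list of predicted risks.
    Tied scores receive their average rank.
    """
    pairs = sorted(zip(y_score, y_true))
    n = len(pairs)
    ranks = [0.0] * n
    i = 0
    while i < n:
        j = i
        # Extend j over a run of tied scores.
        while j + 1 < n and pairs[j + 1][0] == pairs[i][0]:
            j += 1
        avg_rank = (i + j) / 2 + 1  # ranks are 1-based
        for k in range(i, j + 1):
            ranks[k] = avg_rank
        i = j + 1
    n_pos = sum(y for _, y in pairs)
    n_neg = n - n_pos
    rank_sum_pos = sum(r for r, (_, y) in zip(ranks, pairs) if y == 1)
    return (rank_sum_pos - n_pos * (n_pos + 1) / 2) / (n_pos * n_neg)


def hosmer_lemeshow_chi2(y_true, y_prob, groups=10):
    """Hosmer-Lemeshow chi-square over equal-size groups of sorted risk.

    Assumption: groups are formed by sorting on predicted risk and
    splitting into `groups` nearly equal parts (deciles by default).
    """
    order = sorted(range(len(y_prob)), key=lambda i: y_prob[i])
    n = len(order)
    chi2 = 0.0
    for g in range(groups):
        idx = order[g * n // groups:(g + 1) * n // groups]
        if not idx:
            continue
        observed = sum(y_true[i] for i in idx)
        expected = sum(y_prob[i] for i in idx)
        mean_p = expected / len(idx)
        variance = len(idx) * mean_p * (1 - mean_p)
        if variance > 0:
            chi2 += (observed - expected) ** 2 / variance
    return chi2


# Toy data only; these are not values from the study.
outcomes = [0, 0, 1, 1]
risks = [0.1, 0.4, 0.35, 0.8]
print(c_statistic(outcomes, risks))
print(hosmer_lemeshow_chi2(outcomes, risks, groups=2))
```

On real cohort data the same two calls would be applied to each model's predicted risks, so that a higher C-statistic and a lower Hosmer–Lemeshow χ² can be compared across models, as the abstract does for the neural network and the PCE.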

Published in Scientific Reports

ISSN: 2045-2322 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine; Science
Website: https://www.nature.com/srep/
