Machine Learning Approaches for Stroke Risk Prediction: Findings from the Suita Study

Thien Vu; Yoshihiro Kokubo; Mai Inoue; Masaki Yamamoto; Attayeb Mohsen; Agustin Martin-Morales; Takao Inoué; Research Dawadi; Michihiro Araki

doi:10.3390/jcdd11070207

Journal of Cardiovascular Development and Disease (Jul 2024)

Affiliations

Thien Vu: Artificial Intelligence Center for Health and Biomedical Research, National Institutes of Biomedical Innovation, Health and Nutrition, 3-17 Senrioka-Shinmachi, Settsu 566-0002, Japan
Yoshihiro Kokubo: National Cerebral and Cardiovascular Center, 6-1 Kishibe-Shinmachi, Suita 564-8565, Japan
Mai Inoue: Artificial Intelligence Center for Health and Biomedical Research, National Institutes of Biomedical Innovation, Health and Nutrition, 3-17 Senrioka-Shinmachi, Settsu 566-0002, Japan
Masaki Yamamoto: Artificial Intelligence Center for Health and Biomedical Research, National Institutes of Biomedical Innovation, Health and Nutrition, 3-17 Senrioka-Shinmachi, Settsu 566-0002, Japan
Attayeb Mohsen: Artificial Intelligence Center for Health and Biomedical Research, National Institutes of Biomedical Innovation, Health and Nutrition, 3-17 Senrioka-Shinmachi, Settsu 566-0002, Japan
Agustin Martin-Morales: Artificial Intelligence Center for Health and Biomedical Research, National Institutes of Biomedical Innovation, Health and Nutrition, 3-17 Senrioka-Shinmachi, Settsu 566-0002, Japan
Takao Inoué: Faculty of Informatics, Yamato University, 2-5-1 Katayama, Suita 564-0082, Japan
Research Dawadi: Artificial Intelligence Center for Health and Biomedical Research, National Institutes of Biomedical Innovation, Health and Nutrition, 3-17 Senrioka-Shinmachi, Settsu 566-0002, Japan
Michihiro Araki: Artificial Intelligence Center for Health and Biomedical Research, National Institutes of Biomedical Innovation, Health and Nutrition, 3-17 Senrioka-Shinmachi, Settsu 566-0002, Japan

Journal volume & issue: Vol. 11, no. 7, p. 207

Abstract

Stroke is a significant public health concern because of its impact on mortality and morbidity. This study investigates the utility of machine learning algorithms for predicting stroke and identifying key risk factors, using data from the Suita Study comprising 7389 participants and 53 variables. First, unsupervised k-prototype clustering grouped participants into risk clusters; then five supervised models, namely Logistic Regression (LR), Random Forest (RF), Support Vector Machine (SVM), Extreme Gradient Boosting (XGBoost), and Light Gradient Boosted Machine (LightGBM), were used to predict stroke outcomes. Stroke incidence differed substantially among the risk clusters identified by k-prototype clustering. Supervised learning, particularly RF, was the preferable approach because it achieved higher performance metrics. The Shapley Additive Explanations (SHAP) method identified age, systolic blood pressure, hypertension, estimated glomerular filtration rate, metabolic syndrome, and blood glucose level as key predictors of stroke, consistent with the characteristics of the high-risk groups found by the unsupervised clustering approach. Additionally, previously unidentified risk factors such as elbow joint thickness, fructosamine, hemoglobin, and calcium level showed potential for stroke prediction. In conclusion, machine learning enabled accurate stroke risk prediction and highlighted potential biomarkers, offering a data-driven framework for risk assessment and biomarker discovery.
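
To illustrate how k-prototype clustering can handle the mix of numeric and categorical variables found in cohort data, the following is a minimal sketch of the standard k-prototypes dissimilarity (squared Euclidean distance on numeric features plus a weight gamma times the number of categorical mismatches). It is not the authors' implementation: the function names, the gamma value, the choice of features, and all numbers are hypothetical and serve only as an example.

```python
from typing import Sequence, Tuple

Prototype = Tuple[Sequence[float], Sequence[str]]


def mixed_dissimilarity(num_a: Sequence[float], cat_a: Sequence[str],
                        num_b: Sequence[float], cat_b: Sequence[str],
                        gamma: float = 1.0) -> float:
    """k-prototypes dissimilarity between two records with mixed features."""
    # Numeric part: squared Euclidean distance.
    numeric_part = sum((x - y) ** 2 for x, y in zip(num_a, num_b))
    # Categorical part: simple matching (count of differing categories).
    categorical_part = sum(1 for x, y in zip(cat_a, cat_b) if x != y)
    # gamma balances the two parts; its value here is an assumption.
    return numeric_part + gamma * categorical_part


def assign_cluster(num: Sequence[float], cat: Sequence[str],
                   prototypes: Sequence[Prototype], gamma: float = 1.0) -> int:
    """Return the index of the closest prototype."""
    distances = [mixed_dissimilarity(num, cat, p_num, p_cat, gamma)
                 for p_num, p_cat in prototypes]
    return min(range(len(distances)), key=distances.__getitem__)


# Hypothetical prototypes: (standardized age, standardized systolic BP),
# (hypertension, metabolic syndrome). Values are invented for illustration.
prototypes = [
    ((-0.8, -0.7), ("no", "no")),    # lower-risk prototype
    ((0.9, 1.1), ("yes", "yes")),    # higher-risk prototype
]

# Hypothetical participant.
participant_num = (1.0, 0.8)
participant_cat = ("yes", "no")

cluster = assign_cluster(participant_num, participant_cat, prototypes, gamma=0.5)
print(f"Assigned to cluster {cluster}")
```

A full k-prototypes run would alternate this assignment step with updating each prototype (means for numeric features, modes for categorical ones) until assignments stabilize; the sketch shows only the assignment step.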

Published in Journal of Cardiovascular Development and Disease

ISSN: 2308-3425 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Medicine: Internal medicine: Specialties of internal medicine: Diseases of the circulatory (Cardiovascular) system
Website: http://www.mdpi.com/journal/jcdd
