Journal of Personalized Medicine (Sep 2023)

From Admission to Discharge: Predicting National Institutes of Health Stroke Scale Progression in Stroke Patients Using Biomarkers and Explainable Machine Learning

  • Aimilios Gkantzios,
  • Christos Kokkotis,
  • Dimitrios Tsiptsios,
  • Serafeim Moustakidis,
  • Elena Gkartzonika,
  • Theodoros Avramidis,
  • Gregory Tripsianis,
  • Ioannis Iliopoulos,
  • Nikolaos Aggelousis,
  • Konstantinos Vadikolias

DOI
https://doi.org/10.3390/jpm13091375
Journal volume & issue
Vol. 13, no. 9
p. 1375

Abstract

Read online

As a result of social progress and improved living conditions, which have contributed to a prolonged life expectancy, the prevalence of strokes has increased and has become a significant phenomenon. Despite the available stroke treatment options, patients frequently suffer from significant disability after a stroke. Initial stroke severity is a significant predictor of functional dependence and mortality following an acute stroke. The current study aims to collect and analyze data from the hyperacute and acute phases of stroke, as well as from the medical history of the patients, in order to develop an explainable machine learning model for predicting stroke-related neurological deficits at discharge, as measured by the National Institutes of Health Stroke Scale (NIHSS). More specifically, we approached the data as a binary task problem: improvement of NIHSS progression vs. worsening of NIHSS progression at discharge, using baseline data within the first 72 h. For feature selection, a genetic algorithm was applied. Using various classifiers, we found that the best scores were achieved from the Random Forest (RF) classifier at the 15 most informative biomarkers and parameters for the binary task of the prediction of NIHSS score progression. RF achieved 91.13% accuracy, 91.13% recall, 90.89% precision, 91.00% f1-score, 8.87% FNrate and 4.59% FPrate. Those biomarkers are: age, gender, NIHSS upon admission, intubation, history of hypertension and smoking, the initial diagnosis of hypertension, diabetes, dyslipidemia and atrial fibrillation, high-density lipoprotein (HDL) levels, stroke localization, systolic blood pressure levels, as well as erythrocyte sedimentation rate (ESR) levels upon admission and the onset of respiratory infection. The SHapley Additive exPlanations (SHAP) model interpreted the impact of the selected features on the model output. Our findings suggest that the aforementioned variables may play a significant role in determining stroke patients’ NIHSS progression from the time of admission until their discharge.

Keywords