Healthcare Analytics (Dec 2024)

A novel integrated logistic regression model enhanced with recursive feature elimination and explainable artificial intelligence for dementia prediction

  • Rasel Ahmed,
  • Nafiz Fahad,
  • Md Saef Ullah Miah,
  • Md. Jakir Hossen,
  • Md. Kishor Morol,
  • Mufti Mahmud,
  • M. Mostafizur Rahman

Journal volume & issue
Vol. 6
p. 100362

Abstract

Read online

Dementia is a major global health issue that significantly impacts millions of individuals, families, and societies worldwide, creating a substantial burden on healthcare systems. This study introduces a novel approach for predicting dementia by employing the Logistic Regression (LR) model, enhanced with Recursive Feature Elimination (RFE), applied to a unique dataset comprising 1000 patients, with 49.60% male and 50.40% female. The LR model, recognized for its simplicity and effectiveness in binary classification tasks, is optimized through RFE, a technique that iteratively eliminates less significant features to improve model performance. The model’s effectiveness was assessed using comprehensive metrics, including accuracy, precision, recall, F1-score, Matthews Correlation Coefficient (MCC), and Kappa score. Furthermore, SHapley Additive exPlanations (SHAP) values were employed to increase the interpretability of the model, providing insights into the most influential features for dementia prediction. To address the issue of overfitting, a standardization technique was implemented, which enhanced the model’s predictive performance. The findings of this study hold potential implications for early dementia detection, informing intervention strategies, and optimizing healthcare resource allocation.

Keywords