Science, Engineering and Technology (Dec 2024)

A Comparison of Machine Learning Algorithms for Predicting Alzheimer’s Disease Using Neuropsychological Data

  • Zakaria Mokadem,
  • Mohamed Djerioui,
  • Bilal Attallah,
  • Youcef Brik

DOI
https://doi.org/10.54327/set2025/v5.i1.182
Journal volume & issue
Vol. 5, no. 1

Abstract

Read online

Alzheimer’s disease (AD) is a gradient degeneration of essential cognitive activities such as memory, thinking, and cognition. AD mainly affects elderly individuals and is recognized as the most common cause of dementia. This study investigates the predictive performance of nine supervised machine learning algorithms—Logistic Regression, Decision Tree, Random Forest, K-Nearest Neighbors, Support Vector Machine, Gaussian Naïve Bayes, Multi-Layer Perceptron, eXtreme Gradient Boost, and Gradient Boosting—using neuropsychological assessment data. We applied two classification techniques—binary and multiclass—to classify 1761 subjects into three categories: cognitively normal (CN), mild cognitive impairment (MCI), and Alzheimer's disease (AD). Binary classification tasks focused on CNvsAD and CNvsMCI subsets, while multiclass classification used the full dataset (TriClass). Hyperparameter tuning was performed to optimize model performance. The results indicate that ensemble learning models, particularly Gradient Boosting (GB) and Random Forest (RF), exhibited superior accuracy compared to other algorithms. Most models for the CNvsAD subset achieved the highest accuracy (97.74%), while GB achieved the best performance (94.98%) for the CNvsMCI subset. For multiclass classification, RF achieved the highest accuracy at 84.70%. These findings highlight the robustness and efficiency of ensemble learning algorithms, especially in handling complex, non-linear data structures. This study underscores the potential of RF and GB as reliable tools for early detection and classification of Alzheimer’s disease using neuropsychological data.

Keywords