A multilayer multimodal detection and prediction model based on explainable artificial intelligence for Alzheimer’s disease

Shaker El-Sappagh; Jose M. Alonso; S. M. Riazul Islam; Ahmad M. Sultan; Kyung Sup Kwak

doi:10.1038/s41598-021-82098-3

Scientific Reports (Jan 2021)

A multilayer multimodal detection and prediction model based on explainable artificial intelligence for Alzheimer’s disease

Shaker El-Sappagh,
Jose M. Alonso,
S. M. Riazul Islam,
Ahmad M. Sultan,
Kyung Sup Kwak

Affiliations

Shaker El-Sappagh: Centro Singular de Investigación en Tecnoloxías Intelixentes (CiTIUS), Universidade de Santiago de Compostela
Jose M. Alonso: Centro Singular de Investigación en Tecnoloxías Intelixentes, Universidade de Santiago de Compostela
S. M. Riazul Islam: Department of Computer Science and Engineering, Sejong University
Ahmad M. Sultan: Gastrointestinal Surgical Center, Faculty of Medicine, Mansoura University
Kyung Sup Kwak: Department of Information and Communication Engineering, Inha University

DOI: https://doi.org/10.1038/s41598-021-82098-3
Journal volume & issue: Vol. 11, no. 1
pp. 1 – 26

Abstract

Read online

Abstract Alzheimer’s disease (AD) is the most common type of dementia. Its diagnosis and progression detection have been intensively studied. Nevertheless, research studies often have little effect on clinical practice mainly due to the following reasons: (1) Most studies depend mainly on a single modality, especially neuroimaging; (2) diagnosis and progression detection are usually studied separately as two independent problems; and (3) current studies concentrate mainly on optimizing the performance of complex machine learning models, while disregarding their explainability. As a result, physicians struggle to interpret these models, and feel it is hard to trust them. In this paper, we carefully develop an accurate and interpretable AD diagnosis and progression detection model. This model provides physicians with accurate decisions along with a set of explanations for every decision. Specifically, the model integrates 11 modalities of 1048 subjects from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) real-world dataset: 294 cognitively normal, 254 stable mild cognitive impairment (MCI), 232 progressive MCI, and 268 AD. It is actually a two-layer model with random forest (RF) as classifier algorithm. In the first layer, the model carries out a multi-class classification for the early diagnosis of AD patients. In the second layer, the model applies binary classification to detect possible MCI-to-AD progression within three years from a baseline diagnosis. The performance of the model is optimized with key markers selected from a large set of biological and clinical measures. Regarding explainability, we provide, for each layer, global and instance-based explanations of the RF classifier by using the SHapley Additive exPlanations (SHAP) feature attribution framework. In addition, we implement 22 explainers based on decision trees and fuzzy rule-based systems to provide complementary justifications for every RF decision in each layer. Furthermore, these explanations are represented in natural language form to help physicians understand the predictions. The designed model achieves a cross-validation accuracy of 93.95% and an F1-score of 93.94% in the first layer, while it achieves a cross-validation accuracy of 87.08% and an F1-Score of 87.09% in the second layer. The resulting system is not only accurate, but also trustworthy, accountable, and medically applicable, thanks to the provided explanations which are broadly consistent with each other and with the AD medical literature. The proposed system can help to enhance the clinical understanding of AD diagnosis and progression processes by providing detailed insights into the effect of different modalities on the disease risk.

Published in Scientific Reports

ISSN: 2045-2322 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine; Science
Website: https://www.nature.com/srep/

About the journal