Benchmarking explanation methods for mental state decoding with deep learning models

Armin W. Thomas; Christopher Ré; Russell A. Poldrack

NeuroImage (Jun 2023)

Benchmarking explanation methods for mental state decoding with deep learning models

Armin W. Thomas,
Christopher Ré,
Russell A. Poldrack

Affiliations

Armin W. Thomas: Corresponding author.; Stanford Data Science, Stanford University, 450 Serra Mall, 94305, Stanford, USA
Christopher Ré: Dept. of Computer Science, Stanford University, 450 Serra Mall, 94305, Stanford, USA
Russell A. Poldrack: Dept. of Psychology, Stanford University, 450 Serra Mall, Stanford, 94305, USA

Journal volume & issue: Vol. 273
p. 120109

Abstract

Read online

Deep learning (DL) models find increasing application in mental state decoding, where researchers seek to understand the mapping between mental states (e.g., experiencing anger or joy) and brain activity by identifying those spatial and temporal features of brain activity that allow to accurately identify (i.e., decode) these states. Once a DL model has been trained to accurately decode a set of mental states, neuroimaging researchers often make use of methods from explainable artificial intelligence research to understand the model’s learned mappings between mental states and brain activity. Here, we benchmark prominent explanation methods in a mental state decoding analysis of multiple functional Magnetic Resonance Imaging (fMRI) datasets. Our findings demonstrate a gradient between two key characteristics of an explanation in mental state decoding, namely, its faithfulness and its alignment with other empirical evidence on the mapping between brain activity and decoded mental state: explanation methods with high explanation faithfulness, which capture the model’s decision process well, generally provide explanations that align less well with other empirical evidence than the explanations of methods with less faithfulness. Based on our findings, we provide guidance for neuroimaging researchers on how to choose an explanation method to gain insight into the mental state decoding decisions of DL models.

Published in NeuroImage

ISSN: 1053-8119 (Print); 1095-9572 (Online)
Publisher: Elsevier
Country of publisher: United States
LCC subjects: Medicine: Internal medicine: Neurosciences. Biological psychiatry. Neuropsychiatry
Website: https://www.journals.elsevier.com/neuroimage

About the journal

Abstract

Keywords