ERTNet: an interpretable transformer-based framework for EEG emotion recognition

Ruixiang Liu; Yihu Chao; Xuerui Ma; Xianzheng Sha; Limin Sun; Shuo Li; Shijie Chang

doi:10.3389/fnins.2024.1320645

Frontiers in Neuroscience (Jan 2024)

Affiliations

Ruixiang Liu: School of Intelligent Medicine, China Medical University, Shenyang, Liaoning, China
Yihu Chao: School of Intelligent Medicine, China Medical University, Shenyang, Liaoning, China
Xuerui Ma: School of Intelligent Medicine, China Medical University, Shenyang, Liaoning, China
Xianzheng Sha: School of Intelligent Medicine, China Medical University, Shenyang, Liaoning, China
Limin Sun: Shanghai Institute of Microsystem and Information Technology, Chinese Academy of Sciences, Shanghai, China
Shuo Li: School of Life Sciences, China Medical University, Shenyang, Liaoning, China
Shijie Chang: School of Intelligent Medicine, China Medical University, Shenyang, Liaoning, China

DOI: https://doi.org/10.3389/fnins.2024.1320645
Journal volume & issue: Vol. 18

Abstract


Background: Emotion recognition using EEG signals enables clinicians to assess patients’ emotional states with precision and immediacy. However, the complexity of EEG signal data poses challenges for traditional recognition methods. Deep learning techniques effectively capture the nuanced emotional cues within these signals by leveraging extensive data. Nonetheless, most deep learning techniques maintain accuracy but lack interpretability.

Methods: We developed an interpretable end-to-end EEG emotion recognition framework based on a hybrid CNN and transformer architecture. Specifically, temporal convolution isolates salient information from EEG signals while filtering out potential high-frequency noise. Spatial convolution discerns the topological connections between channels. Subsequently, the transformer module processes the feature maps to integrate high-level spatiotemporal features, enabling the identification of the prevailing emotional state.

Results: Experimental results demonstrated that our model excels in diverse emotion classification tasks, achieving an accuracy of 74.23% ± 2.59% on the dimensional model (DEAP) and 67.17% ± 1.70% on the discrete model (SEED-V). These results surpass the performance of both CNN-based and LSTM-based counterparts. Through interpretive analysis, we ascertained that the beta and gamma bands in the EEG signals exert the most significant impact on emotion recognition performance. Notably, our model can independently tailor a Gaussian-like convolution kernel, effectively filtering high-frequency noise from the input EEG data.

Discussion: Given its robust performance and interpretative capabilities, our proposed framework is a promising tool for EEG-driven emotion brain-computer interfaces.
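As a rough illustration of why a Gaussian-like temporal kernel suppresses high-frequency noise, the sketch below convolves two sine waves with a fixed Gaussian kernel and compares how much of each survives. It is not the ERTNet implementation: the kernel here is hand-built rather than learned, and the sampling rate (128 Hz), kernel length (15 taps), width (sigma of 2 samples), and test frequencies (4 Hz and 50 Hz) are all assumptions chosen for the demonstration.

```python
import math


def gaussian_kernel(length, sigma):
    """Return a 1-D Gaussian kernel with `length` taps, normalized to sum to 1."""
    center = (length - 1) / 2.0
    weights = [math.exp(-0.5 * ((i - center) / sigma) ** 2) for i in range(length)]
    total = sum(weights)
    return [w / total for w in weights]


def convolve_valid(signal, kernel):
    """1-D convolution, keeping only positions where the kernel fully overlaps."""
    k = len(kernel)
    flipped = kernel[::-1]
    return [
        sum(signal[i + j] * flipped[j] for j in range(k))
        for i in range(len(signal) - k + 1)
    ]


def rms(values):
    """Root-mean-square amplitude of a sequence."""
    return math.sqrt(sum(v * v for v in values) / len(values))


def sine(freq_hz, fs_hz, n_samples):
    """Unit-amplitude sine wave sampled at fs_hz."""
    return [math.sin(2 * math.pi * freq_hz * n / fs_hz) for n in range(n_samples)]


FS = 128  # assumed sampling rate in Hz (illustrative, not taken from the paper)
kernel = gaussian_kernel(length=15, sigma=2.0)  # assumed kernel shape

slow = sine(4, FS, 512)   # low-frequency component
fast = sine(50, FS, 512)  # stand-in for high-frequency noise

# Gain = output amplitude relative to input amplitude for each component.
slow_gain = rms(convolve_valid(slow, kernel)) / rms(slow)
fast_gain = rms(convolve_valid(fast, kernel)) / rms(fast)

print(f"4 Hz gain:  {slow_gain:.3f}")
print(f"50 Hz gain: {fast_gain:.6f}")
```

With these assumed settings the slow component passes nearly unchanged while the fast one is strongly attenuated, which is the low-pass behaviour the abstract attributes to the learned kernel. A wider sigma lowers the cutoff further, so the actual width a trained model settles on determines which EEG bands are preserved.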

Published in Frontiers in Neuroscience

ISSN: 1662-4548 (Print); 1662-453X (Online)
Publisher: Frontiers Media S.A.
Country of publisher: Switzerland
LCC subjects: Medicine: Internal medicine: Neurosciences. Biological psychiatry. Neuropsychiatry
Website: http://www.frontiersin.org/neuroscience
