Applied Sciences (Sep 2024)

Exploring Inner Speech Recognition via Cross-Perception Approach in EEG and fMRI

  • Jiahao Qin,
  • Lu Zong,
  • Feng Liu

DOI
https://doi.org/10.3390/app14177720
Journal volume & issue
Vol. 14, no. 17
p. 7720

Abstract

Read online

Multimodal brain signal analysis has shown great potential in decoding complex cognitive processes, particularly in the challenging task of inner speech recognition. This paper introduces an innovative I nner Speech Recognition via Cross-Perception (ISRCP) approach that significantly enhances accuracy by fusing electroencephalography (EEG) and functional magnetic resonance imaging (fMRI) data. Our approach comprises three core components: (1) multigranularity encoders that separately process EEG time series, EEG Markov Transition Fields, and fMRI spatial data; (2) a cross-perception expert structure that learns both modality-specific and shared representations; and (3) an attention-based adaptive fusion strategy that dynamically adjusts the contributions of different modalities based on task relevance. Extensive experiments on the Bimodal Dataset on Inner Speech demonstrate that our model outperforms existing methods across accuracy and F1 score.

Keywords