IEEE Transactions on Neural Systems and Rehabilitation Engineering (Jan 2025)
XCF-LSTMSATNet: A Classification Approach for EEG Signals Evoked by Dynamic Random Dot Stereograms
Abstract
Stereovision is the visual perception of depth derived from the integration of two slightly different images from each eye, enabling understanding of the three-dimensional space. This capability is deeply intertwined with cognitive brain functions. To explore the impact of stereograms with varied motions on brain activities, we collected Electroencephalography (EEG) signals evoked by Dynamic Random Dot Stereograms (DRDS). To effectively classify the EEG signals induced by DRDS, we introduced a novel hybrid neural network model, XCF-LSTMSATNet, which integrates an XGBoost Channel Feature Optimization Module with the EEGNet and an LSTM Self-Attention Modules. Initially, in the channel selection phase, XGBoost is employed for preliminary classification and feature weight analysis, which can enhance our channel selection strategy. Following this, EEGNet employs deep convolutional layers to extract spatial features, while separable convolutions are subsequently used to derive high-dimensional spatial-temporal features. Meanwhile, the LSTMSAT Module, with its capability to learn long-term dependencies in time-series signals, is deployed to capture temporal continuity information. The incorporation of the self-attention mechanism further amplifies the model’s ability to grasp long-distance dependencies and enables dynamic weight allocation to the extracted features. In the end, both temporal and spatial features are integrated into the classification module, enabling precise prediction across three categories of EEG signals. The proposed XCF-LSTMSATNet was extensively tested on both a custom dataset and the public datasets SRDA and SRDB. The results demonstrate that the model exhibits solid classification performance across all three datasets, effectively showcasing its robustness and generalization capabilities.
Keywords