BrainNPT: Pre-Training Transformer Networks for Brain Network Classification

Jinlong Hu; Yangmin Huang; Nan Wang; Shoubin Dong

doi:10.1109/TNSRE.2024.3434343

IEEE Transactions on Neural Systems and Rehabilitation Engineering (Jan 2024)

Affiliations

Jinlong Hu: Guangdong Key Laboratory of Communication and Computer Network, School of Computer Science and Engineering, South China University of Technology, Guangzhou, China
Yangmin Huang: Guangdong Key Laboratory of Communication and Computer Network, School of Computer Science and Engineering, South China University of Technology, Guangzhou, China
Nan Wang: School of Computer Science and Technology, East China Normal University, Shanghai, China
Shoubin Dong: Guangdong Key Laboratory of Communication and Computer Network, School of Computer Science and Engineering, South China University of Technology, Guangzhou, China

DOI: https://doi.org/10.1109/TNSRE.2024.3434343
Journal volume & issue: Vol. 32, pp. 2727–2736

Abstract

Deep learning methods have advanced quickly in brain imaging analysis over the past few years, but they are usually restricted by the limited availability of labeled data. Pre-training models on unlabeled data has shown promising improvements in feature learning in many domains, such as natural language processing. However, this technique is under-explored in brain network analysis. In this paper, we focused on pre-training methods with Transformer networks that leverage existing unlabeled data for brain functional network classification. First, we proposed a Transformer-based neural network, named BrainNPT, for brain functional network classification. The proposed method used a <cls> token as a classification embedding vector so that the Transformer model could effectively capture the representation of brain networks. Second, we proposed a pre-training framework for the BrainNPT model that leverages unlabeled brain network data to learn the structural information of brain functional networks. The classification experiments demonstrated that the BrainNPT model without pre-training achieved the best performance compared with the state-of-the-art models, and that the BrainNPT model with pre-training strongly outperformed the state-of-the-art models. The pre-trained BrainNPT model improved accuracy by 8.75% compared with the model without pre-training. We further compared pre-training strategies and data augmentation methods, analyzed the influence of the model's parameters, and explained the trained model.

Published in IEEE Transactions on Neural Systems and Rehabilitation Engineering

ISSN: 1534-4320 (Print); 1558-0210 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Medicine: Medicine (General): Medical technology; Medicine: Therapeutics. Pharmacology
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=7333
