Auto-Spikformer: Spikformer architecture search

Kaiwei Che; Kaiwei Che; Zhaokun Zhou; Zhaokun Zhou; Jun Niu; Zhengyu Ma; Wei Fang; Wei Fang; Yanqi Chen; Yanqi Chen; Shuaijie Shen; Li Yuan; Li Yuan; Yonghong Tian; Yonghong Tian

doi:10.3389/fnins.2024.1372257

Frontiers in Neuroscience (Jul 2024)

Auto-Spikformer: Spikformer architecture search

Kaiwei Che,
Kaiwei Che,
Zhaokun Zhou,
Zhaokun Zhou,
Jun Niu,
Zhengyu Ma,
Wei Fang,
Wei Fang,
Yanqi Chen,
Yanqi Chen,
Shuaijie Shen,
Li Yuan,
Li Yuan,
Yonghong Tian,
Yonghong Tian

Affiliations

Kaiwei Che: School of Electronic and Computer Engineering, Shenzhen Graduate School, Peking University, Shenzhen, Guangdong, China
Kaiwei Che: Peng Cheng Laboratory, Shenzhen, Guangdong, China
Zhaokun Zhou: School of Electronic and Computer Engineering, Shenzhen Graduate School, Peking University, Shenzhen, Guangdong, China
Zhaokun Zhou: Peng Cheng Laboratory, Shenzhen, Guangdong, China
Jun Niu: School of Electronic and Computer Engineering, Shenzhen Graduate School, Peking University, Shenzhen, Guangdong, China
Zhengyu Ma: Peng Cheng Laboratory, Shenzhen, Guangdong, China
Wei Fang: School of Electronic and Computer Engineering, Shenzhen Graduate School, Peking University, Shenzhen, Guangdong, China
Wei Fang: Peng Cheng Laboratory, Shenzhen, Guangdong, China
Yanqi Chen: School of Electronic and Computer Engineering, Shenzhen Graduate School, Peking University, Shenzhen, Guangdong, China
Yanqi Chen: Peng Cheng Laboratory, Shenzhen, Guangdong, China
Shuaijie Shen: Department of Computer Science and Engineering, Southern University of Science and Technology, Shenzhen, Guangdong, China
Li Yuan: School of Electronic and Computer Engineering, Shenzhen Graduate School, Peking University, Shenzhen, Guangdong, China
Li Yuan: Peng Cheng Laboratory, Shenzhen, Guangdong, China
Yonghong Tian: School of Electronic and Computer Engineering, Shenzhen Graduate School, Peking University, Shenzhen, Guangdong, China
Yonghong Tian: Peng Cheng Laboratory, Shenzhen, Guangdong, China

DOI: https://doi.org/10.3389/fnins.2024.1372257
Journal volume & issue: Vol. 18

Abstract

Read online

IntroductionThe integration of self-attention mechanisms into Spiking Neural Networks (SNNs) has garnered considerable interest in the realm of advanced deep learning, primarily due to their biological properties. Recent advancements in SNN architecture, such as Spikformer, have demonstrated promising outcomes. However, we observe that Spikformer may exhibit excessive energy consumption, potentially attributable to redundant channels and blocks.MethodsTo mitigate this issue, we propose a one-shot Spiking Transformer Architecture Search method, namely Auto-Spikformer. Auto-Spikformer extends the search space to include both transformer architecture and SNN inner parameters. We train and search the supernet based on weight entanglement, evolutionary search, and the proposed Discrete Spiking Parameters Search (DSPS) methods. Benefiting from these methods, the performance of subnets with weights inherited from the supernet without even retraining is comparable to the original Spikformer. Moreover, we propose a new fitness function aiming to find a Pareto optimal combination balancing energy consumption and accuracy.Results and discussionOur experimental results demonstrate the effectiveness of Auto-Spikformer, which outperforms the original Spikformer and most CNN or ViT models with even fewer parameters and lower energy consumption.

Published in Frontiers in Neuroscience

ISSN: 1662-4548 (Print); 1662-453X (Online)
Publisher: Frontiers Media S.A.
Country of publisher: Switzerland
LCC subjects: Medicine: Internal medicine: Neurosciences. Biological psychiatry. Neuropsychiatry
Website: http://www.frontiersin.org/neuroscience

About the journal

Abstract

Keywords