Complex hybrid weighted pruning method for accelerating convolutional neural networks

Xu Geng; Jinxiong Gao; Yonghui Zhang; Dingtan Xu

doi:10.1038/s41598-024-55942-5

Scientific Reports (Mar 2024)

Complex hybrid weighted pruning method for accelerating convolutional neural networks

Xu Geng,
Jinxiong Gao,
Yonghui Zhang,
Dingtan Xu

Affiliations

Xu Geng: School of Information and Communication Engineering, Hainan University
Jinxiong Gao: School of Information and Communication Engineering, Hainan University
Yonghui Zhang: School of Information and Communication Engineering, Hainan University
Dingtan Xu: School of Information and Communication Engineering, Hainan University

DOI: https://doi.org/10.1038/s41598-024-55942-5
Journal volume & issue: Vol. 14, no. 1
pp. 1 – 11

Abstract

Read online

Abstract The increasing interest in filter pruning of convolutional neural networks stems from its inherent ability to effectively compress and accelerate these networks. Currently, filter pruning is mainly divided into two schools: norm-based and relation-based. These methods aim to selectively remove the least important filters according to predefined rules. However, the limitations of these methods lie in the inadequate consideration of filter diversity and the impact of batch normalization (BN) layers on the input of the next layer, which may lead to performance degradation. To address the above limitations of norm-based and similarity-based methods, this study conducts empirical analyses to reveal their drawbacks and subsequently introduces a groundbreaking complex hybrid weighted pruning method. By evaluating the correlations and norms between individual filters, as well as the parameters of the BN layer, our method effectively identifies and prunes the most redundant filters in a robust manner, thereby avoiding significant decreases in network performance. We conducted comprehensive and direct pruning experiments on different depths of ResNet using publicly available image classification datasets, ImageNet and CIFAR-10. The results demonstrate the significant efficacy of our approach. In particular, when applied to the ResNet-50 on the ImageNet dataset, achieves a significant reduction of 53.5% in floating-point operations, with a performance loss of only 0.6%.

Published in Scientific Reports

ISSN: 2045-2322 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine; Science
Website: https://www.nature.com/srep/

About the journal