Stable Low-Rank CP Decomposition for Compression of Convolutional Neural Networks Based on Sensitivity

Chenbin Yang; Huiyi Liu

doi:10.3390/app14041491

Applied Sciences (Feb 2024)

Stable Low-Rank CP Decomposition for Compression of Convolutional Neural Networks Based on Sensitivity

Chenbin Yang,
Huiyi Liu

Affiliations

Chenbin Yang: College of Computer and Information, Hohai University, Nanjing 211100, China
Huiyi Liu: College of Computer and Information, Hohai University, Nanjing 211100, China

DOI: https://doi.org/10.3390/app14041491
Journal volume & issue: Vol. 14, no. 4
p. 1491

Abstract

Read online

Modern convolutional neural networks (CNNs) play a crucial role in computer vision applications. The intricacy of the application scenarios and the growing dataset both significantly raise the complexity of CNNs. As a result, they are often overparameterized and have significant computational costs. One potential solution for optimizing and compressing the CNNs is to replace convolutional layers with low-rank tensor decomposition. The most suitable technique for this is Canonical Polyadic (CP) decomposition. However, there are two primary issues with CP decomposition that lead to a significant loss in accuracy. Firstly, the selection of tensor ranks for CP decomposition is an unsolved issue. Secondly, degeneracy and instability are common problems in the CP decomposition of contractional tensors, which makes fine-tuning the compressed model difficult. In this study, a novel approach was proposed for compressing CNNs by using CP decomposition. The first step involves using the sensitivity of convolutional layers to determine the tensor ranks for CP decomposition effectively. Subsequently, to address the degeneracy issue and enhance the stability of the CP decomposition, two novel techniques were incorporated: optimization with sensitivity constraints and iterative fine-tuning based on sensitivity order. Finally, the proposed method was examined on common CNN structures for image classification tasks and demonstrated that it provides stable performance and significantly fewer reductions in classification accuracy.

Published in Applied Sciences

ISSN: 2076-3417 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Engineering (General). Civil engineering (General); Science: Biology (General); Science: Physics; Science: Chemistry
Website: http://www.mdpi.com/journal/applsci

About the journal

Abstract

Keywords