Towards Convolutional Neural Network Acceleration and Compression Based on Simonk-Means

Mingjie Wei; Yunping Zhao; Xiaowen Chen; Chen Li; Jianzhuang Lu

doi:10.3390/s22114298

Sensors (Jun 2022)

Towards Convolutional Neural Network Acceleration and Compression Based on Simonk-Means

Mingjie Wei,
Yunping Zhao,
Xiaowen Chen,
Chen Li,
Jianzhuang Lu

Affiliations

Mingjie Wei: The College of Computer Science, National University of Defence Technology, Changsha 410000, China
Yunping Zhao: The College of Computer Science, National University of Defence Technology, Changsha 410000, China
Xiaowen Chen: The College of Computer Science, National University of Defence Technology, Changsha 410000, China
Chen Li: The College of Computer Science, National University of Defence Technology, Changsha 410000, China
Jianzhuang Lu: The College of Computer Science, National University of Defence Technology, Changsha 410000, China

DOI: https://doi.org/10.3390/s22114298
Journal volume & issue: Vol. 22, no. 11
p. 4298

Abstract

Read online

Convolutional Neural Networks (CNNs) are popular models that are widely used in image classification, target recognition, and other fields. Model compression is a common step in transplanting neural networks into embedded devices, and it is often used in the retraining stage. However, it requires a high expenditure of time by retraining weight data to atone for the loss of precision. Unlike in prior designs, we propose a novel model compression approach based on Simonk-means, which is specifically designed to support a hardware acceleration scheme. First, we propose an extension algorithm named Simonk-means based on simple k-means. We use Simonk-means to cluster trained weights in convolutional layers and fully connected layers. Second, we reduce the consumption of hardware resources in data movement and storage by using a data storage and index approach. Finally, we provide the hardware implementation of the compressed CNN accelerator. Our evaluations on several classifications show that our design can achieve 5.27× compression and reduce 74.3% of the multiply–accumulate (MAC) operations in AlexNet on the FASHION-MNIST dataset.

Published in Sensors

ISSN: 1424-8220 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Chemical technology
Website: http://www.mdpi.com/journal/sensors

About the journal

Abstract

Keywords