IEEE Access (Jan 2022)
A Compressed Data Partition and Loop Scheduling Scheme for Neural Networks
Abstract
Neural networks (NNs) have been widely adopted across application domains. Deeper NNs greatly improve output accuracy, but complex NNs with more parameters incur intensive memory accesses, and their data usually must be partitioned because they can exceed the on-chip storage capacity. However, no prior work has considered the co-design of data partitioning and loop scheduling for NNs. In this paper, we propose a sparse NN data partition and loop scheduling scheme. We establish a compression efficiency model for the sparse matrix algorithm and design a partition selection method based on the sparsity characteristics analyzed by that model. We further design a loop scheduling scheme based on the selected partition size. Experimental results show that the average memory accesses of each layer are compressed to 68% of the original, and the throughput of AlexNet, VGG, and VGG19 improves by an average factor of 1.66.
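As a rough illustration of the ideas summarized above, the following Python sketch estimates a compression efficiency ratio for a data tile and uses it to pick a partition size that fits on-chip storage. This is a minimal sketch under stated assumptions: it assumes a CSR-like sparse format, 4-byte values and indices, and hypothetical candidate tile sizes; the paper's actual compression model, storage format, and selection procedure are not specified in the abstract.

```python
import numpy as np

def csr_compression_ratio(tile: np.ndarray,
                          value_bytes: int = 4,
                          index_bytes: int = 4) -> float:
    """Estimate compressed/dense storage ratio for a tile under an
    assumed CSR-style sparse format (the paper's exact model may differ)."""
    rows, cols = tile.shape
    nnz = np.count_nonzero(tile)
    dense_size = rows * cols * value_bytes
    # CSR stores nnz values, nnz column indices, and rows+1 row pointers.
    compressed_size = nnz * (value_bytes + index_bytes) + (rows + 1) * index_bytes
    return compressed_size / dense_size

def select_partition(layer: np.ndarray,
                     on_chip_bytes: int,
                     candidate_tiles=((32, 32), (64, 64), (128, 128)),
                     value_bytes: int = 4):
    """Pick the largest candidate tile whose (possibly compressed)
    footprint fits on chip -- a simplified stand-in for the paper's
    sparsity-driven partition selection."""
    best = None
    for rows, cols in candidate_tiles:  # assumed ascending sizes
        tile = layer[:rows, :cols]
        # Use the compressed form only when it is actually smaller.
        ratio = min(1.0, csr_compression_ratio(tile, value_bytes))
        footprint = rows * cols * value_bytes * ratio
        if footprint <= on_chip_bytes:
            best = (rows, cols)
    return best

# Example: a ~70%-sparse layer lets a larger tile fit in 32 KB of on-chip storage.
rng = np.random.default_rng(0)
layer = rng.random((256, 256)) * (rng.random((256, 256)) > 0.7)
print(select_partition(layer, on_chip_bytes=32 * 1024))
```

The intuition this sketch captures is the trade-off the abstract describes: higher sparsity lowers the compression ratio, which in turn permits larger partitions per on-chip load and fewer off-chip memory accesses, which the loop schedule can then exploit.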
Keywords