Deep compression of convolutional neural networks with low‐rank approximation

Marcella Astrid; Seung‐Ik Lee

doi:10.4218/etrij.2018-0065

ETRI Journal (Aug 2018)

DOI: https://doi.org/10.4218/etrij.2018-0065
Journal volume & issue: Vol. 40, no. 4
pp. 421 – 434

Abstract

The application of deep neural networks (DNNs) to connect the world with cyber‐physical systems (CPSs) has attracted much attention. However, DNNs demand large amounts of memory and computation, which hinders their use in the relatively low‐end smart devices that are widely used in CPSs. In this paper, we aim to determine whether DNNs can be efficiently deployed and operated on low‐end smart devices. To do this, we develop a method that reduces the memory requirement of DNNs and increases inference speed while keeping performance (for example, accuracy) close to the original level. The parameters of DNNs are decomposed using a hybrid of canonical polyadic–singular value decomposition, approximated using a tensor power method, and fine‐tuned by iterative one‐shot hybrid fine‐tuning to recover the accuracy lost to compression. We evaluate our method on frequently used networks. We also present results from extensive experiments on the effects of several fine‐tuning methods, the importance of iterative fine‐tuning, and decomposition techniques. We demonstrate the effectiveness of the proposed method by deploying compressed networks on smartphones.
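To make the idea of low‐rank compression concrete, the following is a minimal sketch of the simplest matrix analogue of a power method: it approximates a small weight matrix by a single rank‐1 term and compares parameter counts. It is not the authors' hybrid decomposition or their tensor power method. The function names, the toy matrix `w`, and the iteration count are all assumptions chosen for illustration.

```python
import math

def matvec(a, x):
    """Multiply matrix a (a list of rows) by vector x."""
    return [sum(aij * xj for aij, xj in zip(row, x)) for row in a]

def norm(x):
    """Euclidean norm of a vector."""
    return math.sqrt(sum(v * v for v in x))

def rank1_power_method(w, iters=100):
    """Approximate w (m x n) by sigma * u * v^T using power iteration.

    Hypothetical helper for illustration: alternately applies w and its
    transpose, renormalizing each time, so that u and v converge to the
    leading left and right singular vectors of w.
    """
    wt = [list(col) for col in zip(*w)]      # transpose of w
    n = len(w[0])
    v = [1.0 / math.sqrt(n)] * n             # deterministic unit start vector
    for _ in range(iters):
        u = matvec(w, v)
        s = norm(u)
        u = [x / s for x in u]
        v = matvec(wt, u)
        sigma = norm(v)                      # estimate of the top singular value
        v = [x / sigma for x in v]
    return sigma, u, v

def frobenius_error(w, sigma, u, v):
    """Frobenius norm of w - sigma * u * v^T."""
    return math.sqrt(sum((w[i][j] - sigma * u[i] * v[j]) ** 2
                         for i in range(len(w)) for j in range(len(w[0]))))

# Toy "layer": a rank-1 matrix plus a small perturbation (assumed data).
w = [[1.0, 0.01, 2.0],
     [2.0, 0.00, 4.0],
     [3.0, -0.01, 6.0],
     [4.0, 0.00, 8.0]]

sigma, u, v = rank1_power_method(w)
error = frobenius_error(w, sigma, u, v)

m, n = len(w), len(w[0])
original_params = m * n                      # dense matrix
factored_params = m + n + 1                  # u, v, and sigma

print("approximation error:", error)
print("parameters:", original_params, "->", factored_params)
```

On a matrix this small the saving is modest, but the count grows as m × n for the dense form and only as m + n per retained rank‐1 term, which is where compression of large layers comes from. A real pipeline would keep several terms and then fine‐tune, as the abstract describes.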

Published in ETRI Journal

ISSN: 1225-6463 (Print); 2233-7326 (Online)
Publisher: Electronics and Telecommunications Research Institute (ETRI)
Country of publisher: Korea, Republic of
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering: Telecommunication; Technology: Electrical engineering. Electronics. Nuclear engineering: Electronics
Website: https://onlinelibrary.wiley.com/journal/22337326
