IEEE Access (Jan 2022)

ENOS: Energy-Aware Network Operator Search in Deep Neural Networks

  • Shamma Nasrin,
  • Ahish Shylendra,
  • Nastaran Darabi,
  • Theja Tulabandhula,
  • Wilfred Gomes,
  • Ankush Chakrabarty,
  • Amit Ranjan Trivedi

DOI: https://doi.org/10.1109/ACCESS.2022.3192515
Journal volume & issue: Vol. 10, pp. 81447–81457

Abstract

This work proposes a novel Energy-aware Network Operator Search (ENOS) approach to address the energy-accuracy trade-offs of a deep neural network (DNN) accelerator. In recent years, novel hardware-friendly inference operators such as binary-weight, multiplication-free, and deep-shift have been proposed to improve the computational efficiency of a DNN accelerator. However, simplifying DNN operators invariably comes at the cost of lower accuracy, especially on complex processing tasks. While prior works generally apply the same inference operator throughout the neural architecture, the proposed ENOS framework allows an optimal layer-wise integration of inference operators, at optimal precision, to maintain both high prediction accuracy and high energy efficiency. The search in ENOS is formulated as a continuous optimization problem, solvable using gradient descent methods, thereby only minimally increasing the training cost when learning both layer-wise inference operators and weights. Using ENOS, we discuss multiply-accumulate (MAC) cores for digital spatial architectures that can be reconfigured to different operators and varying computing precision. ENOS training methods with single and bi-level optimization objectives are discussed and compared. We also discuss a sequential operator assignment strategy in ENOS that learns the assignment for only one layer per training step, as well as a stochastic mode of ENOS. ENOS is characterized on ShuffleNet and SqueezeNet using CIFAR-10 and CIFAR-100. Compared to conventional uni-operator approaches, under the same energy budget, ENOS improves accuracy by 10–20%. ENOS also outperforms comparable mixed-precision uni-operator implementations by 3–5% in accuracy under the same energy budget.
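The continuous relaxation at the heart of the search can be illustrated with a short sketch: each layer holds all candidate operators and mixes their outputs through softmax-normalized architecture parameters, so operator assignment and weights are learned jointly by gradient descent while an energy penalty steers the search toward cheaper operators. The PyTorch sketch below is a minimal illustration under assumptions, not the authors' implementation; the names (MixedOpLayer, OP_ENERGY, energy_weight) and the per-operator energy costs are hypothetical.

import torch
import torch.nn as nn
import torch.nn.functional as F

# Hypothetical relative energy costs per candidate operator (assumed values).
OP_ENERGY = {"full": 1.0, "binary": 0.3, "shift": 0.2}

class BinaryLinear(nn.Linear):
    """Linear layer with sign-binarized weights (straight-through estimator)."""
    def forward(self, x):
        w_bin = torch.sign(self.weight)
        # Forward uses binarized weights; backward passes gradients to self.weight.
        w = self.weight + (w_bin - self.weight).detach()
        return F.linear(x, w, self.bias)

class ShiftLinear(nn.Linear):
    """Linear layer with weights rounded to powers of two (shift-like operator)."""
    def forward(self, x):
        w_shift = torch.sign(self.weight) * (
            2.0 ** torch.round(torch.log2(self.weight.abs().clamp_min(1e-8))))
        w = self.weight + (w_shift - self.weight).detach()
        return F.linear(x, w, self.bias)

class MixedOpLayer(nn.Module):
    """One layer holding all candidate operators, mixed by softmax weights."""
    def __init__(self, in_f, out_f):
        super().__init__()
        self.ops = nn.ModuleDict({
            "full": nn.Linear(in_f, out_f),
            "binary": BinaryLinear(in_f, out_f),
            "shift": ShiftLinear(in_f, out_f),
        })
        # Architecture parameters: one logit per candidate operator.
        self.alpha = nn.Parameter(torch.zeros(len(self.ops)))

    def forward(self, x):
        probs = F.softmax(self.alpha, dim=0)
        return sum(p * op(x) for p, op in zip(probs, self.ops.values()))

    def expected_energy(self):
        probs = F.softmax(self.alpha, dim=0)
        costs = torch.tensor([OP_ENERGY[k] for k in self.ops])
        return (probs * costs).sum()

# Single-level objective: task loss plus a weighted expected-energy penalty,
# so one optimizer updates operator assignments and weights together.
model = nn.Sequential(MixedOpLayer(32, 64), nn.ReLU(), MixedOpLayer(64, 10))
opt = torch.optim.Adam(model.parameters(), lr=1e-3)
x, y = torch.randn(8, 32), torch.randint(0, 10, (8,))
energy_weight = 0.1  # assumed trade-off coefficient
opt.zero_grad()
loss = F.cross_entropy(model(x), y) + energy_weight * sum(
    m.expected_energy() for m in model if isinstance(m, MixedOpLayer))
loss.backward()
opt.step()

After training, each layer would be discretized to its highest-probability operator; the paper's bi-level, sequential, and stochastic variants address this hard assignment step more directly.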

Keywords