Energy-Efficient DNN Training Processors on Micro-AI Systems

Donghyeon Han; Sanghoon Kang; Sangyeob Kim; Juhyoung Lee; Hoi-Jun Yoo

doi:10.1109/OJSSCS.2022.3219034

IEEE Open Journal of the Solid-State Circuits Society (Jan 2022)

Affiliations

Donghyeon Han: School of Electrical Engineering, Korea Advanced Institute of Science and Technology, Daejeon, South Korea
Sanghoon Kang: School of Electrical Engineering, Korea Advanced Institute of Science and Technology, Daejeon, South Korea
Sangyeob Kim: School of Electrical Engineering, Korea Advanced Institute of Science and Technology, Daejeon, South Korea
Juhyoung Lee: School of Electrical Engineering, Korea Advanced Institute of Science and Technology, Daejeon, South Korea
Hoi-Jun Yoo: School of Electrical Engineering, Korea Advanced Institute of Science and Technology, Daejeon, South Korea

DOI: https://doi.org/10.1109/OJSSCS.2022.3219034
Journal volume & issue: Vol. 2
pp. 259 – 275

Abstract

Many edge and mobile devices can now run deep neural networks (DNNs) thanks to the development of mobile DNN accelerators. These accelerators overcame the problems of limited computing resources and battery capacity by realizing energy-efficient inference. However, their passive behavior makes it difficult for a DNN to provide active customization for individual users or its service environment. On-chip training is therefore becoming increasingly important as a way to provide active interaction between DNN processors and ever-changing surroundings or conditions. Despite its advantages, DNN training has more constraints than inference, so it has been considered impractical on mobile/edge devices. Recently, there have been many attempts to realize mobile DNN training, and this article summarizes a number of those prior works. First, it outlines the new challenges that training functionality poses for DNN accelerators and discusses the new hardware features related to those challenges. Second, it explains algorithm-hardware co-optimization methods and why they have become mainstream in mobile DNN training research. Third, it compares the main differences between conventional inference accelerators and recent training processors. Finally, it concludes by proposing future directions for DNN training processors in micro-AI systems.

Published in IEEE Open Journal of the Solid-State Circuits Society

ISSN: 2644-1349 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering: Electric apparatus and materials. Electric circuits. Electric networks
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=8782712
