Maximizing Parallel Activation of Word-Lines in MRAM-Based Binary Neural Network Accelerators

Daehyun Ahn; Hyunmyung Oh; Hyungjun Kim; Yulhwa Kim; Jae-Joon Kim

doi:10.1109/ACCESS.2021.3121011

IEEE Access (Jan 2021)

Maximizing Parallel Activation of Word-Lines in MRAM-Based Binary Neural Network Accelerators

Daehyun Ahn,
Hyunmyung Oh,
Hyungjun Kim,
Yulhwa Kim,
Jae-Joon Kim

Affiliations

Daehyun Ahn: ORCiD; Department of Convergence IT Engineering, Pohang University of Science and Technology, Pohang, South Korea
Hyunmyung Oh: Department of Convergence IT Engineering, Pohang University of Science and Technology, Pohang, South Korea
Hyungjun Kim: ORCiD; Department of Convergence IT Engineering, Pohang University of Science and Technology, Pohang, South Korea
Yulhwa Kim: Department of Convergence IT Engineering, Pohang University of Science and Technology, Pohang, South Korea
Jae-Joon Kim: ORCiD; Department of Electrical and Computer Engineering, Seoul National University, Seoul, South Korea

DOI: https://doi.org/10.1109/ACCESS.2021.3121011
Journal volume & issue: Vol. 9
pp. 141961 – 141969

Abstract

Read online

Magnetic RAM (MRAM)-based crossbar array has a great potential as a platform for in-memory binary neural network (BNN) computing. However, the number of word-lines that can be activated simultaneously is limited because of the low $I_{H}/I_{L}$ ratio of MRAM, which makes BNNs more vulnerable to the device variation. To address this issue, we propose an algorithm/hardware co-design methodology. First, we choose a promising memristor crossbar array (MCA) structure based on the sensitivity analysis to process variations. Since the selected MCA structure becomes more tolerant to the device variation when the number of 1 in input activation values decreases, we apply an input distribution regularization scheme to reduce the number of 1 in input of BNNs during training. We further improve the robustness against device variation by adopting the retraining scheme based on knowledge distillation. Experimental results show that the proposed method makes BNNs more tolerant to MRAM variation and increases the number of parallel word-line activation significantly; thereby achieving improved throughput and energy efficiency.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords