IEEE Journal on Exploratory Solid-State Computational Devices and Circuits (Jan 2020)
A Relaxed Quantization Training Method for Hardware Limitations of Resistive Random Access Memory (ReRAM)-Based Computing-in-Memory
Abstract
Nonvolatile computing-in-memory (nvCIM) exhibits high potential for neuromorphic computing involving massively parallel computations and for achieving high energy efficiency. nvCIM is especially suitable for deep neural networks, which require large numbers of matrix-vector multiplications. However, a comprehensive quantization algorithm that overcomes the hardware limitations of resistive random access memory (ReRAM)-based nvCIM, such as the limited number of I/Os, word lines (WLs), and analog-to-digital converter (ADC) outputs, has yet to be developed. In this article, we propose a quantization training method for compressing deep models. The method comprises three steps: input and weight quantization, ReRAM convolution (ReConv), and ADC quantization. ADC quantization addresses the error sampling problem by using the Gumbel-softmax trick. With a 4-bit ADC in the nvCIM macro, accuracy decreases by only 0.05% and 1.31% on MNIST and CIFAR-10, respectively, compared with the corresponding accuracies obtained under an ideal ADC. The experimental results indicate that the proposed method is effective in compensating for the hardware limitations of nvCIM macros.
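As context for the ADC quantization step, the following is a minimal PyTorch sketch of the Gumbel-softmax trick for drawing differentiable samples over discrete ADC output levels. The level count, logit construction, and temperature here are illustrative assumptions, not the paper's implementation.

```python
import torch
import torch.nn.functional as F

def gumbel_softmax_sample(logits: torch.Tensor, tau: float = 1.0) -> torch.Tensor:
    """Draw a differentiable (soft) sample from Categorical(softmax(logits)).

    Gumbel-max sampling takes argmax(logits + g), g ~ Gumbel(0, 1), which is
    not differentiable; replacing argmax with a temperature-controlled softmax
    yields the relaxed (Gumbel-softmax) sample usable in backpropagation.
    """
    gumbel_noise = -torch.log(-torch.log(torch.rand_like(logits) + 1e-20) + 1e-20)
    return F.softmax((logits + gumbel_noise) / tau, dim=-1)

# Illustrative assumption: model a 4-bit ADC as a soft assignment of an
# analog bit-line readout to one of 16 discrete output codes.
levels = torch.linspace(0.0, 1.0, steps=16)           # hypothetical ADC codes
analog = torch.rand(8, requires_grad=True)            # hypothetical BL readouts

# Logits favor the code nearest each analog value (negative squared distance).
logits = -(analog.unsqueeze(-1) - levels) ** 2 / 0.01

soft_onehot = gumbel_softmax_sample(logits, tau=0.5)  # shape (8, 16)
quantized = soft_onehot @ levels                      # relaxed ADC output

quantized.sum().backward()                            # gradients flow to `analog`
print(quantized, analog.grad)
```

Because the relaxed sample is a soft one-hot vector rather than a hard argmax, gradients propagate through the sampling step during training, while at inference the hard argmax can be used to recover the true discrete ADC code.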
Keywords