Frontiers in Neuroscience (Apr 2021)

In situ Parallel Training of Analog Neural Network Using Electrochemical Random-Access Memory

  • Yiyang Li,
  • T. Patrick Xiao,
  • Christopher H. Bennett,
  • Erik Isele,
  • Armantas Melianas,
  • Hanbo Tao,
  • Matthew J. Marinella,
  • Alberto Salleo,
  • Elliot J. Fuller,
  • A. Alec Talin

DOI
https://doi.org/10.3389/fnins.2021.636127
Journal volume & issue
Vol. 15

Abstract

Read online

In-memory computing based on non-volatile resistive memory can significantly improve the energy efficiency of artificial neural networks. However, accurate in situ training has been challenging due to the nonlinear and stochastic switching of the resistive memory elements. One promising analog memory is the electrochemical random-access memory (ECRAM), also known as the redox transistor. Its low write currents and linear switching properties across hundreds of analog states enable accurate and massively parallel updates of a full crossbar array, which yield rapid and energy-efficient training. While simulations predict that ECRAM based neural networks achieve high training accuracy at significantly higher energy efficiency than digital implementations, these predictions have not been experimentally achieved. In this work, we train a 3 × 3 array of ECRAM devices that learns to discriminate several elementary logic gates (AND, OR, NAND). We record the evolution of the network’s synaptic weights during parallel in situ (on-line) training, with outer product updates. Due to linear and reproducible device switching characteristics, our crossbar simulations not only accurately simulate the epochs to convergence, but also quantitatively capture the evolution of weights in individual devices. The implementation of the first in situ parallel training together with strong agreement with simulation results provides a significant advance toward developing ECRAM into larger crossbar arrays for artificial neural network accelerators, which could enable orders of magnitude improvements in energy efficiency of deep neural networks.

Keywords