Combining backpropagation with Equilibrium Propagation to improve an Actor-Critic reinforcement learning framework

Yoshimasa Kubo; Eric Chalmers; Artur Luczak

doi:10.3389/fncom.2022.980613

Frontiers in Computational Neuroscience (Aug 2022)

Combining backpropagation with Equilibrium Propagation to improve an Actor-Critic reinforcement learning framework

Yoshimasa Kubo,
Eric Chalmers,
Artur Luczak

Affiliations

Yoshimasa Kubo: Canadian Centre for Behavioural Neuroscience, University of Lethbridge, Lethbridge, AB, Canada
Eric Chalmers: Department of Mathematics and Computing, Mount Royal University, Calgary, AB, Canada
Artur Luczak: Canadian Centre for Behavioural Neuroscience, University of Lethbridge, Lethbridge, AB, Canada

DOI: https://doi.org/10.3389/fncom.2022.980613
Journal volume & issue: Vol. 16

Abstract

Read online

Backpropagation (BP) has been used to train neural networks for many years, allowing them to solve a wide variety of tasks like image classification, speech recognition, and reinforcement learning tasks. But the biological plausibility of BP as a mechanism of neural learning has been questioned. Equilibrium Propagation (EP) has been proposed as a more biologically plausible alternative and achieves comparable accuracy on the CIFAR-10 image classification task. This study proposes the first EP-based reinforcement learning architecture: an Actor-Critic architecture with the actor network trained by EP. We show that this model can solve the basic control tasks often used as benchmarks for BP-based models. Interestingly, our trained model demonstrates more consistent high-reward behavior than a comparable model trained exclusively by BP.

Published in Frontiers in Computational Neuroscience

ISSN: 1662-5188 (Online)
Publisher: Frontiers Media S.A.
Country of publisher: Switzerland
LCC subjects: Medicine: Internal medicine: Neurosciences. Biological psychiatry. Neuropsychiatry
Website: http://www.frontiersin.org/computational_neuroscience

About the journal

Abstract

Keywords