Learning with sparse reward in a gap junction network inspired by the insect mushroom body.

Tianqi Wei; Qinghai Guo; Barbara Webb

doi:10.1371/journal.pcbi.1012086

PLoS Computational Biology (May 2024)

Learning with sparse reward in a gap junction network inspired by the insect mushroom body.

Tianqi Wei,
Qinghai Guo,
Barbara Webb

Affiliations

Tianqi Wei
Qinghai Guo
Barbara Webb

DOI: https://doi.org/10.1371/journal.pcbi.1012086
Journal volume & issue: Vol. 20, no. 5
p. e1012086

Abstract

Read online

Animals can learn in real-life scenarios where rewards are often only available when a goal is achieved. This 'distal' or 'sparse' reward problem remains a challenge for conventional reinforcement learning algorithms. Here we investigate an algorithm for learning in such scenarios, inspired by the possibility that axo-axonal gap junction connections, observed in neural circuits with parallel fibres such as the insect mushroom body, could form a resistive network. In such a network, an active node represents the task state, connections between nodes represent state transitions and their connection to actions, and current flow to a target state can guide decision making. Building on evidence that gap junction weights are adaptive, we propose that experience of a task can modulate the connections to form a graph encoding the task structure. We demonstrate that the approach can be used for efficient reinforcement learning under sparse rewards, and discuss whether it is plausible as an account of the insect mushroom body.

Published in PLoS Computational Biology

ISSN: 1553-734X (Print); 1553-7358 (Online)
Publisher: Public Library of Science (PLoS)
Country of publisher: United States
LCC subjects: Science: Biology (General)
Website: https://journals.plos.org/ploscompbiol/

About the journal