Optimism and pessimism in optimised replay.

Georgy Antonov; Christopher Gagne; Eran Eldar; Peter Dayan

doi:10.1371/journal.pcbi.1009634

PLoS Computational Biology (Jan 2022)

Optimism and pessimism in optimised replay.

Georgy Antonov,
Christopher Gagne,
Eran Eldar,
Peter Dayan

Affiliations

Georgy Antonov
Christopher Gagne
Eran Eldar
Peter Dayan

DOI: https://doi.org/10.1371/journal.pcbi.1009634
Journal volume & issue: Vol. 18, no. 1
p. e1009634

Abstract

Read online

The replay of task-relevant trajectories is known to contribute to memory consolidation and improved task performance. A wide variety of experimental data show that the content of replayed sequences is highly specific and can be modulated by reward as well as other prominent task variables. However, the rules governing the choice of sequences to be replayed still remain poorly understood. One recent theoretical suggestion is that the prioritization of replay experiences in decision-making problems is based on their effect on the choice of action. We show that this implies that subjects should replay sub-optimal actions that they dysfunctionally choose rather than optimal ones, when, by being forgetful, they experience large amounts of uncertainty in their internal models of the world. We use this to account for recent experimental data demonstrating exactly pessimal replay, fitting model parameters to the individual subjects' choices.

Published in PLoS Computational Biology

ISSN: 1553-734X (Print); 1553-7358 (Online)
Publisher: Public Library of Science (PLoS)
Country of publisher: United States
LCC subjects: Science: Biology (General)
Website: https://journals.plos.org/ploscompbiol/

About the journal