A new model of decision processing in instrumental learning tasks

Steven Miletić; Russell J Boag; Anne C Trutti; Niek Stevenson; Birte U Forstmann; Andrew Heathcote

doi:10.7554/eLife.63055

eLife (Jan 2021)

A new model of decision processing in instrumental learning tasks

Steven Miletić,
Russell J Boag,
Anne C Trutti,
Niek Stevenson,
Birte U Forstmann,
Andrew Heathcote

Affiliations

Steven Miletić: ORCiD; University of Amsterdam, Department of Psychology, Amsterdam, Netherlands
Russell J Boag: ORCiD; University of Amsterdam, Department of Psychology, Amsterdam, Netherlands
Anne C Trutti: ORCiD; University of Amsterdam, Department of Psychology, Amsterdam, Netherlands; Leiden University, Department of Psychology, Leiden, Netherlands
Niek Stevenson: ORCiD; University of Amsterdam, Department of Psychology, Amsterdam, Netherlands
Birte U Forstmann: ORCiD; University of Amsterdam, Department of Psychology, Amsterdam, Netherlands
Andrew Heathcote: ORCiD; University of Amsterdam, Department of Psychology, Amsterdam, Netherlands; University of Newcastle, School of Psychology, Newcastle, Australia

DOI: https://doi.org/10.7554/eLife.63055
Journal volume & issue: Vol. 10

Abstract

Read online

Learning and decision-making are interactive processes, yet cognitive modeling of error-driven learning and decision-making have largely evolved separately. Recently, evidence accumulation models (EAMs) of decision-making and reinforcement learning (RL) models of error-driven learning have been combined into joint RL-EAMs that can in principle address these interactions. However, we show that the most commonly used combination, based on the diffusion decision model (DDM) for binary choice, consistently fails to capture crucial aspects of response times observed during reinforcement learning. We propose a new RL-EAM based on an advantage racing diffusion (ARD) framework for choices among two or more options that not only addresses this problem but captures stimulus difficulty, speed-accuracy trade-off, and stimulus-response-mapping reversal effects. The RL-ARD avoids fundamental limitations imposed by the DDM on addressing effects of absolute values of choices, as well as extensions beyond binary choice, and provides a computationally tractable basis for wider applications.

Published in eLife

ISSN: 2050-084X (Online)
Publisher: eLife Sciences Publications Ltd
Country of publisher: United Kingdom
LCC subjects: Medicine; Science: Biology (General)
Website: https://elifesciences.org

About the journal

Abstract

Keywords