Nature Communications (Oct 2023)
Neural and computational underpinnings of biased confidence in human reinforcement learning
Abstract
While navigating a fundamentally uncertain world, humans and animals constantly evaluate the probability of their decisions, actions or statements being correct. When explicitly elicited, these confidence estimates typically correlate positively with neural activity in a ventromedial-prefrontal (VMPFC) network and negatively with activity in a dorsolateral and dorsomedial prefrontal network. Here, combining fMRI with a reinforcement-learning paradigm, we leverage the fact that humans are more confident in their choices when seeking gains than when avoiding losses to reveal a functional dissociation: whereas the dorsal prefrontal network correlates negatively with a condition-specific confidence signal, the VMPFC network positively encodes a task-wide confidence signal incorporating the valence-induced bias. Challenging dominant neuro-computational models, we found that decision-related VMPFC activity correlates better with confidence than with option values inferred from reinforcement-learning models. Altogether, these results identify the VMPFC as a key node in the neuro-computational architecture that builds global feeling-of-confidence signals from latent decision variables and contextual biases during reinforcement learning.
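To illustrate the kind of computational setup the abstract refers to, the sketch below is a minimal, hypothetical Python simulation (not the authors' actual model or analysis code) of a delta-rule (Q-learning) agent whose trial-by-trial confidence is read out from the learned option values, with an additive bias term standing in for the higher confidence reported in gain-seeking versus loss-avoidance contexts. All function names, parameter values and the specific confidence readout are illustrative assumptions.

    import numpy as np

    rng = np.random.default_rng(0)

    def softmax_choice_prob(dq, beta):
        # Probability of choosing option 1 given the value difference dq = Q1 - Q0
        return 1.0 / (1.0 + np.exp(-beta * dq))

    def simulate_context(n_trials, reward_probs, outcomes, alpha=0.3, beta=5.0, conf_bias=0.0):
        # Q-learning in a two-option context; confidence is proxied by the probability
        # assigned to the chosen option, plus an additive valence-dependent bias
        # (conf_bias), which is an assumption made for illustration only.
        q = np.zeros(2)                      # option values
        confidences = []
        for _ in range(n_trials):
            p1 = softmax_choice_prob(q[1] - q[0], beta)
            choice = int(rng.random() < p1)
            p_choice = p1 if choice == 1 else 1.0 - p1
            confidences.append(float(np.clip(p_choice + conf_bias, 0.0, 1.0)))
            # Outcome drawn from the chosen option's reward probability
            reward = outcomes[int(rng.random() < reward_probs[choice])]
            q[choice] += alpha * (reward - q[choice])   # delta-rule update
        return float(np.mean(confidences))

    # Gain context (outcomes 0 or +1) vs. loss context (outcomes -1 or 0),
    # with a small confidence boost assumed in the gain context:
    mean_conf_gain = simulate_context(100, [0.25, 0.75], outcomes=(0.0, 1.0), conf_bias=0.10)
    mean_conf_loss = simulate_context(100, [0.25, 0.75], outcomes=(-1.0, 0.0), conf_bias=0.0)
    print(f"mean confidence, gain-seeking:  {mean_conf_gain:.2f}")
    print(f"mean confidence, loss-avoidance: {mean_conf_loss:.2f}")

In this toy setting, the agent's condition-specific confidence depends only on the learned value difference within each context, whereas the task-wide confidence (with the bias added) differs between gain and loss contexts, mirroring the dissociation described in the abstract.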