Max Planck UCL Centre for Computational Psychiatry and Ageing Research, London, United Kingdom; Wellcome Trust Centre for Neuroimaging, University College London, London, United Kingdom
Johanna Habicht
Max Planck UCL Centre for Computational Psychiatry and Ageing Research, London, United Kingdom; Wellcome Trust Centre for Neuroimaging, University College London, London, United Kingdom
Jochen Michely
Max Planck UCL Centre for Computational Psychiatry and Ageing Research, London, United Kingdom; Wellcome Trust Centre for Neuroimaging, University College London, London, United Kingdom; Department of Psychiatry and Psychotherapy, Charité – Universitätsmedizin Berlin, Berlin, Germany
Max Planck UCL Centre for Computational Psychiatry and Ageing Research, London, United Kingdom; Wellcome Trust Centre for Neuroimaging, University College London, London, United Kingdom
Max Planck UCL Centre for Computational Psychiatry and Ageing Research, London, United Kingdom; Wellcome Trust Centre for Neuroimaging, University College London, London, United Kingdom
Max Planck UCL Centre for Computational Psychiatry and Ageing Research, London, United Kingdom; Wellcome Trust Centre for Neuroimaging, University College London, London, United Kingdom
An exploration-exploitation trade-off, the arbitration between sampling a lesser-known against a known rich option, is thought to be solved using computationally demanding exploration algorithms. Given known limitations in human cognitive resources, we hypothesised the presence of additional cheaper strategies. We examined for such heuristics in choice behaviour where we show this involves a value-free random exploration, that ignores all prior knowledge, and a novelty exploration that targets novel options alone. In a double-blind, placebo-controlled drug study, assessing contributions of dopamine (400 mg amisulpride) and noradrenaline (40 mg propranolol), we show that value-free random exploration is attenuated under the influence of propranolol, but not under amisulpride. Our findings demonstrate that humans deploy distinct computationally cheap exploration strategies and that value-free random exploration is under noradrenergic control.