PLoS Computational Biology (Nov 2006)

Humans can adopt optimal discounting strategy under real-time constraints.

  • N Schweighofer,
  • K Shishida,
  • C E Han,
  • Y Okamoto,
  • S C Tanaka,
  • S Yamawaki,
  • K Doya

DOI
https://doi.org/10.1371/journal.pcbi.0020152
Journal volume & issue
Vol. 2, no. 11
p. e152

Abstract

Critical to our many daily choices between larger delayed rewards and smaller, more immediate rewards are the shape and steepness of the function that discounts rewards with time. Although research in artificial intelligence favors exponential discounting in uncertain environments, studies with humans and animals have consistently shown hyperbolic discounting. We investigated how humans perform in a reward decision task with temporal constraints, in which each choice affects the time remaining for later trials and in which the delays vary from trial to trial. We demonstrated that most of our subjects adopted exponential discounting in this experiment. Further, we confirmed analytically that exponential discounting, with a decay rate comparable to that used by our subjects, maximized the total reward gain in our task. Our results suggest that the particular shape and steepness of temporal discounting are determined by the task that the subject is facing, and they call into question the notion of hyperbolic reward discounting as a universal principle.
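
To make the contrast between the two discounting shapes concrete, the following is a minimal sketch that assumes the common textbook forms: exponential, V = R exp(-kD), and hyperbolic, V = R / (1 + kD). The abstract does not state the exact parameterization used in the study, so the function names, the parameter k, and all numeric values here are illustrative assumptions, not the authors' model or data.

```python
import math


def exponential_value(reward, delay, k):
    """Discounted value under the assumed exponential form: R * exp(-k * D)."""
    return reward * math.exp(-k * delay)


def hyperbolic_value(reward, delay, k):
    """Discounted value under the assumed hyperbolic form: R / (1 + k * D)."""
    return reward / (1.0 + k * delay)


def prefers_delayed(value_fn, small, small_delay, large, large_delay, k):
    """Return True if the larger, later reward has the higher discounted value."""
    return value_fn(large, large_delay, k) > value_fn(small, small_delay, k)


if __name__ == "__main__":
    # Hypothetical choice: 5 units after 1 time step vs. 10 units after 5 steps.
    for k in (0.1, 1.0):
        for name, fn in (("exponential", exponential_value),
                         ("hyperbolic", hyperbolic_value)):
            choice = "delayed" if prefers_delayed(fn, 5, 1, 10, 5, k) else "immediate"
            print(f"k={k:<4} {name:<12} -> {choice}")
```

Under the exponential form the value shrinks by the same factor, exp(-k), for every added unit of delay, whereas under the hyperbolic form the proportional loss per unit of delay gets smaller as the delay grows. The steepness parameter k in this sketch plays the role of the decay rate mentioned in the abstract.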