Boredom-Driven Curious Learning by Homeo-Heterostatic Value Gradients

Yen Yu; Acer Y. C. Chang; Ryota Kanai

doi:10.3389/fnbot.2018.00088

Frontiers in Neurorobotics (Jan 2019)

Boredom-Driven Curious Learning by Homeo-Heterostatic Value Gradients

Yen Yu,
Acer Y. C. Chang,
Ryota Kanai

Affiliations

Yen Yu
Acer Y. C. Chang
Ryota Kanai

DOI: https://doi.org/10.3389/fnbot.2018.00088
Journal volume & issue: Vol. 12

Abstract

Read online

This paper presents the Homeo-Heterostatic Value Gradients (HHVG) algorithm as a formal account on the constructive interplay between boredom and curiosity which gives rise to effective exploration and superior forward model learning. We offer an instrumental view of action selection, in which an action serves to disclose outcomes that have intrinsic meaningfulness to an agent itself. This motivated two central algorithmic ingredients: devaluation and devaluation progress, both underpin agent's cognition concerning intrinsically generated rewards. The two serve as an instantiation of homeostatic and heterostatic intrinsic motivation. A key insight from our algorithm is that the two seemingly opposite motivations can be reconciled—without which exploration and information-gathering cannot be effectively carried out. We supported this claim with empirical evidence, showing that boredom-enabled agents consistently outperformed other curious or explorative agent variants in model building benchmarks based on self-assisted experience accumulation.

Published in Frontiers in Neurorobotics

ISSN: 1662-5218 (Online)
Publisher: Frontiers Media S.A.
Country of publisher: Switzerland
LCC subjects: Medicine: Internal medicine: Neurosciences. Biological psychiatry. Neuropsychiatry
Website: https://www.frontiersin.org/journals/neurorobotics/

About the journal

Abstract

Keywords