npj Science of Learning (Jun 2021)

A micro-genesis account of longer-form reinforcement learning in structured and unstructured environments

  • Benjamin James Dyson,
  • Ahad Asad

DOI
https://doi.org/10.1038/s41539-021-00098-4
Journal volume & issue
Vol. 6, no. 1
pp. 1 – 5

Abstract

Read online

Abstract We explored the possibility that in order for longer-form expressions of reinforcement learning (win-calmness, loss-restlessness) to manifest across tasks, they must first develop because of micro-transactions within tasks. We found no evidence of win-calmness or loss-restlessness when wins could not be maximised (unexploitable opponents), nor when the threat of win minimisation was presented (exploiting opponents), but evidence of win-calmness (but not loss-restlessness) when wins could be maximised (exploitable opponents).