SICE Journal of Control, Measurement, and System Integration (Dec 2023)

A tutorial introduction to reinforcement learning

  • Mathukumalli Vidyasagar

DOI
https://doi.org/10.1080/18824889.2023.2196033
Journal volume & issue
Vol. 16, no. 1
pp. 172 – 191

Abstract

In this paper, we present a brief survey of reinforcement learning, with particular emphasis on stochastic approximation (SA) as a unifying theme. The scope of the paper includes Markov reward processes, Markov decision processes, SA algorithms, and widely used algorithms such as temporal difference learning and Q-learning.

Keywords