State of the Art of Adaptive Dynamic Programming and Reinforcement Learning

Derong Liu; Mingming Ha; Shan Xue

doi:10.26599/AIR.2022.9150007

CAAI Artificial Intelligence Research (Dec 2022)

State of the Art of Adaptive Dynamic Programming and Reinforcement Learning

Derong Liu,
Mingming Ha,
Shan Xue

Affiliations

Derong Liu: Department of Mechanical and Energy Engineering, Southern University of Science and Technology, Shenzhen 518055, China
Mingming Ha: School of Automation and Electrical Engineering, University of Science and Technology Beijing, Beijing 100083, China
Shan Xue: School of Computer Science and Engineering, South China University of Technology, Guangzhou 510006, China

DOI: https://doi.org/10.26599/AIR.2022.9150007
Journal volume & issue: Vol. 1, no. 2
pp. 93 – 110

Abstract

Read online

This article introduces the state-of-the-art development of adaptive dynamic programming and reinforcement learning (ADPRL). First, algorithms in reinforcement learning (RL) are introduced and their roots in dynamic programming are illustrated. Adaptive dynamic programming (ADP) is then introduced following a brief discussion of dynamic programming. Researchers in ADP and RL have enjoyed the fast developments of the past decade from algorithms, to convergence and optimality analyses, and to stability results. Several key steps in the recent theoretical developments of ADPRL are mentioned with some future perspectives. In particular, convergence and optimality results of value iteration and policy iteration are reviewed, followed by an introduction to the most recent results on stability analysis of value iteration algorithms.

Published in CAAI Artificial Intelligence Research

ISSN: 2097-194X (Print); 2097-3691 (Online)
Publisher: Tsinghua University Press
Country of publisher: China
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://www.sciopen.com/journal/2097-194X

About the journal

Abstract

Keywords