IET Control Theory & Applications (Apr 2024)

An efficient model‐free adaptive optimal control of continuous‐time nonlinear non‐zero‐sum games based on integral reinforcement learning with exploration

  • Lei Guo,
  • Wenbo Xiong,
  • Yuan Song,
  • Dongming Gan

DOI
https://doi.org/10.1049/cth2.12610
Journal volume & issue
Vol. 18, no. 6
pp. 748–763

Abstract


To reduce learning time and memory usage, this study presents a novel model‐free algorithm for obtaining the Nash equilibrium solution of continuous‐time nonlinear non‐zero‐sum games. Based on the integral reinforcement learning method, a new integral Hamilton–Jacobi (HJ) equation is proposed that quickly and cooperatively determines the Nash equilibrium strategies of all players. By leveraging neural network approximation and the gradient descent method, simultaneous continuous‐time adaptive tuning laws are provided for both critic and actor neural network weights. These laws enable estimation of the optimal value function and optimal policy without requiring knowledge or identification of the system's dynamics. Closed‐loop system stability and convergence of the weights are guaranteed through Lyapunov analysis. Additionally, the algorithm is enhanced to reduce the number of auxiliary neural networks used in the critic. Simulation results for a two‐player non‐zero‐sum game validate the effectiveness of the proposed algorithm.
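To give a concrete feel for the integral reinforcement learning idea the abstract describes, the following is a minimal illustrative sketch, not the paper's actual tuning laws: a critic with a linear-in-weights value approximation V(x) ≈ Wᵀφ(x) is updated by gradient descent on the squared integral Bellman residual, using only sampled states and an accumulated cost over an interval (no system dynamics). The basis φ, the toy transition, and the step size are all hypothetical choices for illustration.

```python
import numpy as np

def phi(x):
    # Hypothetical quadratic basis for a 2-state system.
    x1, x2 = x
    return np.array([x1**2, x1 * x2, x2**2])

def critic_update(W, x_t, x_next, integral_cost, lr=0.1):
    """One gradient-descent step on 0.5 * e**2, where the integral
    Bellman residual is e = integral_cost + W^T phi(x_next) - W^T phi(x_t).
    Only measured states and the running cost are needed, not dynamics."""
    d = phi(x_next) - phi(x_t)   # change of the basis over the interval
    e = integral_cost + W @ d    # integral Bellman residual
    return W - lr * e * d        # descend the gradient of 0.5 * e**2

# Toy usage: repeatedly update on one sampled transition and check
# that the residual is driven toward zero.
W = np.zeros(3)
x_t, x_next, cost = np.array([1.0, 0.5]), np.array([0.8, 0.3]), 0.2
for _ in range(200):
    W = critic_update(W, x_t, x_next, cost)
residual = cost + W @ (phi(x_next) - phi(x_t))
```

In the paper's setting there would be one such critic (and an actor) per player, tuned simultaneously in continuous time; the discrete loop above only illustrates why the integral residual can be evaluated model-free.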

Keywords