Stability-certified reinforcement learning control via spectral normalization

Ryoichi Takase; Nobuyuki Yoshikawa; Toshisada Mariyama; Takeshi Tsuchiya

Machine Learning with Applications (Dec 2022)

Stability-certified reinforcement learning control via spectral normalization

Ryoichi Takase,
Nobuyuki Yoshikawa,
Toshisada Mariyama,
Takeshi Tsuchiya

Affiliations

Ryoichi Takase: Department of Aeronautics and Astronautics, The University of Tokyo, Tokyo, Japan; Corresponding author.
Nobuyuki Yoshikawa: Information Technology R&D Center, Mitsubishi Electric Corporation, Kanagawa, Japan
Toshisada Mariyama: Information Technology R&D Center, Mitsubishi Electric Corporation, Kanagawa, Japan
Takeshi Tsuchiya: Department of Aeronautics and Astronautics, The University of Tokyo, Tokyo, Japan

Journal volume & issue: Vol. 10
p. 100409

Abstract

Read online

In this study, two types of methods from different perspectives based on spectral normalization (SN) are described for ensuring the stability of a feedback system controlled by a neural network (NN). The first one is that the L2gain of the feedback system is bounded less than 1 to satisfy a stability condition derived from the small-gain theorem. When explicitly including the stability condition, the first type of method may provide an insufficient performance on the NN controller due to its strict stability condition. To overcome this difficulty, the second type of method is proposed, ensuring local stability with a larger region of attraction. In this second type, the stability is ensured by solving linear matrix inequalities after training the NN controller. SN improves the feasibility of the a posteriori stability test by constructing tighter local sectors. Numerical experiments show that the second type of method provides sufficient performance compared with the first one and ensures sufficient stability compared with existing reinforcement learning algorithms.11 Project page: https://sites.google.com/g.ecc.u-tokyo.ac.jp/stability-certified-rl-via-sn.

Published in Machine Learning with Applications

ISSN: 2666-8270 (Online)
Publisher: Elsevier
Country of publisher: United Kingdom
LCC subjects: Science: Science (General): Cybernetics; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://www.journals.elsevier.com/machine-learning-with-applications

About the journal

Abstract

Keywords