A Reliability Quantification Method for Deep Reinforcement Learning-Based Control

Hitoshi Yoshioka; Hirotada Hashimoto

doi:10.3390/a17070314

Algorithms (Jul 2024)

A Reliability Quantification Method for Deep Reinforcement Learning-Based Control

Hitoshi Yoshioka,
Hirotada Hashimoto

Affiliations

Hitoshi Yoshioka: Graduate School of Engineering, Osaka Metropolitan University, Sakai 599-8531, Osaka, Japan
Hirotada Hashimoto: Graduate School of Engineering, Osaka Metropolitan University, Sakai 599-8531, Osaka, Japan

DOI: https://doi.org/10.3390/a17070314
Journal volume & issue: Vol. 17, no. 7
p. 314

Abstract

Read online

Reliability quantification of deep reinforcement learning (DRL)-based control is a significant challenge for the practical application of artificial intelligence (AI) in safety-critical systems. This study proposes a method for quantifying the reliability of DRL-based control. First, an existing method, random network distillation, was applied to the reliability evaluation to clarify the issues to be solved. Second, a novel method for reliability quantification was proposed to solve these issues. The reliability is quantified using two neural networks: a reference and an evaluator. They have the same structure with the same initial parameters. The outputs of the two networks were the same before training. During training, the evaluator network parameters were updated to maximize the difference between the reference and evaluator networks for trained data. Thus, the reliability of the DRL-based control for a state can be evaluated based on the difference in output between the two networks. The proposed method was applied to DRL-based controls as an example of a simple task, and its effectiveness was demonstrated. Finally, the proposed method was applied to the problem of switching trained models depending on the state. Consequently, the performance of the DRL-based control was improved by switching the trained models according to their reliability.

Published in Algorithms

ISSN: 1999-4893 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Technology (General): Industrial engineering. Management engineering; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://www.mdpi.com/journal/algorithms

About the journal

Abstract

Keywords