Towards Human-Level Safe Reinforcement Learning in Atari Library

Afriyadi Afriyadi; Wiranto Herry Utomo

doi:10.32736/sisfokom.v12i3.1739

Jurnal Sisfokom (Nov 2023)

Towards Human-Level Safe Reinforcement Learning in Atari Library

Afriyadi Afriyadi,
Wiranto Herry Utomo

Affiliations

Afriyadi Afriyadi: Faculty of Computing, President University
Wiranto Herry Utomo: Faculty of Computing, President University

DOI: https://doi.org/10.32736/sisfokom.v12i3.1739
Journal volume & issue: Vol. 12, no. 3
pp. 378 – 393

Abstract

Read online

Reinforcement learning (RL) is a powerful tool for training agents to perform complex tasks. However, from time-to-time RL agents often learn to behave in unsafe or unintended ways. This is especially true during the exploration phase, when the agent is trying to learn about its environment. This research acquires safe exploration methods from the field of robotics and evaluates their effectiveness compared to other algorithms that are commonly used in complex videogame environments without safe exploration. We also propose a method for hand-crafting catastrophic states, which are states that are known to be unsafe for the agent to visit. Our results show that our method and our hand-crafted safety constraints outperform state-of-the-art algorithms on relatively certain iterations. This means that our method is able to learn to behave safely while still achieving good performance. These results have implications for the future development of human-level safe learning with combination of model-based RL using complex videogame environments. By developing safe exploration methods, we can help to ensure that RL agents can be used in a variety of real-world applications, such as self-driving cars and robotics.

Published in Jurnal Sisfokom

ISSN: 2301-7988 (Print); 2581-0588 (Online)
Publisher: LPPM ISB Atma Luhur
Country of publisher: Indonesia
LCC subjects: Technology: Technology (General): Industrial engineering. Management engineering: Information technology
Website: http://jurnal.atmaluhur.ac.id/index.php/sisfokom/

About the journal

Abstract

Keywords