IEEE Access (Jan 2024)

Comparative Analysis of Reinforcement Learning Algorithms for Bipedal Robot Locomotion

  • Omur Aydogmus
  • Musa Yilmaz

DOI
https://doi.org/10.1109/ACCESS.2023.3344393
Journal volume & issue
Vol. 12
pp. 7490–7499

Abstract

In this research, an optimization methodology was introduced for improving bipedal robot locomotion controlled by reinforcement learning (RL) algorithms. Specifically, the study focused on optimizing the Proximal Policy Optimization (PPO), Advantage Actor-Critic (A2C), Soft Actor-Critic (SAC), and Twin Delayed Deep Deterministic Policy Gradient (TD3) algorithms. The optimization process used the Tree-structured Parzen Estimator (TPE), a Bayesian optimization technique. All RL algorithms were applied to the same environment, the BipedalWalker environment from the OpenAI Gym framework. The optimization involved fine-tuning key hyperparameters, including the learning rate, discount factor, generalized advantage estimation (GAE) parameter, entropy coefficient, and Polyak update coefficient. The study comprehensively analyzed the impact of these hyperparameters on the performance of the RL algorithms. The results were promising: the fine-tuned RL algorithms demonstrated significant performance improvements. The mean reward values over 10 trials were as follows: PPO achieved an average reward of 181.3, A2C obtained an average reward of −122.2, SAC reached an average reward of 320.3, and TD3 had an average reward of 278.6. These outcomes underscore the effectiveness of the optimization approach in enhancing the locomotion capabilities of the bipedal robot using RL techniques.
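
The abstract does not name the software stack, but the described pipeline (a TPE-driven Bayesian search over RL hyperparameters on BipedalWalker) can be sketched compactly. The Python sketch below is a minimal, hypothetical illustration assuming Stable-Baselines3 for the RL algorithm and Optuna for TPE; the search ranges, training budget, and trial count shown are illustrative assumptions, not values from the paper.

import gymnasium as gym
import optuna
from stable_baselines3 import PPO
from stable_baselines3.common.evaluation import evaluate_policy

def objective(trial: optuna.Trial) -> float:
    # Sample the hyperparameters named in the abstract; the ranges are illustrative.
    params = {
        "learning_rate": trial.suggest_float("learning_rate", 1e-5, 1e-3, log=True),
        "gamma": trial.suggest_float("gamma", 0.9, 0.9999),         # discount factor
        "gae_lambda": trial.suggest_float("gae_lambda", 0.8, 1.0),  # GAE parameter
        "ent_coef": trial.suggest_float("ent_coef", 1e-8, 0.1, log=True),
    }
    env = gym.make("BipedalWalker-v3")
    model = PPO("MlpPolicy", env, verbose=0, **params)
    model.learn(total_timesteps=200_000)  # reduced training budget for illustration
    mean_reward, _ = evaluate_policy(model, env, n_eval_episodes=10)
    env.close()
    return mean_reward

# TPE is Optuna's default sampler; it is set explicitly here for clarity.
study = optuna.create_study(direction="maximize",
                            sampler=optuna.samplers.TPESampler(seed=0))
study.optimize(objective, n_trials=25)
print("best hyperparameters:", study.best_params)

For the off-policy SAC and TD3 runs, the GAE parameter does not apply; instead the Polyak update coefficient would be sampled analogously, e.g. a hypothetical "tau": trial.suggest_float("tau", 0.001, 0.1, log=True) entry passed to the SAC or TD3 constructor.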

Keywords