Efficient Hindsight Experience Replay with Transformed Data Augmentation

Jiazheng Sun; Weiguang Li

doi:10.6180/jase.202402_27(2).0011

Journal of Applied Science and Engineering (Aug 2023)

Efficient Hindsight Experience Replay with Transformed Data Augmentation

Jiazheng Sun,
Weiguang Li

Affiliations

Jiazheng Sun: School of Mechanical and Automotive Engineering, South China University of Technology Guangzhou, Guangdong, China
Weiguang Li: School of Mechanical and Automotive Engineering, South China University of Technology Guangzhou, Guangdong, China

DOI: https://doi.org/10.6180/jase.202402_27(2).0011
Journal volume & issue: Vol. 27, no. 1
pp. 2097 – 2108

Abstract

Read online

Motion control of robots is a high-dimensional, nonlinear control problem that is often difficult to handle using traditional dynamical path planning means. Reinforcement learning is currently an effective means to solve robot motion control problems, but reinforcement learning has disadvantages such as high number of trials and errors and sparse rewards, which restrict the application efficiency of reinforcement learning. The Hindsight Experience Replay(HER) algorithm is a reinforcement learning algorithm that solves the reward sparsity problem by constructing virtual target values. However, the HER algorithm still suffers from the problem of long time in the early stage of training, and there is still room for improving its sample utilization efficiency. Augmentation by existing data to improve training efficiency has been widely used in supervised learning, but is less applied in the field of reinforcement learning. In this paper, we propose the Hindsight Experience Replay with Transformed Data Augmentation (TDAHER) algorithm by constructing a transformed data augmentation method for reinforcement learning samples, combined with the HER algorithm. And in order to solve the problem of the accuracy of the augmented samples in the later stage of training, the decaying participation factor method is introduced. After the comparison of four simulated robot control tasks, it is proved that the algorithm can effectively improve the training efficiency of reinforcement learning.

Published in Journal of Applied Science and Engineering

ISSN: 2708-9967 (Print); 2708-9975 (Online)
Publisher: Tamkang University Press
Country of publisher: Taiwan, Province of China
LCC subjects: Technology: Engineering (General). Civil engineering (General); Technology: Chemical technology: Chemical engineering; Science: Physics
Website: http://jase.tku.edu.tw/

About the journal

Abstract

Keywords