Learning an Accurate State Transition Dynamics Model by Fitting Both a Function and its Derivative

Youngho Kim; Hoosang Lee; Jeha Ryu

doi:10.1109/ACCESS.2022.3169798

IEEE Access (Jan 2022)

Learning an Accurate State Transition Dynamics Model by Fitting Both a Function and its Derivative

Youngho Kim,
Hoosang Lee,
Jeha Ryu

Affiliations

Youngho Kim: ORCiD; Robot Reinforcement Learning Laboratory, School of Integrated Technology, Gwangju Institute of Science and Technology, Gwangju, South Korea
Hoosang Lee: ORCiD; Robot Reinforcement Learning Laboratory, School of Integrated Technology, Gwangju Institute of Science and Technology, Gwangju, South Korea
Jeha Ryu: ORCiD; Robot Reinforcement Learning Laboratory, School of Integrated Technology, Gwangju Institute of Science and Technology, Gwangju, South Korea

DOI: https://doi.org/10.1109/ACCESS.2022.3169798
Journal volume & issue: Vol. 10
pp. 44248 – 44258

Abstract

Read online

Learning accurate state transition dynamics model in a sample-efficient way is important to predict the future states from the current states and actions of a system both accurately and efficiently in model-based reinforcement learning for many robotic applications. This study proposes a sample-efficient learning approach that can accurately learn a state transition dynamics model by fitting both the predicted next states and their derivatives. The derivatives of the feedforward neural network output (next states) with respect to the inputs (current states and actions) are computed using chain rules. In addition, the effect of the activation functions on the learning derivatives are illustrated via sum of elementary sine functions example and the values are compared with various other activation functions with respect to accuracy. The proposed learning approach exhibits significant improvement in accuracy for both one-step and multi-step prediction cases with a six-degree-of-freedom manipulation robot (UR-10) in both simulation and real environments.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords