Motion planning framework based on dual-agent DDPG method for dual-arm robots guided by human joint angle constraints

Keyao Liang; Fusheng Zha; Wei Guo; Shengkai Liu; Pengfei Wang; Lining Sun

doi:10.3389/fnbot.2024.1362359

Frontiers in Neurorobotics (Feb 2024)

Motion planning framework based on dual-agent DDPG method for dual-arm robots guided by human joint angle constraints

Keyao Liang,
Fusheng Zha,
Wei Guo,
Shengkai Liu,
Pengfei Wang,
Lining Sun

Affiliations

Keyao Liang
Fusheng Zha
Wei Guo
Shengkai Liu
Pengfei Wang
Lining Sun

DOI: https://doi.org/10.3389/fnbot.2024.1362359
Journal volume & issue: Vol. 18

Abstract

Read online

IntroductionReinforcement learning has been widely used in robot motion planning. However, for multi-step complex tasks of dual-arm robots, the trajectory planning method based on reinforcement learning still has some problems, such as ample exploration space, long training time, and uncontrollable training process. Based on the dual-agent depth deterministic strategy gradient (DADDPG) algorithm, this study proposes a motion planning framework constrained by the human joint angle, simultaneously realizing the humanization of learning content and learning style. It quickly plans the coordinated trajectory of dual-arm for complex multi-step tasks.MethodsThe proposed framework mainly includes two parts: one is the modeling of human joint angle constraints. The joint angle is calculated from the human arm motion data measured by the inertial measurement unit (IMU) by establishing a human-robot dual-arm kinematic mapping model. Then, the joint angle range constraints are extracted from multiple groups of demonstration data and expressed as inequalities. Second, the segmented reward function is designed. The human joint angle constraint guides the exploratory learning process of the reinforcement learning method in the form of step reward. Therefore, the exploration space is reduced, the training speed is accelerated, and the learning process is controllable to a certain extent.Results and discussionThe effectiveness of the framework was verified in the gym simulation environment of the Baxter robot's reach-grasp-align task. The results show that in this framework, human experience knowledge has a significant impact on the guidance of learning, and this method can more quickly plan the coordinated trajectory of dual-arm for multi-step tasks.

Published in Frontiers in Neurorobotics

ISSN: 1662-5218 (Online)
Publisher: Frontiers Media S.A.
Country of publisher: Switzerland
LCC subjects: Medicine: Internal medicine: Neurosciences. Biological psychiatry. Neuropsychiatry
Website: https://www.frontiersin.org/journals/neurorobotics/

About the journal

Abstract

Keywords