Inverse kinematics solution and control method of 6-degree-of-freedom manipulator based on deep reinforcement learning

Chengyi Zhao; Yimin Wei; Junfeng Xiao; Yong Sun; Dongxing Zhang; Qiuquan Guo; Jun Yang

doi:10.1038/s41598-024-62948-6

Scientific Reports (May 2024)

Affiliations

Chengyi Zhao: University of Electronic Science and Technology of China, Shenzhen Institute for Advanced Study
Yimin Wei: University of Electronic Science and Technology of China, Shenzhen Institute for Advanced Study
Junfeng Xiao: University of Electronic Science and Technology of China, Shenzhen Institute for Advanced Study
Yong Sun: University of Electronic Science and Technology of China, Shenzhen Institute for Advanced Study
Dongxing Zhang: University of Electronic Science and Technology of China, Shenzhen Institute for Advanced Study
Qiuquan Guo: University of Electronic Science and Technology of China, Shenzhen Institute for Advanced Study
Jun Yang: University of Electronic Science and Technology of China, Shenzhen Institute for Advanced Study

DOI: https://doi.org/10.1038/s41598-024-62948-6
Journal volume & issue: Vol. 14, no. 1, pp. 1–12

Abstract

The advent of Industry 4.0 has significantly advanced the field of intelligent manufacturing, facilitated by the emergence of new technologies. Robot technology and robot intelligence methods have developed rapidly and been widely applied. Manipulators are widely used in industry, and their control is a crucial research topic. The inverse kinematics solution is an important part of manipulator control: it calculates the joint angles required for the end effector to reach a desired position and posture. Traditional inverse kinematics algorithms often generalize poorly, and iterative methods suffer from heavy computation and long solution times. This paper proposes a reinforcement learning-based inverse kinematics algorithm, called MAPPO-IK. The algorithm trains the manipulator agent with the MAPPO algorithm and uses a reward mechanism that measures, in real time, the difference between the end-effector state of the manipulator and the target posture, taking both Gaussian distance and cosine distance into account. Comparative experiments verify the feasibility, computational efficiency, and superiority of this reinforcement learning algorithm. Compared with traditional inverse kinematics algorithms, the method generalizes well, supports real-time computation, and yields a unique solution. Reinforcement learning algorithms adapt better to complex environments and can handle unexpected situations across different environments. The algorithm also offers advantages in path planning, intelligent obstacle avoidance, and the dynamic handling of complex environmental scenes.
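
The abstract names the two ingredients of the reward (a Gaussian distance and a cosine distance) but does not give the formula. The following is only an illustrative sketch of how such a reward could be shaped, not the paper's implementation. It assumes that the Gaussian term is a Gaussian kernel of the Euclidean position error and that the cosine term is the cosine similarity between the end effector's approach axis and the target axis. The function names, the width `sigma`, and the weights `w_pos` and `w_ori` are all hypothetical.

```python
import math


def gaussian_position_term(pos, target_pos, sigma=0.1):
    """Gaussian kernel of the Euclidean position error (assumed form).

    Returns 1.0 when the end effector is exactly at the target and
    decays toward 0.0 as the position error grows.
    """
    sq_dist = sum((p - t) ** 2 for p, t in zip(pos, target_pos))
    return math.exp(-sq_dist / (2.0 * sigma ** 2))


def cosine_orientation_term(axis, target_axis):
    """Cosine similarity between two direction vectors, in [-1, 1]."""
    dot = sum(a * b for a, b in zip(axis, target_axis))
    norm = math.sqrt(sum(a * a for a in axis)) * math.sqrt(
        sum(b * b for b in target_axis)
    )
    if norm == 0.0:
        return 0.0  # degenerate input: treat as no alignment information
    return dot / norm


def pose_reward(pos, axis, target_pos, target_axis,
                sigma=0.1, w_pos=0.5, w_ori=0.5):
    """Weighted sum of the position and orientation terms (hypothetical weights)."""
    return (w_pos * gaussian_position_term(pos, target_pos, sigma)
            + w_ori * cosine_orientation_term(axis, target_axis))
```

With these assumed weights the reward peaks at 1.0 when both position and approach axis match the target, and it drops smoothly as either one drifts, giving the agent a dense signal at every step rather than only on success.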

Published in Scientific Reports

ISSN: 2045-2322 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine; Science
Website: https://www.nature.com/srep/
