Three-Dimensional Path Planning of UAVs in a Complex Dynamic Environment Based on Environment Exploration Twin Delayed Deep Deterministic Policy Gradient

Danyang Zhang; Xiongwei Li; Guoquan Ren; Jiangyi Yao; Kaiyan Chen; Xi Li

doi:10.3390/sym15071371

Symmetry (Jul 2023)

Three-Dimensional Path Planning of UAVs in a Complex Dynamic Environment Based on Environment Exploration Twin Delayed Deep Deterministic Policy Gradient

Danyang Zhang,
Xiongwei Li,
Guoquan Ren,
Jiangyi Yao,
Kaiyan Chen,
Xi Li

Affiliations

Danyang Zhang: Shijiazhuang Campus, Army Engineering University, Shijiazhuang 050003, China
Xiongwei Li: Shijiazhuang Campus, Army Engineering University, Shijiazhuang 050003, China
Guoquan Ren: Shijiazhuang Campus, Army Engineering University, Shijiazhuang 050003, China
Jiangyi Yao: Shijiazhuang Campus, Army Engineering University, Shijiazhuang 050003, China
Kaiyan Chen: Shijiazhuang Campus, Army Engineering University, Shijiazhuang 050003, China
Xi Li: Shijiazhuang Campus, Army Engineering University, Shijiazhuang 050003, China

DOI: https://doi.org/10.3390/sym15071371
Journal volume & issue: Vol. 15, no. 7
p. 1371

Abstract

Read online

Unmanned Aerial Vehicle (UAV) path planning research refers to the UAV automatically planning an optimal path to the destination under the corresponding environment, while avoiding collision with obstacles in this process. In order to solve the problem of 3D path planning of UAV in a dynamic environment, a heuristic dynamic reward function is designed to guide the UAV. We propose the Environment Exploration Twin Delayed Deep Deterministic Policy Gradient (EE-TD3) algorithm, which combines the symmetrical 3D environment exploration coding mechanism on the basis of TD3 algorithm. The EE-TD3 algorithm model can effectively avoid collisions, improve the training efficiency, and achieve faster convergence speed. Finally, the performance of the EE-TD3 algorithm and other deep reinforcement learning algorithms was tested in the simulation environment. The results show that the EE-TD3 algorithm is better than other algorithms in solving the 3D path planning problem of UAV.

Published in Symmetry

ISSN: 2073-8994 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Science: Mathematics
Website: http://www.mdpi.com/journal/symmetry/

About the journal

Abstract

Keywords