Maneuvering target tracking of UAV based on MN-DDPG and transfer learning

Bo Li; Zhi-peng Yang; Da-qing Chen; Shi-yang Liang; Hao Ma

Defence Technology (Apr 2021)

Maneuvering target tracking of UAV based on MN-DDPG and transfer learning

Bo Li,
Zhi-peng Yang,
Da-qing Chen,
Shi-yang Liang,
Hao Ma

Affiliations

Bo Li: School of Electronics and Information, Northwestern Polytechnical University, Xi’an, 710072, China; Corresponding author.
Zhi-peng Yang: School of Electronics and Information, Northwestern Polytechnical University, Xi’an, 710072, China
Da-qing Chen: School of Engineering, London South Bank University, London, SE1 0AA, UK
Shi-yang Liang: School of Electronics and Information, Northwestern Polytechnical University, Xi’an, 710072, China
Hao Ma: AVIC Xi’an Aeronautics Computing Technique Research Institute, Xi’an, 710068, China

Journal volume & issue: Vol. 17, no. 2
pp. 457 – 466

Abstract

Read online

Tracking maneuvering target in real time autonomously and accurately in an uncertain environment is one of the challenging missions for unmanned aerial vehicles (UAVs). In this paper, aiming to address the control problem of maneuvering target tracking and obstacle avoidance, an online path planning approach for UAV is developed based on deep reinforcement learning. Through end-to-end learning powered by neural networks, the proposed approach can achieve the perception of the environment and continuous motion output control. This proposed approach includes: (1) A deep deterministic policy gradient (DDPG)-based control framework to provide learning and autonomous decision-making capability for UAVs; (2) An improved method named MN-DDPG for introducing a type of mixed noises to assist UAV with exploring stochastic strategies for online optimal planning; and (3) An algorithm of task-decomposition and pre-training for efficient transfer learning to improve the generalization capability of UAV’s control model built based on MN-DDPG. The experimental simulation results have verified that the proposed approach can achieve good self-adaptive adjustment of UAV’s flight attitude in the tasks of maneuvering target tracking with a significant improvement in generalization capability and training efficiency of UAV tracking controller in uncertain environments.

Published in Defence Technology

ISSN: 2214-9147 (Online)
Publisher: KeAi Communications Co., Ltd.
Country of publisher: China
LCC subjects: Military Science
Website: https://www.keaipublishing.com/en/journals/defence-technology/

About the journal

Abstract

Keywords