A UAV Maneuver Decision-Making Algorithm for Autonomous Airdrop Based on Deep Reinforcement Learning

Ke Li; Kun Zhang; Zhenchong Zhang; Zekun Liu; Shuai Hua; Jianliang He

doi:10.3390/s21062233

Sensors (Mar 2021)

A UAV Maneuver Decision-Making Algorithm for Autonomous Airdrop Based on Deep Reinforcement Learning

Ke Li,
Kun Zhang,
Zhenchong Zhang,
Zekun Liu,
Shuai Hua,
Jianliang He

Affiliations

Ke Li: School of Electronics and Information, Northwestern Polytechnical University, Xi’an 710072, China
Kun Zhang: School of Electronics and Information, Northwestern Polytechnical University, Xi’an 710072, China
Zhenchong Zhang: School of Electronics and Information, Northwestern Polytechnical University, Xi’an 710072, China
Zekun Liu: School of Electronics and Information, Northwestern Polytechnical University, Xi’an 710072, China
Shuai Hua: School of Electronics and Information, Northwestern Polytechnical University, Xi’an 710072, China
Jianliang He: Science and Technology on Electro-Optic Control Laboratory, Luoyang 471009, China

DOI: https://doi.org/10.3390/s21062233
Journal volume & issue: Vol. 21, no. 6
p. 2233

Abstract

Read online

How to operate an unmanned aerial vehicle (UAV) safely and efficiently in an interactive environment is challenging. A large amount of research has been devoted to improve the intelligence of a UAV while performing a mission, where finding an optimal maneuver decision-making policy of the UAV has become one of the key issues when we attempt to enable the UAV autonomy. In this paper, we propose a maneuver decision-making algorithm based on deep reinforcement learning, which generates efficient maneuvers for a UAV agent to execute the airdrop mission autonomously in an interactive environment. Particularly, the training set of the learning algorithm by the Prioritized Experience Replay is constructed, that can accelerate the convergence speed of decision network training in the algorithm. It is shown that a desirable and effective maneuver decision-making policy can be found by extensive experimental results.

Published in Sensors

ISSN: 1424-8220 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Chemical technology
Website: http://www.mdpi.com/journal/sensors

About the journal

Abstract

Keywords