A Policy-Reuse Algorithm Based on Destination Position Prediction for Aircraft Guidance Using Deep Reinforcement Learning

Zhuang Wang; Yi Ai; Qinghai Zuo; Shaowu Zhou; Hui Li

doi:10.3390/aerospace9110632

Aerospace (Oct 2022)

Affiliations

Zhuang Wang: College of Air Traffic Management, Civil Aviation Flight University of China, Guanghan 618307, China
Yi Ai: College of Air Traffic Management, Civil Aviation Flight University of China, Guanghan 618307, China
Qinghai Zuo: College of Air Traffic Management, Civil Aviation Flight University of China, Guanghan 618307, China
Shaowu Zhou: College of Air Traffic Management, Civil Aviation Flight University of China, Guanghan 618307, China
Hui Li: College of Computer Science, Sichuan University, Chengdu 610065, China

DOI: https://doi.org/10.3390/aerospace9110632
Journal volume & issue: Vol. 9, no. 11, p. 632

Abstract

Artificial intelligence for aircraft guidance is an active research topic, and deep reinforcement learning is one of the promising methods. However, because destinations move in different patterns across guidance tasks, training an agent from scratch for each task is inefficient. This article proposes a policy-reuse algorithm based on destination position prediction to address this problem. First, the reward function is optimized to improve flight trajectory quality and training efficiency. Then, by predicting the possible termination position of destinations under different movement patterns, the problem is transformed into an aircraft guidance problem with a fixed-position destination. Finally, taking the agent trained in the fixed-position destination scenario as the baseline agent, a new guidance agent can be trained efficiently. Simulation results show that the method significantly improves the training efficiency of agents in new tasks and that its performance is stable across tasks with different degrees of similarity. This research broadens the application scope of the policy-reuse approach and may inform research in other fields.
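
The abstract does not say how the termination position is predicted, so the following is only a minimal sketch of the general idea under loudly stated assumptions: the destination moves in a plane with constant velocity, the aircraft flies at constant speed, and the "termination position" is taken to be the earliest straight-line intercept point. The names `predict_intercept`, `guide_with_reused_policy`, and `baseline_policy` are hypothetical and do not come from the article; `baseline_policy` stands in for an agent already trained on the fixed-position destination task.

```python
import math

def predict_intercept(aircraft_xy, aircraft_speed, dest_xy, dest_velocity):
    """Earliest point where a constant-speed aircraft can meet a destination
    moving with constant velocity (assumed motion model), or None."""
    rx = dest_xy[0] - aircraft_xy[0]
    ry = dest_xy[1] - aircraft_xy[1]
    vx, vy = dest_velocity
    # Solve |r + v*t| = s*t for t, i.e. a*t^2 + b*t + c = 0.
    a = vx * vx + vy * vy - aircraft_speed ** 2
    b = 2.0 * (rx * vx + ry * vy)
    c = rx * rx + ry * ry
    if abs(a) < 1e-12:
        # Equal speeds: the equation degenerates to a linear one.
        if abs(b) < 1e-12:
            return None
        candidates = [-c / b]
    else:
        disc = b * b - 4.0 * a * c
        if disc < 0.0:
            return None
        root = math.sqrt(disc)
        candidates = [(-b - root) / (2.0 * a), (-b + root) / (2.0 * a)]
    valid = [t for t in candidates if t >= 0.0]
    if not valid:
        return None
    t = min(valid)
    return (dest_xy[0] + vx * t, dest_xy[1] + vy * t)

def guide_with_reused_policy(baseline_policy, aircraft_xy, aircraft_speed,
                             dest_xy, dest_velocity):
    """Feed the predicted fixed point to a policy trained for fixed destinations."""
    target = predict_intercept(aircraft_xy, aircraft_speed, dest_xy, dest_velocity)
    if target is None:
        # Assumption: fall back to the destination's current position.
        target = dest_xy
    return baseline_policy(aircraft_xy, target)

# Hypothetical stand-in for a trained agent: head straight at the target.
def heading_policy(pos, target):
    return math.atan2(target[1] - pos[1], target[0] - pos[0])

point = predict_intercept((0.0, 0.0), 2.0, (10.0, 0.0), (1.0, 0.0))
heading = guide_with_reused_policy(heading_policy, (0.0, 0.0), 2.0,
                                   (10.0, 0.0), (1.0, 0.0))
```

In this sketch the moving-destination task is reduced to the fixed-destination task purely by substituting the predicted point for the goal, which mirrors the transformation the abstract describes. The article's further step of training a new agent from the baseline is not shown.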

Published in Aerospace

ISSN: 2226-4310 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Motor vehicles. Aeronautics. Astronautics
Website: http://www.mdpi.com/journal/aerospace
