Intelligent Attitude Control of Hypersonic Vehicle Based on DDQN and Deep Q-Learning from Demonstrations

Liu Jingwen, Cai Guangbin, Fan Yonghua, Fan Hongdong, Wu Tong, Shang Yiming

doi:10.12132/issn.1673-5048.2024.0130

Hangkong bingqi (Dec 2024)

Intelligent Attitude Control of Hypersonic Vehicle Based on DDQN and Deep Q-Learning from Demonstrations

Liu Jingwen, Cai Guangbin, Fan Yonghua, Fan Hongdong, Wu Tong, Shang Yiming

Affiliations

Liu Jingwen, Cai Guangbin, Fan Yonghua, Fan Hongdong, Wu Tong, Shang Yiming: 1. College of Missile Engineering, Rocket Force University of Engineering, Xi’an 710025, China;2. School of Astronautics, Northwestern Polytechnic University, Xi’an 710072, China

DOI: https://doi.org/10.12132/issn.1673-5048.2024.0130
Journal volume & issue: Vol. 31, no. 6
pp. 50 – 56

Abstract

Read online

In order to improve the speed and accuracy of solving the attitude control problem of hypersonic vehicle, an intelligent attitude control method of hypersonic vehicle based on demonstration learning is proposed. Firstly, the control model of hypersonic vehicle is established, and the appropriate action is selected as the attitude control output. Secondly, an algorithm based on DDQN (Double Deep Q-Network) and DQfD (Deep Q-learning from Demonstrations) is designed, which divides the training of agents into two stages: pre-training and formal training. In the pre-training stage, the agent extracts small batch data from the demonstration data, and applies four loss functions to update the neural network. In the formal training phase, samples are taken from the data generated by its own training and demonstration data, and the proportion of two types of data in each small batch is controlled through priority experience replay buffer. Learning through interaction with the environment, so that the hypersonic vehicle can adaptively adjust its attitude according to changes in the flight environment. The simulation results show that the reinforcement learning method based on demonstration data can track control command, realize the attitude control of hypersonic vehicle, and improve the performance of neural network in the early stage of training, with a higher average reward.

|hypersonic vehicle|attitude control|reinforcement learning|learning from demonstrations|ddqn

Published in Hangkong bingqi

ISSN: 1673-5048 (Print)
Publisher: Editorial Office of Aero Weaponry
Country of publisher: China
LCC subjects: Technology: Motor vehicles. Aeronautics. Astronautics
Website: http://www.aeroweaponry.avic.com/EN/1673-5048/home.shtml

About the journal

Abstract

Keywords