Hangkong bingqi (Dec 2024)

Intelligent Attitude Control of Hypersonic Vehicle Based on DDQN and Deep Q-Learning from Demonstrations

  • Liu Jingwen, Cai Guangbin, Fan Yonghua, Fan Hongdong, Wu Tong, Shang Yiming

DOI
https://doi.org/10.12132/issn.1673-5048.2024.0130
Journal volume & issue
Vol. 31, no. 6
pp. 50 – 56

Abstract

Read online

In order to improve the speed and accuracy of solving the attitude control problem of hypersonic vehicle, an intelligent attitude control method of hypersonic vehicle based on demonstration learning is proposed. Firstly, the control model of hypersonic vehicle is established, and the appropriate action is selected as the attitude control output. Secondly, an algorithm based on DDQN (Double Deep Q-Network) and DQfD (Deep Q-learning from Demonstrations) is designed, which divides the training of agents into two stages: pre-training and formal training. In the pre-training stage, the agent extracts small batch data from the demonstration data, and applies four loss functions to update the neural network. In the formal training phase, samples are taken from the data generated by its own training and demonstration data, and the proportion of two types of data in each small batch is controlled through priority experience replay buffer. Learning through interaction with the environment, so that the hypersonic vehicle can adaptively adjust its attitude according to changes in the flight environment. The simulation results show that the reinforcement learning method based on demonstration data can track control command, realize the attitude control of hypersonic vehicle, and improve the performance of neural network in the early stage of training, with a higher average reward.

Keywords