Sensors (Jul 2021)

CACLA-Based Trajectory Tracking Guidance for RLV in Terminal Area Energy Management Phase

  • Xuejing Lan,
  • Zhifeng Tan,
  • Tao Zou,
  • Wenbiao Xu

DOI
https://doi.org/10.3390/s21155062
Journal volume & issue
Vol. 21, no. 15
p. 5062

Abstract

Read online

This paper focuses on the trajectory tracking guidance problem for the Terminal Area Energy Management (TAEM) phase of the Reusable Launch Vehicle (RLV). Considering the continuous state and action space of this guidance problem, the Continuous Actor–Critic Learning Automata (CACLA) is applied to construct the guidance strategy of RLV. Two three-layer neuron networks are used to model the critic and actor of CACLA, respectively. The weight vectors of the critic are updated by the model-free Temporal Difference (TD) learning algorithm, which is improved by eligibility trace and momentum factor. The weight vectors of the actor are updated based on the sign of TD error, and a Gauss exploration is carried out in the actor. Finally, a Monte Carlo simulation and a comparison simulation are performed to show the effectiveness of the CACLA-based guidance strategy.

Keywords