Online Safe Flight Control Method Based on Constraint Reinforcement Learning

Jiawei Zhao; Haotian Xu; Zhaolei Wang; Tao Zhang

doi:10.3390/drones8090429

Drones (Aug 2024)

Online Safe Flight Control Method Based on Constraint Reinforcement Learning

Jiawei Zhao,
Haotian Xu,
Zhaolei Wang,
Tao Zhang

Affiliations

Jiawei Zhao: Department of Automatic Control, Xi’an Research Institute of Hi-Tech, Xi’an 710025, China
Haotian Xu: Department of Automation, Tsinghua University, Beijing 100091, China
Zhaolei Wang: Beijing Aerospace Automatic Control Institute, Beijing 100854, China
Tao Zhang: Department of Automation, Tsinghua University, Beijing 100091, China

DOI: https://doi.org/10.3390/drones8090429
Journal volume & issue: Vol. 8, no. 9
p. 429

Abstract

Read online

UAVs are increasingly prominent in the competition for space due to their multiple characteristics, such as strong maneuverability, long flight distance, and high survivability. A new online safe flight control method based on constrained reinforcement learning is proposed for the intelligent safety control of UAVs. This method adopts constrained policy optimization as the main reinforcement learning framework and develops a constrained policy optimization algorithm with extra safety budget, which introduces Lyapunov stability requirements and limits rudder deflection loss to ensure flight safety and improves the robustness of the controller. By efficiently interacting with the constructed simulation environment, a control law model for UAVs is trained. Subsequently, a condition-triggered meta-learning online learning method is used to adjust the control raw online ensuring successful attitude angle tracking. Simulation experimental results show that using online control laws to perform aircraft attitude angle control tasks has an overall score of 100 points. After introducing online learning, the adaptability of attitude control to comprehensive errors such as aerodynamic parameters and wind improved by 21% compared to offline learning. The control law can be learned online to adjust the control policy of UAVs, ensuring their safety and stability during flight.

Published in Drones

ISSN: 2504-446X (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Motor vehicles. Aeronautics. Astronautics
Website: http://www.mdpi.com/journal/drones

About the journal

Abstract

Keywords