Fault-Tolerant Optimal Attitude Tracking Control of Quadrotor Subject to State and Input Constraints Using Safe Reinforcement Learning

Sajad Roshanravan; Saeed Shamaghdari

مکانیک هوافضا (Apr 2024)

Fault-Tolerant Optimal Attitude Tracking Control of Quadrotor Subject to State and Input Constraints Using Safe Reinforcement Learning

Sajad Roshanravan,
Saeed Shamaghdari

Affiliations

Sajad Roshanravan: Ph.D. Student, Faculty of Electrical Engineering, Iran University of Science and Technology, Tehran, Iran
Saeed Shamaghdari: Corresponding author: Associate Professor, Faculty of Electrical Engineering, Iran University of Science and Technology, Tehran, Iran

Journal volume & issue: Vol. 20, no. 1
pp. 143 – 162

Abstract

Read online

In this article, a method for designing a fault-tolerant optimal attitude tracking control (FTOATC) for a quadrotor UAV subject to component and actuator faults is presented. The proposed fault-tolerant method is based on safe reinforcement learning (SRL) and is capable of ensuring input and state constraints without the need for prior knowledge of the quadrotor dynamics. To this end, the proposed optimal method is presented with a dual neural network (NN) structure consisting of identifier-critic neural networks. In the identifier NN update law, in addition to considering the variable forgetting factor dependent on measurement noise, the experience response method is used, which increases convergence speed and robustness to measurement noise and reduces estimation error. In this method, solving the constrained FTOATC problem is equivalent to solving an unconstrained optimal stabilization problem for an augmented system, where control input constraints and states are guaranteed by selecting suitable cost functions on the input signal and appropriate control barrier functions (CBF)on the states, respectively. Furthermore, fault detection is performed without the need for any model or filter bank, simply by comparing the residual value of the Hamilton-Jacobi-Bellman (HJB) equation with a predetermined threshold. The Uniformly Ultimately Boundedness (UUB) of identifier and critic NN weight errors and, as a result, the convergence of the control input to the neighborhood of the optimal solution are all proved by Lyapunov theory and the performance of the method is validated through simulation results.

Published in مکانیک هوافضا

ISSN: 2645-5323 (Print); 2980-8103 (Online)
Publisher: Imam Hussein University
Country of publisher: Iran, Islamic Republic of
LCC subjects: Technology: Mechanical engineering and machinery
Website: https://maj.ihu.ac.ir/?lang=en

About the journal

Abstract

Keywords