Leader–follower UAVs formation control based on a deep Q-network collaborative framework

Zhijun Liu; Jie Li; Jian Shen; Xiaoguang Wang; Pengyun Chen

doi:10.1038/s41598-024-54531-w

Scientific Reports (Feb 2024)

Leader–follower UAVs formation control based on a deep Q-network collaborative framework

Zhijun Liu,
Jie Li,
Jian Shen,
Xiaoguang Wang,
Pengyun Chen

Affiliations

Zhijun Liu: Shenzhen MSU-BIT University
Jie Li: Shenzhen MSU-BIT University
Jian Shen: School of Mechanical and Electrical Engineering, North University of China
Xiaoguang Wang: Department of Advanced Technology, Norinco Group Aviation Ammunition Research Institute
Pengyun Chen: School of Aerospace Engineering, North University of China

DOI: https://doi.org/10.1038/s41598-024-54531-w
Journal volume & issue: Vol. 14, no. 1
pp. 1 – 15

Abstract

Read online

Abstract This study examines a collaborative framework that utilizes an intelligent deep Q-network to regulate the formation of leader–follower Unmanned Aerial Vehicles (UAVs). The aim is to tackle the challenges posed by the highly dynamic and uncertain flight environment of UAVs. In the context of UAVs, we have developed a dynamic model that captures the collective state of the system. This model encompasses variables like as the relative positions, heading angle, rolling angle, and velocity of different nodes in the formation. In the subsequent section, we elucidate the operational procedure of UAVs in a collaborative manner, employing the conceptual framework of Markov Decision Process (MDP). Furthermore, we employ the Reinforcement Learning (RL) to facilitate this process. In light of this premise, a fundamental framework is presented for addressing the control problem of UAVs utilizing the DQN scheme. This framework encompasses a technique for action selection known as $$\varepsilon$$ ε -imitation, as well as algorithmic specifics. Finally, the efficacy and portability of the DQN-based approach are substantiated by numerical simulation validation. The average reward curve demonstrates a satisfactory level of convergence, and kinematic link between the nodes inside the formation satisfies the essential requirements for the creation of a controller.

Published in Scientific Reports

ISSN: 2045-2322 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine; Science
Website: https://www.nature.com/srep/

About the journal