Multi-UAV Cooperative Target Assignment Method Based on Reinforcement Learning

Yunlong Ding; Minchi Kuang; Heng Shi; Jiazhan Gao

doi:10.3390/drones8100562

Drones (Oct 2024)

Affiliations

Yunlong Ding: School of Computer Science and Technology, Xinjiang University, Urumqi 830046, China
Minchi Kuang: School of Computer Science and Technology, Xinjiang University, Urumqi 830046, China
Heng Shi: Precision Instruments Department, Tsinghua University, Beijing 100084, China
Jiazhan Gao: School of Computer Science and Technology, Xinjiang University, Urumqi 830046, China

DOI: https://doi.org/10.3390/drones8100562
Journal volume & issue: Vol. 8, no. 10, p. 562

Abstract

To address the shortcomings of traditional distributed target allocation algorithms, namely the lack of strategic target priority, poor scalability, and poor robustness, this paper proposes a proximal policy optimization algorithm that combines threat assessment with an attention mechanism (TAPPO). Built on a distributed training framework, the algorithm integrates threat assessment with a dynamic attention strategy and designs a dynamic reward function based on the drone’s current hit rate and the missile benefit ratio to improve the algorithm’s exploration ability and scalability. In an 8 vs. 8 multi-UAV confrontation experiment in a digital twin simulation environment, an agent using TAPPO for target allocation defeats the state machine with an 85% win rate and significantly outperforms other current mainstream target allocation algorithms, verifying the effectiveness of the algorithm.
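
The abstract does not state the loss that TAPPO optimizes. As background only, the following is a minimal sketch of the standard clipped surrogate objective from proximal policy optimization, the family of methods the algorithm is named after. It does not implement the paper's threat assessment, attention mechanism, or dynamic reward function. The function name `ppo_clipped_objective` and the default `clip_eps=0.2` are illustrative assumptions, not taken from the paper.

```python
from typing import Sequence


def ppo_clipped_objective(
    ratios: Sequence[float],
    advantages: Sequence[float],
    clip_eps: float = 0.2,  # assumed default; the abstract gives no value
) -> float:
    """Mean clipped surrogate objective of standard PPO (to be maximized).

    ratios[i]     = pi_new(a_i | s_i) / pi_old(a_i | s_i)
    advantages[i] = advantage estimate for sample i
    """
    if len(ratios) != len(advantages):
        raise ValueError("ratios and advantages must have the same length")
    if len(ratios) == 0:
        raise ValueError("at least one sample is required")

    total = 0.0
    for ratio, adv in zip(ratios, advantages):
        # Restrict the probability ratio to [1 - eps, 1 + eps].
        clipped = min(max(ratio, 1.0 - clip_eps), 1.0 + clip_eps)
        # Take the more pessimistic of the unclipped and clipped terms.
        total += min(ratio * adv, clipped * adv)
    return total / len(ratios)
```

Taking the minimum of the two terms removes any incentive to move the new policy far from the old one within a single update, which is the property that usually makes PPO a stable base for multi-agent training setups such as the one the abstract describes.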

Published in Drones

ISSN: 2504-446X (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Motor vehicles. Aeronautics. Astronautics
Website: http://www.mdpi.com/journal/drones