Learning Improvement Heuristics for Multi-Unmanned Aerial Vehicle Task Allocation

Boyang Fan; Yuming Bo; Xiang Wu

doi:10.3390/drones8110636

Drones (Nov 2024)

Affiliations

Boyang Fan: School of Automation, Nanjing University of Science and Technology, Nanjing 210094, China
Yuming Bo: School of Automation, Nanjing University of Science and Technology, Nanjing 210094, China
Xiang Wu: School of Automation, Nanjing University of Science and Technology, Nanjing 210094, China

DOI: https://doi.org/10.3390/drones8110636
Journal volume & issue: Vol. 8, no. 11, p. 636

Abstract

Small UAV swarms capable of carrying inexpensive munitions have proven highly effective in strike missions against ground targets on the battlefield. Effective task allocation is crucial for improving the overall operational effectiveness of these swarms. Traditional heuristic methods for the task allocation problem often rely on handcrafted rules, which may limit their performance on complicated tasks. In this paper, a NeuroSelect Discrete Particle Swarm Optimization (NSDPSO) algorithm is presented for the Multi-UAV Task Allocation (MUTA) problem. Specifically, a Transformer-based model is proposed to learn to design a NeuroSelect Heuristic for DPSO to improve the evolutionary process. The iteration of DPSO is modeled as a decomposed Markov Decision Process (MDP), and a reinforcement learning algorithm is employed to train the network parameters. Simulation results are provided to verify the effectiveness of the proposed method.

Published in Drones

ISSN: 2504-446X (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Motor vehicles. Aeronautics. Astronautics
Website: http://www.mdpi.com/journal/drones
