Multi-UAV pursuit-evasion gaming based on PSO-M3DDPG schemes

Yaozhong Zhang; Meiyan Ding; Jiandong Zhang; Qiming Yang; Guoqing Shi; Meiqu Lu; Frank Jiang

doi:10.1007/s40747-024-01504-1

Complex & Intelligent Systems (Jun 2024)

Affiliations

Yaozhong Zhang: Northwestern Polytechnical University
Meiyan Ding: Northwestern Polytechnical University
Jiandong Zhang: Northwestern Polytechnical University
Qiming Yang: Northwestern Polytechnical University
Guoqing Shi: Northwestern Polytechnical University
Meiqu Lu: Guangxi Minzu University
Frank Jiang: University of Technology Sydney (UTS)

DOI: https://doi.org/10.1007/s40747-024-01504-1
Journal volume & issue: Vol. 10, no. 5
pp. 6867 – 6883

Abstract

The sample data used by reinforcement learning algorithms are often sparse and unstable, which makes training prone to falling into local optima. The Mini-Max Multi-agent Deep Deterministic Policy Gradient (M3DDPG) algorithm is a multi-agent reinforcement learning algorithm that introduces the minimax theorem into the Multi-Agent Deep Deterministic Policy Gradient (MADDPG) algorithm. It too suffers from unstable convergence caused by sparse sample data and randomization. The Particle Swarm Optimisation (PSO) algorithm, by contrast with traditional reinforcement learning methods, constructs independent populations of policy networks to generate sample data, which are then used to train the reinforcement learning algorithm. PSO optimizes and updates the policy population according to a fitness function, with the aim of improving how efficiently the algorithm learns from the sample data and how quickly it converges. To address the multi-agent pursuit-evasion problem, we propose the PSO-M3DDPG algorithm, which combines the PSO algorithm with the M3DDPG algorithm. In simulation experiments, the improved algorithm achieves better training results and faster convergence, which validates its effectiveness.
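
The population update described in the abstract can be illustrated with a minimal sketch of a standard PSO step. This is not the authors' implementation: the abstract does not give the fitness function, the network architecture, or how the population interacts with M3DDPG training. Here each "policy" is assumed to be a flat list of parameters, and `toy_fitness`, `pso_step`, `run_pso`, and the coefficient values `inertia`, `c1`, `c2` are all hypothetical stand-ins chosen for illustration.

```python
import random


def toy_fitness(params):
    # Hypothetical stand-in for a policy's fitness (e.g. episode return).
    # Its maximum value is 0, reached when every parameter equals 1.0.
    return -sum((p - 1.0) ** 2 for p in params)


def pso_step(positions, velocities, best_pos, best_fit, fitness, rng,
             inertia=0.7, c1=1.5, c2=1.5):
    """Apply one standard PSO update in place; return the best fitness seen."""
    # The global best is the best personal best found so far.
    g_idx = max(range(len(positions)), key=lambda i: best_fit[i])
    global_best = list(best_pos[g_idx])
    for i, (x, v) in enumerate(zip(positions, velocities)):
        for d in range(len(x)):
            r1, r2 = rng.random(), rng.random()
            # Velocity mixes momentum, pull to personal best, pull to global best.
            v[d] = (inertia * v[d]
                    + c1 * r1 * (best_pos[i][d] - x[d])
                    + c2 * r2 * (global_best[d] - x[d]))
            x[d] += v[d]
        f = fitness(x)
        if f > best_fit[i]:
            # Personal bests only ever improve.
            best_fit[i] = f
            best_pos[i] = list(x)
    return max(best_fit)


def run_pso(num_particles=8, dim=4, steps=50, seed=0):
    """Run PSO on the toy fitness; return best fitness after each step."""
    rng = random.Random(seed)
    positions = [[rng.uniform(-1.0, 1.0) for _ in range(dim)]
                 for _ in range(num_particles)]
    velocities = [[0.0] * dim for _ in range(num_particles)]
    best_pos = [list(x) for x in positions]
    best_fit = [toy_fitness(x) for x in positions]
    history = [max(best_fit)]
    for _ in range(steps):
        history.append(pso_step(positions, velocities, best_pos, best_fit,
                                toy_fitness, rng))
    return history


if __name__ == "__main__":
    hist = run_pso()
    print("best fitness at start:", hist[0])
    print("best fitness at end:  ", hist[-1])
```

In a setting like the one the abstract describes, the fitness call would presumably be replaced by an evaluation of each policy network in the pursuit-evasion environment, with the collected transitions feeding the reinforcement learner; that coupling is not shown here.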

Published in Complex & Intelligent Systems

ISSN: 2199-4536 (Print); 2198-6053 (Online)
Publisher: Springer
Country of publisher: Switzerland
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science; Technology: Technology (General): Industrial engineering. Management engineering: Information technology
Website: https://www.springer.com/journal/40747