Research on air combat decision algorithm based on proximal policy optimization

ZHANG Bochao; WEN Xiaoling; LIU Lu; ZHANG Yaqian; WANG Hongguang

doi:10.16615/j.cnki.1674-8190.2023.02.17

Hangkong gongcheng jinzhan (Apr 2023)

Research on air combat decision algorithm based on proximal policy optimization

ZHANG Bochao,
WEN Xiaoling,
LIU Lu,
ZHANG Yaqian,
WANG Hongguang

Affiliations

ZHANG Bochao: Shenyang Aircraft Design and Research Institute, Aviation Industry Corporation of China, Ltd., Shenyang 110035, China
WEN Xiaoling: Shenyang Aircraft Design and Research Institute, Aviation Industry Corporation of China, Ltd., Shenyang 110035, China
LIU Lu: Shenyang Aircraft Design and Research Institute, Aviation Industry Corporation of China, Ltd., Shenyang 110035, China
ZHANG Yaqian: Shenyang Aircraft Design and Research Institute, Aviation Industry Corporation of China, Ltd., Shenyang 110035, China
WANG Hongguang: Shenyang Aircraft Design and Research Institute, Aviation Industry Corporation of China, Ltd., Shenyang 110035, China

DOI: https://doi.org/10.16615/j.cnki.1674-8190.2023.02.17
Journal volume & issue: Vol. 14, no. 2
pp. 145 – 151

Abstract

Read online

Facing the future combat scenario with manned and unmanned aerial vehicle cooperation, real-time and accurate air combat decision-making is the basis of winning. The complex air environment, transient situation data, and multiple cumbersome combat tasks make coordinated combat with unmanned aerial vehicles a trend in future air combat, replacing single machine combat. However, multi-agent modeling and training processes face difficulties in reward allocation and network convergence. Air combat scenarios for 5v5 manned and unmanned aerial vehicle cooperation, the characteristic model of single agent is abstracted in this paper, and an algorithm based on proximal policy optimization is proposed to obtain the air combat decision sequence by using reward and punishment incentive in the real-time interaction with the environment. The simulation results show that the algorithm proposed in this paper can adapt to the complex battlefield situation and get a stable and reasonable decision-making strategy in continuous action space after training and learning.

Published in Hangkong gongcheng jinzhan

ISSN: 1674-8190 (Print)
Publisher: Editorial Department of Advances in Aeronautical Science and Engineering
Country of publisher: China
LCC subjects: Technology: Motor vehicles. Aeronautics. Astronautics
Website: http://hkgcjz.cnjournals.com

About the journal

Abstract

Keywords