Alpha C2–An Intelligent Air Defense Commander Independent of Human Decision-Making

Qiang Fu; Cheng-Li Fan; Yafei Song; Xiang-Ke Guo

doi:10.1109/ACCESS.2020.2993459

IEEE Access (Jan 2020)

Affiliations

Qiang Fu: College of Air and Missile Defense, Air Force Engineering University, Xi’an, China
Cheng-Li Fan: College of Air and Missile Defense, Air Force Engineering University, Xi’an, China
Yafei Song: College of Air and Missile Defense, Air Force Engineering University, Xi’an, China
Xiang-Ke Guo: College of Air and Missile Defense, Air Force Engineering University, Xi’an, China

DOI: https://doi.org/10.1109/ACCESS.2020.2993459
Journal volume & issue: Vol. 8
pp. 87504 – 87516

Abstract

The ultimate goal of military intelligence is to equip the command and control (C2) system with the decision-making art of excellent human commanders while making it more agile and stable than human beings. The intelligent commander Alpha C2 solves the dynamic decision-making problem in the complex scenarios of air defense operations using a deep reinforcement learning framework. Unlike traditional C2 systems, which rely on expert rules and decision-making models, Alpha C2 interacts with digital battlefields close to the real world and generates its own learning data. The network takes the integrated states of multiple parties as input, uses a gated recurrent unit network to introduce historical information, and uses an attention mechanism to select the object of action, making the output decision more reliable. Without learning from human combat experience, the neural network is trained in fixed- and random-strategy scenarios with a proximal policy optimization algorithm. Finally, 1,000 rounds of offline confrontation were conducted on a digital battlefield. The results show that Alpha C2 trained using a random strategy generalizes better and that it can defeat an opponent with a higher winning rate than an Expert C2 system (72% vs 21%). Its use of resources is also more reasonable than that of Expert C2, reflecting the flexible and changeable art of command.
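
As an illustration of two of the building blocks named in the abstract, the following is a minimal sketch in plain Python: attention weights used to pick an object of action, and the standard clipped surrogate objective of proximal policy optimization. It is not the authors' implementation. The function names, the toy vectors, and the clipping value `eps=0.2` are assumptions made for this example, and the gated recurrent unit network is omitted entirely.

```python
import math

def attention_weights(query, keys):
    """Scaled dot-product attention weights of one query over candidate keys."""
    scale = math.sqrt(len(query))
    scores = [sum(q * k for q, k in zip(query, key)) / scale for key in keys]
    top = max(scores)  # subtract the maximum for numerical stability
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def ppo_clipped_objective(new_logp, old_logp, advantages, eps=0.2):
    """Mean PPO clipped surrogate objective (a quantity to be maximized)."""
    terms = []
    for lp_new, lp_old, adv in zip(new_logp, old_logp, advantages):
        ratio = math.exp(lp_new - lp_old)  # new policy prob / old policy prob
        clipped = max(1.0 - eps, min(1.0 + eps, ratio))
        terms.append(min(ratio * adv, clipped * adv))
    return sum(terms) / len(terms)

# Hypothetical toy data: one decision query and three candidate targets.
weights = attention_weights([1.0, 0.0], [[2.0, 0.0], [0.0, 2.0], [-1.0, 0.0]])
target = max(range(len(weights)), key=weights.__getitem__)

# Hypothetical single-sample batch: the probability ratio is 2, so it is clipped.
objective = ppo_clipped_objective([math.log(2.0)], [0.0], [1.0])
```

Here the candidate with the largest attention weight is taken as the selected target, and the clipping keeps a single update from moving the policy too far from the one that collected the data.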

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639
