IEEE Access (Jan 2020)

A Novel Multi-Agent Parallel-Critic Network Architecture for Cooperative-Competitive Reinforcement Learning

  • Yu Sun,
  • Jun Lai,
  • Lei Cao,
  • Xiliang Chen,
  • Zhixiong Xu,
  • Yue Xu

DOI
https://doi.org/10.1109/ACCESS.2020.3011670
Journal volume & issue
Vol. 8
pp. 135605–135616

Abstract


Multi-agent deep reinforcement learning (MDRL) is an emerging research hotspot and application direction in machine learning and artificial intelligence. MDRL encompasses many algorithms, rules, and frameworks, and is actively studied in areas such as swarm systems, energy allocation optimization, stock analysis, and sequential social dilemmas, with a promising future. In this paper, a parallel-critic method based on the classic MDRL algorithm MADDPG is proposed to alleviate the training instability problem in cooperative-competitive multi-agent environments. Furthermore, a policy smoothing technique is introduced into the proposed method to decrease the variance of the learned policies. The proposed method is evaluated in three different scenarios of the widely used multi-agent particle environment (MPE). Statistical analysis of the experimental results shows that our method significantly improves training stability and performance compared to vanilla MADDPG.
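To make the abstract's two key ideas concrete, below is a minimal sketch, not the authors' published code. It assumes "parallel critics" means maintaining K independent centralized critics per agent whose target estimates are combined conservatively (here via a minimum, as in twin-critic methods), and that "policy smoothing" refers to adding clipped noise to target actions, in the spirit of TD3. All names, network shapes, and hyperparameters below are hypothetical illustrations.

```python
# Illustrative sketch only -- NOT the paper's implementation.
# Assumptions: K parallel centralized critics with a min-combined target,
# and TD3-style clipped-noise smoothing of target actions.
import torch
import torch.nn as nn

OBS_DIM, ACT_DIM, N_AGENTS, K = 8, 2, 3, 2  # hypothetical sizes

def mlp(in_dim, out_dim):
    return nn.Sequential(nn.Linear(in_dim, 64), nn.ReLU(), nn.Linear(64, out_dim))

# Centralized critics: each sees all agents' observations and actions (MADDPG-style).
joint_dim = N_AGENTS * (OBS_DIM + ACT_DIM)
critics = [mlp(joint_dim, 1) for _ in range(K)]           # K parallel critics
target_critics = [mlp(joint_dim, 1) for _ in range(K)]
actor = mlp(OBS_DIM, ACT_DIM)                             # shared decentralized actor
target_actor = mlp(OBS_DIM, ACT_DIM)

def smoothed_target_actions(obs_all, noise_std=0.2, clip=0.5):
    """Target actions with clipped Gaussian noise -- the 'policy smoothing' step."""
    acts = torch.tanh(target_actor(obs_all))              # (N_AGENTS, ACT_DIM)
    noise = (noise_std * torch.randn_like(acts)).clamp(-clip, clip)
    return (acts + noise).clamp(-1.0, 1.0)

def critic_target(next_obs_all, reward, gamma=0.95):
    """Combine the K parallel critics' estimates into one conservative TD target."""
    next_acts = smoothed_target_actions(next_obs_all)
    joint = torch.cat([next_obs_all.flatten(), next_acts.flatten()])
    q_next = torch.min(torch.stack([tc(joint) for tc in target_critics]))
    return reward + gamma * q_next

# Usage: compute one TD target from dummy data.
obs = torch.randn(N_AGENTS, OBS_DIM)
y = critic_target(obs, reward=torch.tensor(1.0))
```

For simplicity the sketch shares one actor across agents; in MADDPG proper each agent keeps its own actor, while every critic remains centralized over all agents' observations and actions.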

Keywords