Improved MATD3 algorithm and its adversarial application

WANG  Kun, ZHAO  Yingce, WANG  Guangyao, LI  Jianxun

doi:10.3969/j.issn.1673-3819.2024.05.011

Zhihui kongzhi yu fangzhen (Oct 2024)

Improved MATD3 algorithm and its adversarial application

WANG Kun, ZHAO Yingce, WANG Guangyao, LI Jianxun

Affiliations

WANG Kun, ZHAO Yingce, WANG Guangyao, LI Jianxun: 1 Shanghai Jiao Tong University Department of Automation, Shanghai 200240, China;2 Shenyang Aircraft Design and Research Institute, Shenyang 110035, China

DOI: https://doi.org/10.3969/j.issn.1673-3819.2024.05.011
Journal volume & issue: Vol. 46, no. 5
pp. 77 – 84

Abstract

Read online

Improving the training effect of multi-agent has always been the focus in the field of reinforcement learning. Based on the multi-Agent twin-delay deep deterministic policy gradient (MATD3) algorithm, a parameter sharing mechanism is introduced to improve training efficiency. At the same time, in order to alleviate the inconsistency between real rewards and auxiliary rewards, drawing on the ideas of course learning, a decay factor for auxiliary rewards is proposed to ensure the motivation of policy exploration in the early training period and the reward consistency in the late training period. And the proposed improved MATD3 algorithm is applied to combat vehicle games to achieve intelligent decision-making of the vehicle. The application results show that the reward curve of the vehicle converges stably and the effect is good. Besides, the improved algorithm is compared with the original MATD3 algorithm, and the simulation results verify that the improved algorithm can effectively improve the effect of convergence and the convergence value of reward.

reinforcement learning|parameter sharing|reward consistency|intelligent decision-making

Published in Zhihui kongzhi yu fangzhen

ISSN: 1673-3819 (Print)
Publisher: Editorial Office of Command Control and Simulation
Country of publisher: China
LCC subjects: Military Science
Website: https://www.zhkzyfz.cn/EN/home

About the journal

Abstract

Keywords