Learning cooperative strategies in StarCraft through role-based monotonic value function factorization

Kun Han; Feng Jiang; Haiqi Zhu; Mengxuan Shao; Ruyu Yan

doi:10.3934/era.2024037

Electronic Research Archive (Jan 2024)

Learning cooperative strategies in StarCraft through role-based monotonic value function factorization

Kun Han ,
Feng Jiang,
Haiqi Zhu ,
Mengxuan Shao,
Ruyu Yan

Affiliations

Kun Han: 1. Faculty of Computing, Harbin Institute of Technology, Harbin 150000, China
Feng Jiang: 1. Faculty of Computing, Harbin Institute of Technology, Harbin 150000, China 2. School of Medicine and Health, Harbin Institute of Technology, Harbin 150000, China
Haiqi Zhu: 1. Faculty of Computing, Harbin Institute of Technology, Harbin 150000, China
Mengxuan Shao: 1. Faculty of Computing, Harbin Institute of Technology, Harbin 150000, China
Ruyu Yan: 3. School of Management, Harbin Institute of Technology, Harbin 150000, China

DOI: https://doi.org/10.3934/era.2024037
Journal volume & issue: Vol. 32, no. 2
pp. 779 – 798

Abstract

Read online

StarCraft is a popular real-time strategy game that has been widely used as a research platform for artificial intelligence. Micromanagement refers to the process of making each unit perform appropriate actions separately, depending on the current state in the the multi-agent system comprising all of the units, i.e., the fine-grained control of individual units for common benefit. Therefore, cooperation between different units is crucially important to improve the joint strategy. We have selected multi-agent deep reinforcement learning to tackle the problem of micromanagement. In this paper, we propose a method for learning cooperative strategies in StarCraft based on role-based montonic value function factorization (RoMIX). RoMIX learns roles based on the potential impact of each agent on the multi-agent task; it then represents the action value of a role in a mixed way based on monotonic value function factorization. The final value is calculated by accumulating the action value of all roles. The role-based learning improves the cooperation between agents on the team, allowing them to learn the joint strategy more quickly and efficiently. In addition, RoMIX can also reduce storage resources to a certain extent. Experiments show that RoMIX can not only solve easy tasks, but it can also learn better cooperation strategies for more complex and difficult tasks.

Published in Electronic Research Archive

ISSN: 2688-1594 (Online)
Publisher: AIMS Press
Country of publisher: United States
LCC subjects: Science: Mathematics; Technology: Technology (General): Industrial engineering. Management engineering: Applied mathematics. Quantitative methods
Website: https://www.aimspress.com/journal/era

About the journal

Abstract

Keywords