Multi-Agent Cooperation Based on Reinforcement Learning with Internal Reward in Maze Problem

Fumito Uwano; Naoki Tatebe; Yusuke Tajima; Masaya Nakata; Tim Kovacs; Keiki Takadama

doi:10.9746/jcmsi.11.321

SICE Journal of Control, Measurement, and System Integration (Jul 2018)

Multi-Agent Cooperation Based on Reinforcement Learning with Internal Reward in Maze Problem

Fumito Uwano,
Naoki Tatebe,
Yusuke Tajima,
Masaya Nakata,
Tim Kovacs,
Keiki Takadama

Affiliations

Fumito Uwano: The University of Electro-Communications
Naoki Tatebe: The University of Electro-Communications
Yusuke Tajima: The University of Electro-Communications
Masaya Nakata: The University of Electro-Communications
Tim Kovacs: University of Bristol
Keiki Takadama: The University of Electro-Communications

DOI: https://doi.org/10.9746/jcmsi.11.321
Journal volume & issue: Vol. 11, no. 4
pp. 321 – 330

Abstract

Read online

This paper introduces a reinforcement learning technique with an internal reward for a multi-agent cooperation task. The proposed methods is an extension of Q-learning which changes the ordinary (external) reward to the internal reward for agent-cooperation. Specifically, we propose here two Q-learning methods, both of which employ the internal reward for the less or no communication. To guarantee the effectiveness of the proposed methods, we theoretically derived the mechanisms that solve the following questions: (1) how the internal rewards should be set to guarantee the cooperation among the agents under the condition of less and no communication; and (2) how the values of the cooperative behaviors types (i.e., the varieties of the cooperative behaviors of the agents) should be updated under the condition of no communication. The intensive simulations on the maze problem for the agent-cooperation task have been revealed that our two proposed methods successfully enable the agents to acquire their cooperative behaviors even in less or no communication, while the conventional method (Q-learning) always fails to acquire such behaviors.

Published in SICE Journal of Control, Measurement, and System Integration

ISSN: 1884-9970 (Online)
Publisher: Taylor & Francis Group
Country of publisher: United Kingdom
LCC subjects: Technology: Mechanical engineering and machinery: Control engineering systems. Automatic machinery (General)
Website: https://www.tandfonline.com/journals/tmsi

About the journal

Abstract

Keywords