Multi-Agent Distributed Deep Deterministic Policy Gradient for Partially Observable Tracking

Dongyu Fan; Haikuo Shen; Lijing Dong

doi:10.3390/act10100268

Actuators (Oct 2021)

Affiliations

Dongyu Fan: School of Mechanical, Electronic and Control Engineering, Beijing Jiaotong University, Beijing 100044, China
Haikuo Shen: School of Mechanical, Electronic and Control Engineering, Beijing Jiaotong University, Beijing 100044, China
Lijing Dong: School of Mechanical, Electronic and Control Engineering, Beijing Jiaotong University, Beijing 100044, China

DOI: https://doi.org/10.3390/act10100268
Journal volume & issue: Vol. 10, no. 10, p. 268

Abstract

In many existing multi-agent reinforcement learning tasks, each agent observes all the other agents from its own perspective. In addition, training is centralized: the critic of each agent can access the policies of all the agents. This scheme is limiting because, in practical applications, each agent can only obtain information from its neighbor agents within its communication range. Therefore, this paper presents a multi-agent distributed deep deterministic policy gradient (MAD3PG) approach with decentralized actors and distributed critics to realize multi-agent distributed tracking. The distinguishing feature of the proposed framework is that it adopts multi-agent distributed training with decentralized execution, where each critic takes only its own agent’s and the neighbor agents’ policies into account. Experiments were conducted on distributed tracking tasks based on multi-agent particle environments, in which N agents (N = 3 and N = 5) track a target agent under partial observation. The results showed that the proposed method achieves a higher reward with a shorter training time than other methods, including MADDPG, DDPG, PPO, and DQN. The proposed method thus leads to more efficient and effective multi-agent tracking.
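The idea of a distributed critic that sees only an agent and its neighbors can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes that neighbors are the agents within a Euclidean communication radius and that the critic input is a flat concatenation of observations and actions. The names `neighbors_within_range`, `distributed_critic_input`, and `comm_range` are hypothetical and chosen only for illustration.

```python
import math


def neighbors_within_range(positions, i, comm_range):
    """Indices of agents other than i within comm_range of agent i.

    Assumption: neighborhood is defined by Euclidean distance.
    """
    xi, yi = positions[i]
    return [
        j
        for j, (xj, yj) in enumerate(positions)
        if j != i and math.hypot(xj - xi, yj - yi) <= comm_range
    ]


def distributed_critic_input(observations, actions, positions, i, comm_range):
    """Build a critic input for agent i from itself and its neighbors only.

    Assumption: the input is a flat concatenation of each group member's
    observation followed by its action. A centralized critic would instead
    concatenate the data of every agent.
    """
    group = [i] + neighbors_within_range(positions, i, comm_range)
    features = []
    for j in group:
        features.extend(observations[j])
        features.extend(actions[j])
    return group, features


# Three agents on a line; agent 2 is out of range of agent 0.
positions = [(0.0, 0.0), (0.5, 0.0), (3.0, 0.0)]
observations = [[0.1, 0.2], [0.3, 0.4], [0.5, 0.6]]
actions = [[1.0], [2.0], [3.0]]

group, features = distributed_critic_input(
    observations, actions, positions, 0, comm_range=1.0
)
```

In this toy setup, the critic input for agent 0 contains only the data of agents 0 and 1, so its size depends on the number of neighbors rather than on the total number of agents.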

Published in Actuators

ISSN: 2076-0825 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering: Materials of engineering and construction. Mechanics of materials; Technology: Electrical engineering. Electronics. Nuclear engineering: Production of electric energy or power. Powerplants. Central stations
Website: http://www.mdpi.com/journal/actuators
