Reinforcement Learning for Mean-Field Game

Mridul Agarwal; Vaneet Aggarwal; Arnob Ghosh; Nilay Tiwari

doi:10.3390/a15030073

Algorithms (Feb 2022)

Affiliations

Mridul Agarwal: School of Electrical and Computer Engineering, Purdue University, West Lafayette, IN 47907, USA
Vaneet Aggarwal: School of Electrical and Computer Engineering, Purdue University, West Lafayette, IN 47907, USA
Arnob Ghosh: Department of Electrical and Computer Engineering, Ohio State University, Columbus, OH 43210, USA
Nilay Tiwari: Department of Electrical Engineering, I.I.T. Kanpur, Kanpur 208016, UP, India

Journal volume & issue: Vol. 15, no. 3, p. 73

Abstract

Stochastic games provide a framework for interactions among multiple agents and enable a myriad of applications. In these games, agents decide on actions simultaneously. After the actions are taken, the state of every agent updates to the next state, and each agent receives a reward. However, finding an equilibrium (if one exists) in such a game is often difficult when the number of agents becomes large. This paper focuses on finding a mean-field equilibrium (MFE) in an action-coupled stochastic game setting in an episodic framework. It is assumed that each agent can approximate the impact of the other agents by the empirical distribution of the mean of the actions. All agents know the action distribution and employ lower-myopic best response dynamics to choose the optimal oblivious strategy. This paper proposes a posterior sampling-based approach for reinforcement learning in the mean-field game, where each agent samples a transition probability from a posterior formed from previously observed transitions. We show that the policy and action distributions converge to the optimal oblivious strategy and the limiting distribution, respectively, which constitute an MFE.
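
As a rough illustration of the posterior sampling step mentioned in the abstract, the sketch below keeps transition counts for each (state, action) pair and draws a transition probability vector from a Dirichlet posterior. It is not the authors' algorithm: the class name `TransitionPosterior`, the uniform Dirichlet prior, and the finite state space are assumptions made for this example, and the dependence of the transitions on the population's action distribution is omitted for brevity.

```python
import random
from collections import defaultdict


class TransitionPosterior:
    """Hypothetical helper: Dirichlet posterior over next-state probabilities."""

    def __init__(self, n_states, prior=1.0, seed=None):
        self.n_states = n_states
        self.prior = prior  # assumed symmetric Dirichlet prior parameter
        self.rng = random.Random(seed)
        # counts[(state, action)][next_state] = number of observed transitions
        self.counts = defaultdict(lambda: [0] * n_states)

    def observe(self, state, action, next_state):
        """Record one observed transition."""
        self.counts[(state, action)][next_state] += 1

    def sample(self, state, action):
        """Draw one transition probability vector from the posterior."""
        alphas = [self.prior + c for c in self.counts[(state, action)]]
        # A Dirichlet sample is a vector of independent Gamma draws, normalized.
        draws = [self.rng.gammavariate(a, 1.0) for a in alphas]
        total = sum(draws)
        return [d / total for d in draws]


# Usage: after some observed transitions, sample a model for planning.
posterior = TransitionPosterior(n_states=3, seed=0)
for _ in range(50):
    posterior.observe(state=0, action=1, next_state=2)
sampled_probs = posterior.sample(state=0, action=1)
```

In a posterior sampling scheme of this kind, an agent would typically resample the model at the start of each episode and plan against the sampled probabilities; how that planning interacts with the best response dynamics is specified in the paper, not here.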

Published in Algorithms

ISSN: 1999-4893 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Technology (General): Industrial engineering. Management engineering; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://www.mdpi.com/journal/algorithms
