Jisuanji Kexue yu Tansuo (Journal of Frontiers of Computer Science and Technology), Feb 2020

Bayesian Deep Reinforcement Learning Algorithm for Solving Deep Exploration Problems

  • YANG Min, WANG Jie

DOI
https://doi.org/10.3778/j.issn.1673-9418.1901020
Journal volume & issue
Vol. 14, no. 2
pp. 307 – 316

Abstract


In the field of reinforcement learning, how to balance exploration and exploitation is a hard problem. Reinforcement learning methods proposed in recent years mainly focus on combining deep learning techniques to improve the generalization ability of the algorithm, but ignore the exploration-exploitation dilemma. Traditional reinforcement learning methods can solve the exploration problem effectively, but under a restriction: the state space of the Markov decision process must be discrete and finite. In this paper, a Bayesian method is proposed to improve the exploration efficiency of deep reinforcement learning algorithms. The main contribution is to extend the method of computing the posterior distribution of parameters in Bayesian linear regression to nonlinear models such as artificial neural networks. Combining Bootstrapped DQN (deep Q-network) with the computational method proposed in this paper yields Bayesian Bootstrapped DQN (BBDQN). Finally, experimental results in two environments show that BBDQN is more efficient than DQN and Bootstrapped DQN in the face of deep exploration.
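The starting point the abstract refers to, the closed-form posterior over the weights in Bayesian linear regression, can be sketched as follows. This is a generic illustration of that posterior computation (with a Gaussian prior and Gaussian noise), not the paper's exact BBDQN formulation; the function name `blr_posterior` and all parameter values are illustrative assumptions.

```python
import numpy as np

def blr_posterior(Phi, y, noise_var=1.0, prior_var=1.0):
    """Posterior over linear weights w given features Phi (n x d), targets y (n,).

    Assumes prior w ~ N(0, prior_var * I) and likelihood y = Phi @ w + N(0, noise_var).
    Returns the posterior mean (d,) and covariance (d, d).
    """
    d = Phi.shape[1]
    # Posterior precision combines the data term and the prior term.
    precision = Phi.T @ Phi / noise_var + np.eye(d) / prior_var
    cov = np.linalg.inv(precision)
    mean = cov @ Phi.T @ y / noise_var
    return mean, cov

rng = np.random.default_rng(0)
w_true = np.array([2.0, -1.0])
Phi = rng.normal(size=(500, 2))          # stand-in for learned features
y = Phi @ w_true + 0.1 * rng.normal(size=500)

mean, cov = blr_posterior(Phi, y, noise_var=0.01)
# Posterior sampling of this kind is what drives exploration in
# Thompson-sampling-style methods: act greedily w.r.t. a sampled weight vector.
w_sample = rng.multivariate_normal(mean, cov)
```

The paper's contribution, per the abstract, is extending this kind of posterior computation from the linear model above to nonlinear function approximators such as neural networks, where no closed form exists.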

Keywords