IEEE Access (Jan 2020)

An Improved DDPG and Its Application Based on the Double-Layer BP Neural Network

  • Mingli Zhang,
  • Yijie Zhang,
  • Zhengjie Gao,
  • Xiaolong He

DOI
https://doi.org/10.1109/ACCESS.2020.3020590
Journal volume & issue
Vol. 8
pp. 177734 – 177744

Abstract

Read online

This paper focused on three application problems of the traditional Deep Deterministic Policy Gradient(DDPG) algorithm. That is, the agent exploration is insufficient, the neural network performance is unsatisfied, the agent output fluctuates greatly. In terms of agent exploration strategy, network training algorithm and overall algorithm implementation, an improved DDPG method based on double-layer BP neural network is proposed. This method introduces fuzzy algorithm and BFGS algorithm based on Armijo-Goldstein criterion, improves the exploration efficiency, learning efficiency and convergence of BP neural network, increases the number of layers of BP neural network to improve the fitting ability of the network, and adopts periodic update to ensure the stable operation of the algorithm. The experimental results show that the deep learning network based on the improved DDPG algorithm has greatly improved the performance compared with the traditional method after multiple rounds of self-learning under variable working conditions. This study lays a theoretical and experimental foundation for the extended application of deep learning algorithm.

Keywords