AUV Obstacle Avoidance Planning Based on Deep Reinforcement Learning

Jianya Yuan; Hongjian Wang; Honghan Zhang; Changjian Lin; Dan Yu; Chengfeng Li

doi:10.3390/jmse9111166

Journal of Marine Science and Engineering (Oct 2021)

AUV Obstacle Avoidance Planning Based on Deep Reinforcement Learning

Jianya Yuan,
Hongjian Wang,
Honghan Zhang,
Changjian Lin,
Dan Yu,
Chengfeng Li

Affiliations

Jianya Yuan: College of Intelligent Systems Science and Engineering, Harbin Engineering University, Harbin 045100, China
Hongjian Wang: College of Intelligent Systems Science and Engineering, Harbin Engineering University, Harbin 045100, China
Honghan Zhang: College of Intelligent Systems Science and Engineering, Harbin Engineering University, Harbin 045100, China
Changjian Lin: School of Information and Control Engineering, China University of Mining and Technology, Xuzhou 221000, China
Dan Yu: College of Intelligent Systems Science and Engineering, Harbin Engineering University, Harbin 045100, China
Chengfeng Li: College of Intelligent Systems Science and Engineering, Harbin Engineering University, Harbin 045100, China

DOI: https://doi.org/10.3390/jmse9111166
Journal volume & issue: Vol. 9, no. 11
p. 1166

Abstract

Read online

In a complex underwater environment, finding a viable, collision-free path for an autonomous underwater vehicle (AUV) is a challenging task. The purpose of this paper is to establish a safe, real-time, and robust method of collision avoidance that improves the autonomy of AUVs. We propose a method based on active sonar, which utilizes a deep reinforcement learning algorithm to learn the processed sonar information to navigate the AUV in an uncertain environment. We compare the performance of double deep Q-network algorithms with that of a genetic algorithm and deep learning. We propose a line-of-sight guidance method to mitigate abrupt changes in the yaw direction and smooth the heading changes when the AUV switches trajectory. The different experimental results show that the double deep Q-network algorithms ensure excellent collision avoidance performance. The effectiveness of the algorithm proposed in this paper was verified in three environments: random static, mixed static, and complex dynamic. The results show that the proposed algorithm has significant advantages over other algorithms in terms of success rate, collision avoidance performance, and generalization ability. The double deep Q-network algorithm proposed in this paper is superior to the genetic algorithm and deep learning in terms of the running time, total path, performance in avoiding collisions with moving obstacles, and planning time for each step. After the algorithm is trained in a simulated environment, it can still perform online learning according to the information of the environment after deployment and adjust the weight of the network in real-time. These results demonstrate that the proposed approach has significant potential for practical applications.

Published in Journal of Marine Science and Engineering

ISSN: 2077-1312 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Naval Science: Naval architecture. Shipbuilding. Marine engineering; Geography. Anthropology. Recreation: Oceanography
Website: http://www.mdpi.com/journal/jmse

About the journal

Abstract

Keywords