AUV Dynamic Obstacle Avoidance Method Based on Improved PPO Algorithm

Guohao Zhu; Zhou Shen; Laiyuan Liu; Sicong Zhao; Fangzheng Ji; Zixia Ju; Jialong Sun

doi:10.1109/ACCESS.2022.3223382

IEEE Access (Jan 2022)

AUV Dynamic Obstacle Avoidance Method Based on Improved PPO Algorithm

Guohao Zhu,
Zhou Shen,
Laiyuan Liu,
Sicong Zhao,
Fangzheng Ji,
Zixia Ju,
Jialong Sun

Affiliations

Guohao Zhu: School of Geomatics and Marine Information, Jiangsu Ocean University, Lianyungang, China
Zhou Shen: School of Geomatics and Marine Information, Jiangsu Ocean University, Lianyungang, China
Laiyuan Liu: School of Geomatics and Marine Information, Jiangsu Ocean University, Lianyungang, China
Sicong Zhao: School of Geomatics and Marine Information, Jiangsu Ocean University, Lianyungang, China
Fangzheng Ji: School of Geomatics and Marine Information, Jiangsu Ocean University, Lianyungang, China
Zixia Ju: School of Geomatics and Marine Information, Jiangsu Ocean University, Lianyungang, China
Jialong Sun: ORCiD; School of Geomatics and Marine Information, Jiangsu Ocean University, Lianyungang, China

DOI: https://doi.org/10.1109/ACCESS.2022.3223382
Journal volume & issue: Vol. 10
pp. 121340 – 121351

Abstract

Read online

Designing a reasonable obstacle avoidance method for AUV 3D path planning is difficult, and existing obstacle avoidance methods have certain drawbacks. For example, they are only applicable to 2D planar applications and cannot effectively handle dynamic obstacles. To address these problems, we design an obstacle collision prediction model (CPM). Based on the results of the simulation of obstacles’ inertial motion, the safety of the AUV navigation is evaluated to improve the model’s sensitivity to dynamic obstacles. Then, we enhance the learning ability of the sequence sample data by combining it with a long short-term memory (LSTM) network, thus improving the training efficiency and effect of the algorithm. The trained proximal policy optimization (PPO) network can output reasonable actions in order to control the AUV to avoid obstacles, forming an AUV 3D dynamic obstacle avoidance strategy based on the CPM-LSTM-PPO algorithm. The simulation results show that the proposed algorithm has good generalization in uncertain environments. Moreover, it achieves dynamic AUV obstacle avoidance in different three-dimensional unknown environments, providing theoretical and technical support for real path planning.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords