Journal of Marine Science and Engineering (Aug 2024)
Manipulation-Compliant Artificial Potential Field and Deep Q-Network: Large Ships Path Planning Based on Deep Reinforcement Learning and Artificial Potential Field
Abstract
Enhancing the path planning capabilities of ships is crucial for ensuring navigation safety, saving time, and reducing energy consumption in complex maritime environments. Traditional methods, which rely on static algorithms and single models, frequently fail to respect the physical constraints of ships, such as turning radius, and struggle to adapt to the variability of the maritime environment and to emergencies. The development of reinforcement learning has introduced new methods and perspectives to path planning by handling complex environments, achieving multi-objective optimization, and enhancing autonomous learning and adaptability, significantly improving the performance and application scope of path planning. In this study, we introduce a two-stage path planning approach for large ships, named MAPF–DQN, which combines a Manipulation-Compliant Artificial Potential Field (MAPF) with a Deep Q-Network (DQN). In the first stage, we improve the DQN reward function by integrating the artificial potential field method and use a time-varying greedy algorithm to search for paths. In the second stage, we use the nonlinear Nomoto model for path smoothing to enhance maneuverability. To validate the performance and effectiveness of the algorithm, we conducted extensive experiments using a model of the “Yupeng” ship. Case studies and experimental results demonstrate that the MAPF–DQN algorithm finds paths that closely match the actual trajectory under normal environmental conditions and in the presence of U-shaped obstacles. In summary, the MAPF–DQN algorithm not only improves the efficiency of path planning for large ships but also finds relatively safe and maneuverable routes, which is of great significance for maritime activities.
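To make the first-stage idea concrete, the sketch below shows one plausible way an artificial-potential-field term can shape a DQN reward and how a time-varying greedy schedule can decay exploration over training. It is a minimal, assumption-laden illustration, not the authors' implementation: the gains K_ATT and K_REP, the influence radius D_SAFE, the decay schedule, and all function names are invented here for illustration.

```python
import math
import random

# Illustrative sketch only: an APF-shaped reward and a time-varying
# epsilon-greedy schedule, loosely following the ideas in the abstract.
# All constants and names below are assumptions, not the paper's values.

K_ATT = 1.0      # attractive gain toward the goal (assumed)
K_REP = 100.0    # repulsive gain near obstacles (assumed)
D_SAFE = 5.0     # obstacle influence radius, in grid units (assumed)

def apf_reward(pos, goal, obstacles):
    """Reward shaped by an artificial potential field: lower potential
    (closer to the goal, farther from obstacles) yields a higher reward."""
    d_goal = math.dist(pos, goal)
    u_att = 0.5 * K_ATT * d_goal ** 2
    u_rep = 0.0
    for obs in obstacles:
        d = math.dist(pos, obs)
        if 0 < d <= D_SAFE:
            u_rep += 0.5 * K_REP * (1.0 / d - 1.0 / D_SAFE) ** 2
    return -(u_att + u_rep)  # negative total potential as the shaping term

def epsilon(step, eps_start=1.0, eps_end=0.05, decay=5_000):
    """Time-varying greedy schedule: exploration decays with training steps."""
    return eps_end + (eps_start - eps_end) * math.exp(-step / decay)

def select_action(q_values, step):
    """Epsilon-greedy action selection over the DQN's Q-value estimates."""
    if random.random() < epsilon(step):
        return random.randrange(len(q_values))
    return max(range(len(q_values)), key=lambda a: q_values[a])
```

In such a scheme, the shaped reward pulls the agent toward the goal while penalizing states inside obstacle influence zones, and the decaying epsilon gradually shifts the policy from exploration to exploitation as training progresses.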
Keywords