Research on autonomous collision avoidance of merchant ship based on inverse reinforcement learning

Mao Zheng; Shuo Xie; Xiumin Chu; Tianquan Zhu; Guohao Tian

doi:10.1177/1729881420969081

International Journal of Advanced Robotic Systems (Nov 2020)

Research on autonomous collision avoidance of merchant ship based on inverse reinforcement learning

Mao Zheng,
Shuo Xie,
Xiumin Chu,
Tianquan Zhu,
Guohao Tian

Affiliations

Mao Zheng: National Engineering Research Center for Water Transportation Safety, Wuhan University of Technology, Wuhan, Hubei Province, People’s Republic of China
Shuo Xie: China Classification Society, Beijing, People’s Republic of China
Xiumin Chu: National Engineering Research Center for Water Transportation Safety, Wuhan University of Technology, Wuhan, Hubei Province, People’s Republic of China
Tianquan Zhu: School of Energy and Power Engineering, Wuhan University of Technology, Wuhan, Hubei Province, People’s Republic of China
Guohao Tian: School of Energy and Power Engineering, Wuhan University of Technology, Wuhan, Hubei Province, People’s Republic of China

DOI: https://doi.org/10.1177/1729881420969081
Journal volume & issue: Vol. 17

Abstract

Read online

To learn the optimal collision avoidance policy of merchant ships controlled by human experts, a finite-state Markov decision process model for ship collision avoidance is proposed based on the analysis of collision avoidance mechanism, and an inverse reinforcement learning (IRL) method based on cross entropy and projection is proposed to obtain the optimal policy from expert’s demonstrations. Collision avoidance simulations in different ship encounters are conducted and the results show that the policy obtained by the proposed IRL has a good inversion effect on two kinds of human experts, which indicate that the proposed method can effectively learn the policy of human experts for ship collision avoidance.

Published in International Journal of Advanced Robotic Systems

ISSN: 1729-8814 (Online)
Publisher: SAGE Publishing
Country of publisher: United Kingdom
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering: Electronics; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://journals.sagepub.com/home/arx

About the journal