International Journal of Advanced Robotic Systems (Nov 2020)

Research on autonomous collision avoidance of merchant ship based on inverse reinforcement learning

  • Mao Zheng,
  • Shuo Xie,
  • Xiumin Chu,
  • Tianquan Zhu,
  • Guohao Tian

DOI
https://doi.org/10.1177/1729881420969081
Journal volume & issue
Vol. 17

Abstract

Read online

To learn the optimal collision avoidance policy of merchant ships controlled by human experts, a finite-state Markov decision process model for ship collision avoidance is proposed based on the analysis of collision avoidance mechanism, and an inverse reinforcement learning (IRL) method based on cross entropy and projection is proposed to obtain the optimal policy from expert’s demonstrations. Collision avoidance simulations in different ship encounters are conducted and the results show that the policy obtained by the proposed IRL has a good inversion effect on two kinds of human experts, which indicate that the proposed method can effectively learn the policy of human experts for ship collision avoidance.