Navigation in Restricted Channels Under Environmental Conditions: Fast-Time Simulation by Asynchronous Deep Reinforcement Learning

Jose Amendola; Lucas S. Miura; Anna H. Reali Costa; Fabio G. Cozman; Eduardo Aoun Tannuri

doi:10.1109/ACCESS.2020.3015661

IEEE Access (Jan 2020)

Navigation in Restricted Channels Under Environmental Conditions: Fast-Time Simulation by Asynchronous Deep Reinforcement Learning

Jose Amendola,
Lucas S. Miura,
Anna H. Reali Costa,
Fabio G. Cozman,
Eduardo Aoun Tannuri

Affiliations

Jose Amendola: ORCiD; Numerical Offshore Tank Laboratory, University of São Paulo, São Paulo, Brazil
Lucas S. Miura: Numerical Offshore Tank Laboratory, University of São Paulo, São Paulo, Brazil
Anna H. Reali Costa: ORCiD; Intelligent Techniques Laboratory, University of São Paulo, São Paulo, Brazil
Fabio G. Cozman: ORCiD; Department of Mechatronics Engineering and Mechanical Systems, University of São Paulo, São Paulo, Brazil
Eduardo Aoun Tannuri: ORCiD; Department of Mechatronics Engineering and Mechanical Systems, University of São Paulo, São Paulo, Brazil

DOI: https://doi.org/10.1109/ACCESS.2020.3015661
Journal volume & issue: Vol. 8
pp. 149199 – 149213

Abstract

Read online

This paper proposes an efficient method, based on reinforcement learning, to be used as ship controller in fast-time simulators within restricted channels. The controller must operate the rudder in a realistic manner in both time and angle variation so as to approximate human piloting. The method is well suited to scenarios where no previous navigation data is available; it takes into account, during training, both the effect of environmental conditions and also curves in channels. We resort to an asynchronous distributed version of the reinforcement learning algorithm Deep Q Network (DQN), handling channel segments as separate episodes and including curvature information as context variables (thus moving away from most work in the literature). We tested our proposal in the channel of Porto Sudeste, in the southern Brazilian coast, with realistic environment scenarios where wind and current incidence varies along the channel. The method keeps a simple representation and can be applied to any port channel configuration that respects local technical regulations.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords