Graph Pointer Network Based Hierarchical Curriculum Reinforcement Learning Method Solving Shuttle Tankers Scheduling Problem

Xiaoyong Gao; Yixu Yang; Diao Peng; Shanghe Li; Chaodong Tan; Feifei Li; Tao Chen

doi:10.23919/CSMS.2024.0017

Complex System Modeling and Simulation (Dec 2024)

Affiliations

Xiaoyong Gao: Department of Automation, China University of Petroleum (Beijing), Beijing 102249, China
Yixu Yang: Department of Automation, China University of Petroleum (Beijing), Beijing 102249, China
Diao Peng: Department of Automation, China University of Petroleum (Beijing), Beijing 102249, China
Shanghe Li: Department of Automation, China University of Petroleum (Beijing), Beijing 102249, China
Chaodong Tan: Department of Automation, China University of Petroleum (Beijing), Beijing 102249, China
Feifei Li: Shandong NextAI Tech. Co. Ltd., Dongying 257000, China
Tao Chen: Department of Process and Chemical Engineering, University of Surrey, Guildford, GU2 7XH, UK

Journal volume & issue: Vol. 4, no. 4
pp. 339 – 352

Abstract

Shuttle tankers scheduling is an important task in the offshore oil and gas transportation process. It involves operating time window fulfillment, optimal transportation planning, and proper inventory management. However, conventional approaches such as Mixed Integer Linear Programming (MILP) or metaheuristic algorithms often suffer from long running times. In this paper, a Graph Pointer Network (GPN) based Hierarchical Curriculum Reinforcement Learning (HCRL) method is proposed to solve the Shuttle Tankers Scheduling Problem (STSP). The model is trained to divide the STSP into a voyage stage and an operation stage and to generate routing and inventory management decisions sequentially. An asynchronous training strategy is developed to address the coupling between the two stages. Comparison experiments demonstrate that the proposed HCRL method achieves 12% shorter tour lengths on average than heuristic algorithms. Additional experiments validate its generalizability to unseen instances and its scalability to larger instances.
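
As a rough illustration of the routing side of such a method, the sketch below shows one pointer-style decoding step: nodes that have already been visited are masked out, the remaining scores are normalized with a softmax, and the most probable node is selected. This is not the authors' implementation. The function names `masked_pointer_step` and `greedy_tour` are hypothetical, and the fixed score matrix stands in for the compatibility scores that a trained network would produce. Time windows, inventory decisions, and the hierarchical training scheme are not modeled here.

```python
import math


def masked_pointer_step(scores, visited):
    """One pointer-style selection step (illustrative assumption, not the paper's code).

    scores:  raw compatibility scores, one per node (hypothetical values).
    visited: booleans; True marks nodes that must not be selected again.
    Returns (probabilities, index of the most probable unvisited node).
    """
    if all(visited):
        raise ValueError("no selectable node left")
    # Masked nodes get -inf so their softmax probability is exactly 0.
    masked = [-math.inf if v else s for s, v in zip(scores, visited)]
    peak = max(masked)  # subtract the maximum for numerical stability
    exps = [math.exp(m - peak) for m in masked]
    total = sum(exps)
    probs = [e / total for e in exps]
    choice = max(range(len(probs)), key=probs.__getitem__)
    return probs, choice


def greedy_tour(score_matrix, start=0):
    """Build a visiting order by repeating the masked selection step greedily."""
    n = len(score_matrix)
    visited = [False] * n
    visited[start] = True
    tour = [start]
    while not all(visited):
        _, nxt = masked_pointer_step(score_matrix[tour[-1]], visited)
        visited[nxt] = True
        tour.append(nxt)
    return tour


# Hypothetical scores: row i holds the scores for moving from node i to each node.
example_scores = [
    [0.0, 2.0, 1.0],
    [0.0, 0.0, 3.0],
    [0.0, 1.0, 0.0],
]
example_tour = greedy_tour(example_scores, start=0)
```

In a learned setting the greedy `max` would typically be replaced by sampling from `probs` during training, with the scores recomputed at every step from the current state.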

Published in Complex System Modeling and Simulation

ISSN: 2096-9929 (Print); 2097-3705 (Online)
Publisher: Tsinghua University Press
Country of publisher: China
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science; Technology: Engineering (General). Civil engineering (General): Systems engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=9420428
