An Improved Ant Colony Algorithm with Deep Reinforcement Learning for the Robust Multiobjective AGV Routing Problem in Assembly Workshops

Yong Chen; Mingyu Chen; Feiyang Yu; Han Lin; Wenchao Yi

doi:10.3390/app14167135

Applied Sciences (Aug 2024)

An Improved Ant Colony Algorithm with Deep Reinforcement Learning for the Robust Multiobjective AGV Routing Problem in Assembly Workshops

Yong Chen,
Mingyu Chen,
Feiyang Yu,
Han Lin,
Wenchao Yi

Affiliations

Yong Chen: College of Mechanical Engineering, Zhejiang University of Technology, Hangzhou 310014, China
Mingyu Chen: College of Mechanical Engineering, Zhejiang University of Technology, Hangzhou 310014, China
Feiyang Yu: College of Mechanical Engineering, Zhejiang University of Technology, Hangzhou 310014, China
Han Lin: College of Mechanical Engineering, Zhejiang University of Technology, Hangzhou 310014, China
Wenchao Yi: College of Mechanical Engineering, Zhejiang University of Technology, Hangzhou 310014, China

DOI: https://doi.org/10.3390/app14167135
Journal volume & issue: Vol. 14, no. 16
p. 7135

Abstract

Read online

Vehicle routing problems (VRPs) are challenging problems. Many variants of the VRP have been proposed. However, few studies on VRP have combined robustness and just-in-time (JIT) requirements with uncertainty. To solve the problem, this paper proposes the just-in-time-based robust multiobjective vehicle routing problem with time windows (JIT-RMOVRPTW) for the assembly workshop. Based on the conflict between uncertain time and JIT requirements, a JIT strategy was proposed. To measure the robustness of the solution, a metric was designed as the objective. Afterwards, a two-stage nondominated sorting ant colony algorithm with deep reinforcement learning (NSACOWDRL) was proposed. In stage I, ACO combines with NSGA-III to obtain the Pareto frontier. Based on the model, a pheromone update strategy and a transfer probability formula were designed. DDQN was introduced as a local search algorithm which trains networks through Pareto solutions to participate in probabilistic selection and nondominated sorting. In stage II, the Pareto frontier was quantified in feasibility by Monte Carlo simulation, and tested by diversity-robust selection based on uniformly distributed weights in the solution space to select robust Pareto solutions that take diversity into account. The effectiveness of NSACOWDRL was demonstrated through comparative experiments with other algorithms on instances. The impact of JIT strategy is analyzed and the effect of networks on the NSACOWDRL is further discussed.

Published in Applied Sciences

ISSN: 2076-3417 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Engineering (General). Civil engineering (General); Science: Biology (General); Science: Physics; Science: Chemistry
Website: http://www.mdpi.com/journal/applsci

About the journal

Abstract

Keywords