An Improved Dynamic Window Approach Based on Reinforcement Learning for the Trajectory Planning of Automated Guided Vehicles

Da Jiang; Ling Du; Shuhui Li; Meijing Wang; Hongchao Zhang; Xiaole Chen; Yunlong Sun

doi:10.1109/ACCESS.2024.3373446

IEEE Access (Jan 2024)

An Improved Dynamic Window Approach Based on Reinforcement Learning for the Trajectory Planning of Automated Guided Vehicles

Da Jiang,
Ling Du,
Shuhui Li,
Meijing Wang,
Hongchao Zhang,
Xiaole Chen,
Yunlong Sun

Affiliations

Da Jiang: ORCiD; National Engineering Laboratory for Wheeled Vehicle, China North Vehicle Research Institute, Beijing, China
Ling Du: ORCiD; National Engineering Laboratory for Wheeled Vehicle, China North Vehicle Research Institute, Beijing, China
Shuhui Li: National Engineering Laboratory for Wheeled Vehicle, China North Vehicle Research Institute, Beijing, China
Meijing Wang: National Engineering Laboratory for Wheeled Vehicle, China North Vehicle Research Institute, Beijing, China
Hongchao Zhang: National Engineering Laboratory for Wheeled Vehicle, China North Vehicle Research Institute, Beijing, China
Xiaole Chen: National Engineering Laboratory for Wheeled Vehicle, China North Vehicle Research Institute, Beijing, China
Yunlong Sun: National Engineering Laboratory for Wheeled Vehicle, China North Vehicle Research Institute, Beijing, China

DOI: https://doi.org/10.1109/ACCESS.2024.3373446
Journal volume & issue: Vol. 12
pp. 36016 – 36025

Abstract

Read online

The traditional dynamic window approach (DWA) adopts the constant intervals for the sampling window, which limits the trajectory exploration possibility. This paper employs the twin delayed deep deterministic policy gradient (TD3) approach to generate a reinforcement-learning-based auxiliary candidate trajectory with variable sampling mechanism in the prediction domain for the automated guided vehicle (AGV). Subsequently, this auxiliary trajectory would compete with the traditional DWA sampling trajectories in the optimal evaluation. The proposed method significantly reduces computational costs while expanding the search space of the DWA scheme, which improving sampling utilization efficiency and planning effectiveness. In contrast to completed data-driven methods that directly generate planning solutions through the policy networks, the proposed method overall ensures planning effectiveness by the DWA mechanism unit and demonstrates superior generalization capabilities. Simulation results reveal that the proposed reinforcement-learning-based DWA generates the improvement with 11.97% in planning reward and 17.46% in calculation efficiency towards the traditional DWA approach, which demonstrating a significant performance improvement over the traditional DWA method in AGV local planning mission.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords