A Reinforcement Learning Approach to Dynamic Trajectory Optimization with Consideration of Imbalanced Sub-Goals in Self-Driving Vehicles

Yu-Jin Kim; Woo-Jin Ahn; Sun-Ho Jang; Myo-Taeg Lim; Dong-Sung Pae

doi:10.3390/app14125213

Applied Sciences (Jun 2024)

A Reinforcement Learning Approach to Dynamic Trajectory Optimization with Consideration of Imbalanced Sub-Goals in Self-Driving Vehicles

Yu-Jin Kim,
Woo-Jin Ahn,
Sun-Ho Jang,
Myo-Taeg Lim,
Dong-Sung Pae

Affiliations

Yu-Jin Kim: Korea Institute of Science and Technology, Korea University, Seoul 02456, Republic of Korea
Woo-Jin Ahn: School of Electrical Engineering, Korea University, Seoul 02841, Republic of Korea
Sun-Ho Jang: School of Electrical Engineering, Korea University, Seoul 02841, Republic of Korea
Myo-Taeg Lim: School of Electrical Engineering, Korea University, Seoul 02841, Republic of Korea
Dong-Sung Pae: Department of Software, Sangmyung University, Cheonan 31066, Republic of Korea

DOI: https://doi.org/10.3390/app14125213
Journal volume & issue: Vol. 14, no. 12
p. 5213

Abstract

Read online

Goal-conditioned Reinforcement Learning (RL) holds promise for addressing intricate control challenges by enabling agents to learn and execute desired skills through separate decision modules. However, the irregular occurrence of required skills poses a significant challenge to effective learning. In this paper, we demonstrate the detrimental effects of this imbalanced skill (sub-goal) distribution and propose a novel training approach, Classified Experience Replay (CER), designed to mitigate this challenge. We demonstrate that adapting our method to conventional RL methods significantly enhances the performance of the RL agent. Considering the challenges inherent in tasks such as driving, characterized by biased occurrences of required sub-goals, our study demonstrates the improvement in trained outcomes facilitated by the proposed method. In addition, we introduce a specialized framework tailored for self-driving tasks on highways, integrating model predictive control into our RL trajectory optimization training paradigm. Our approach, utilizing CER with the suggested framework, yields remarkable advancements in trajectory optimization for RL agents operating in highway environments.

Published in Applied Sciences

ISSN: 2076-3417 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Engineering (General). Civil engineering (General); Science: Biology (General); Science: Physics; Science: Chemistry
Website: http://www.mdpi.com/journal/applsci

About the journal

Abstract

Keywords