Enhancing reinforcement learning‐based ramp metering performance at freeway uncertain bottlenecks using curriculum learning

Si Zheng; Zhibin Li; Meng Li; Zemian Ke

doi:10.1049/itr2.12494

IET Intelligent Transport Systems (Oct 2024)

Enhancing reinforcement learning‐based ramp metering performance at freeway uncertain bottlenecks using curriculum learning

Si Zheng,
Zhibin Li,
Meng Li,
Zemian Ke

Affiliations

Si Zheng: School of Transportation Southeast University Nanjing China
Zhibin Li: School of Transportation Southeast University Nanjing China
Meng Li: School of Transportation Southeast University Nanjing China
Zemian Ke: Department of Civil and Environmental Engineering Carnegie Mellon University Pittsburgh Pennsylvania USA

DOI: https://doi.org/10.1049/itr2.12494
Journal volume & issue: Vol. 18, no. 10
pp. 1863 – 1878

Abstract

Read online

Abstract Most current RM approaches are developed for fixed bottlenecks. However, the number and locations of bottlenecks are usually uncertain and even time‐varying due to some unexpected phenomena, such as severe accidents and temporal lane closures. Thus, the RM approach should be able to enhance traffic flow stability by effectively handling the time‐delay effect and fluctuations in traffic flow rate caused by uncertain bottlenecks. This study proposed a novel approach called deep reinforcement learning with curriculum learning (DRLCL) to improve ramp metering efficacy under uncertain bottleneck conditions. The curriculum learning method transfers an optimal control policy from a simple on‐ramp bottleneck case to more challenging bottleneck tasks, while DRLCL agents explore and learn from the tasks gradually. Four RM control tasks were developed in the modified cell transmission model, including typical on‐ramp bottleneck, fixed downstream bottleneck, random‐location bottleneck, and multiple bottlenecks. With curriculum learning, the entire training process was reduced by 45.1% to 64.5%, while maintaining a similar maximum reward level compared to DRL‐based RM control with full learning from scratch. Specifically, the results also demonstrated that the proposed DRLCL‐based RM outperformed the feedback‐based RM due to its stronger predictive ability, faster response, and higher action precision.

Published in IET Intelligent Transport Systems

ISSN: 1751-956X (Print); 1751-9578 (Online)
Publisher: Wiley
Country of publisher: United Kingdom
LCC subjects: Technology: Engineering (General). Civil engineering (General): Transportation engineering; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://ietresearch.onlinelibrary.wiley.com/journal/17519578

About the journal

Abstract

Keywords