A new reinforcement learning-based variable speed limit control approach to improve traffic efficiency against freeway jam waves

Journal Article (2022)
Author(s)

Yu Han (Southeast University)

A Hegyi (TU Delft - Transport and Planning)

Le Zhang (Nanjing University of Science and Technology)

Zhengbing He (Beijing University of Technology)

Edward Chung (The Hong Kong Polytechnic University)

Pan Liu (Southeast University)

Transport and Planning
Copyright
© 2022 Yu Han, A. Hegyi, Le Zhang, Zhengbing He, Edward Chung, Pan Liu
DOI related publication
https://doi.org/10.1016/j.trc.2022.103900
More Info
expand_more
Publication Year
2022
Language
English
Copyright
© 2022 Yu Han, A. Hegyi, Le Zhang, Zhengbing He, Edward Chung, Pan Liu
Transport and Planning
Bibliographical Note
Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-care Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.@en
Volume number
144
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Conventional reinforcement learning (RL) models of variable speed limit (VSL) control systems (and traffic control systems in general) cannot be trained in real traffic process because new control actions are usually explored randomly, which may result in high costs (delays) due to exploration and learning. For this reason, existing RL-based VSL control approaches need a traffic simulator for training. However, the performance of those approaches are dependent on the accuracy of the simulators. This paper proposes a new RL-based VSL control approach to overcome the aforementioned problems. The proposed VSL control approach is designed to improve traffic efficiency by using VSLs against freeway jam waves. It applies an iterative training framework, where the optimal control policy is updated by exploring new control actions both online and offline in each iteration. The explored control actions are evaluated in real traffic process, thus it avoids that the RL model learns only from a traffic simulator. The proposed VSL control approach is tested using a macroscopic traffic simulation model to represent real world traffic flow dynamics. By comparing with existing VSL control approaches, the proposed approach is demonstrated to have advantages in the following two aspects: (i) it alleviates the impact of model mismatch, which occurs in both model-based VSL control approaches and existing RL-based VSL control approaches, via replacing knowledge from the models by knowledge from the real process, and (ii) it significantly reduces the exploration and learning costs compared to existing RL-based VSL control approaches.

Files

1_s2.0_S0968090X22003138_main.... (pdf)
(pdf | 4.58 Mb)
- Embargo expired in 01-07-2023
License info not available