A new reinforcement learning-based variable speed limit control approach to improve traffic efficiency against freeway jam waves

Journal Article (2022)

Author(s)

Yu Han (Southeast University)

Andreas Hegyi (TU Delft - Transport and Planning)

Le Zhang (Nanjing University of Science and Technology)

Zhengbing He (Beijing University of Technology)

Edward Chung (The Hong Kong Polytechnic University)

Pan Liu (Southeast University)

Transport and Planning

DOI related publication

https://doi.org/10.1016/j.trc.2022.103900

Reinforcement learning, Variable speed limits, Data-driven approach, Freeway traffic control

To reference this document use:

https://resolver.tudelft.nl/uuid:b76f9664-9e49-48e4-b56e-4df2b9f0b67b

Publication Year

2022

Language

English

Bibliographical Note

Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project (https://www.openaccess.nl/en/you-share-we-take-care). Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.

Volume number

144

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Conventional reinforcement learning (RL) models of variable speed limit (VSL) control systems (and traffic control systems in general) cannot be trained in a real traffic process, because new control actions are usually explored randomly, which may incur high costs (delays) during exploration and learning. For this reason, existing RL-based VSL control approaches need a traffic simulator for training. However, the performance of those approaches depends on the accuracy of the simulator. This paper proposes a new RL-based VSL control approach to overcome these problems. The proposed approach is designed to improve traffic efficiency by using VSLs against freeway jam waves. It applies an iterative training framework, in which the optimal control policy is updated by exploring new control actions both online and offline in each iteration. The explored control actions are evaluated in the real traffic process, so the RL model does not learn only from a traffic simulator. The proposed approach is tested using a macroscopic traffic simulation model that represents real-world traffic flow dynamics. Compared with existing VSL control approaches, the proposed approach is shown to have two advantages: (i) it alleviates the impact of model mismatch, which occurs in both model-based VSL control approaches and existing RL-based VSL control approaches, by replacing knowledge from the models with knowledge from the real process, and (ii) it significantly reduces the exploration and learning costs compared with existing RL-based VSL control approaches.
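
The iterative idea in the abstract (screen new actions offline, evaluate the chosen one in the real process, then update the policy from what was measured) can be illustrated with a deliberately small toy. This is not the paper's algorithm and not RL proper: it is a greedy local search used as a stand-in, and the action set, both delay functions, and all names (`simulator_delay`, `real_delay`, `train`) are hypothetical assumptions made only for this sketch.

```python
# Toy illustration only: all numbers and function names are hypothetical.
SPEED_LIMITS = [50, 60, 70, 80, 90, 100, 110, 120]  # km/h, assumed action set


def simulator_delay(limit):
    """Assumed imperfect offline model: predicts least delay at 90 km/h."""
    return 5.0 + (limit - 90) ** 2 / 100.0


def real_delay(limit):
    """Stand-in for the real traffic process: least delay is at 70 km/h."""
    return 5.0 + (limit - 70) ** 2 / 100.0


def train(max_iterations=10):
    # Initial policy comes from the (mismatched) model alone.
    best = min(SPEED_LIMITS, key=simulator_delay)
    observed = {best: real_delay(best)}  # action -> delay measured "online"

    for _ in range(max_iterations):
        i = SPEED_LIMITS.index(best)
        # Offline step: propose untried neighbours of the current best action
        # and rank them with the model, so no real traffic is disturbed yet.
        candidates = [
            SPEED_LIMITS[j]
            for j in (i - 1, i + 1)
            if 0 <= j < len(SPEED_LIMITS) and SPEED_LIMITS[j] not in observed
        ]
        if not candidates:
            break  # nothing new to try around the current policy
        trial = min(candidates, key=simulator_delay)

        # Online step: evaluate the single chosen action in the "real" process.
        observed[trial] = real_delay(trial)

        # Policy update: trust measured delays over model predictions.
        best = min(observed, key=observed.get)

    return best, observed


best, observed = train()
print(best, sorted(observed))
```

In this toy, the policy moves from the model's preferred limit to the one that is best in the stand-in real process while trying only four of the eight limits online, which mirrors, in a very reduced form, the two advantages the abstract claims: less dependence on model accuracy and lower exploration cost.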

Files

1_s2.0_S0968090X22003138_main.... (pdf)

(pdf | 4.58 Mb)

- Embargo expired on 01-07-2023

License info not available