Automated lane changing using deep reinforcement learning: a user-acceptance case study

Master Thesis (2021)
Author(s)

D.J. van den Haak (TU Delft - Mechanical Engineering)

Contributor(s)

Joost C.F.De de Winter – Mentor (TU Delft - Human-Robot Interaction)

P. Bazilinskyy – Mentor (TU Delft - Human-Robot Interaction)

D Dodou – Coach (TU Delft - Medical Instruments & Bio-Inspired Technology)

Meng Wang Wang – Coach (TU Delft - Transport and Planning)

Faculty
Mechanical Engineering
Copyright
© 2021 Daniel van den Haak
More Info
expand_more
Publication Year
2021
Language
English
Copyright
© 2021 Daniel van den Haak
Graduation Date
23-02-2021
Awarding Institution
Delft University of Technology
Programme
['Mechanical Engineering']
Faculty
Mechanical Engineering
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Lane change decision-making is an important challenge for automated vehicles, urging the need for high performance algorithms that are able to handle complex traffic situations. Deep reinforcement learning (DRL), a machine learning method based on artificial neural networks, has recently become a popular choice for modelling the lane change decision-making process, outperforming various traditional rule-based models. So far, performance has often been expressed in terms of achieved average speed, absence of collisions or merging success rate. However, no studies have investigated how humans will react to the resulting behavior as potential occupants. This study addresses this research gap by validating a self-developed DRL-based lane changing model (trained using proximal policy optimization) from a technology acceptance perspective through an online crowdsourcing experiment. Participants (N=1085) viewed a random subset of 32 out of 120 videos of an automated vehicle driving on a three-lane highway with varying traffic densities featuring our proposed model or a baseline policy (i.e. a state-of-the-art rule-based model, MOBIL). They were tasked to press a response key if the decision-making was deemed undesirable and subsequently rated the vehicle's behavior along four acceptance constructs (performance expectancy, safety, human-likeness and reliability) on a scale of 1 to 5. Results showed that the proposed model caused a significantly lower amount of disagreements and was rated significantly higher on all four acceptance constructs compared to the baseline policy. Moreover, considerable differences between individual disagreement rates were observed for both models. Our findings offer prospects for the practical application of DRL-based lane change models in a use-case scenario, depending on the user. Further research is necessary to examine whether these observations hold in other (more complex) traffic situations. Additionally, we recommend combining DRL with other modelling techniques that allow for personalization of behavioral parameters, such as imitation learning.

Files

Thesis_Final_.pdf
(pdf | 31.8 Mb)
License info not available