Reinforcement Learning approach for decision-making in driver control shifting for semi-autonomous driving

Bachelor Thesis (2021)
Author(s)

E. Latoškinas (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

Y. Li – Mentor (TU Delft - Algorithmics)

M. T.J. Spaan – Graduation committee member (TU Delft - Algorithmics)

Arie Deursen – Coach (TU Delft - Software Technology)

Faculty
Electrical Engineering, Mathematics and Computer Science
Copyright
© 2021 Evaldas Latoškinas
More Info
expand_more
Publication Year
2021
Language
English
Copyright
© 2021 Evaldas Latoškinas
Graduation Date
01-07-2021
Awarding Institution
Delft University of Technology
Project
CSE3000 Research Project
Programme
Computer Science and Engineering
Faculty
Electrical Engineering, Mathematics and Computer Science
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Semi-autonomous driving innovations aim to bridge the gap to fully autonomous driving by co-operating with human drivers to lead to optimal choices on who should drive in different scenarios by offering different automation levels. However, in the present day, known semi-autonomous driving solutions do not generalise to every complex case of driver and AI interaction. This limitation prompted research in attempting to solve the problem using artificial intelligence and machine learning techniques. This paper focuses on providing a reinforcement learning approach to solve one specific decision-making scenario of the driver initiating a shift of control to a different automation level. The decision problem was formulated as a Markov Decision Process, and the problem was solved both by a baseline handcrafted decision tree and a learned reinforcement learning policy using the DQN algorithm. The two policies were compared based on safety, comfort and efficiency metrics in a simulated driving environment. The results were indicative that a reinforcement learning policy generally ensured safety \& comfort and has shown increased efficiency over the baseline policy, however, it faced efficiency & comfort issues in outlier cases.

Files

Final_v3.pdf
(pdf | 1.3 Mb)
License info not available