Reinforcement Learning approach for decision-making in driver control shifting for semi-autonomous driving

Bachelor Thesis (2021)

Author(s)

E. Latoškinas (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

Yang Li – Mentor (TU Delft - Algorithmics)

M.T.J. Spaan – Graduation committee member (TU Delft - Algorithmics)

A. van Deursen – Coach (TU Delft - Software Technology)

Faculty

Electrical Engineering, Mathematics and Computer Science

Keywords

Reinforcement Learning, Machine Learning, Deep Learning, Semi-autonomous vehicles, Markov Decision Process, Decision Making

To reference this document use:

https://resolver.tudelft.nl/uuid:438a9f4b-0f06-486d-8f68-dd203cae2af4

More Info

Publication Year

2021

Language

English

Copyright

Graduation Date

01-07-2021

Awarding Institution

Delft University of Technology

Project

CSE3000 Research Project

Programme

Computer Science and Engineering

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Semi-autonomous driving innovations aim to bridge the gap to fully autonomous driving by cooperating with human drivers: by offering different automation levels, they help determine who should drive in different scenarios. However, known present-day semi-autonomous driving solutions do not generalise to every complex case of interaction between the driver and the AI. This limitation has prompted research into solving the problem with artificial intelligence and machine learning techniques. This paper provides a reinforcement learning approach to one specific decision-making scenario: the driver initiating a shift of control to a different automation level. The decision problem was formulated as a Markov Decision Process and solved both by a handcrafted baseline decision tree and by a reinforcement learning policy learned with the DQN algorithm. The two policies were compared on safety, comfort and efficiency metrics in a simulated driving environment. The results indicated that the reinforcement learning policy generally ensured safety and comfort and showed increased efficiency over the baseline policy; however, it faced efficiency and comfort issues in outlier cases.

Files

Final_v3.pdf

(pdf | 1.3 Mb)

License info not available