Evaluating robustness of deep reinforcement learning for autonomous driving

Effects of domain randomization on training and robustness

Bachelor Thesis (2023)

Author(s)

E. Bayram (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

M.T.J. Spaan – Mentor (TU Delft - Algorithmics)

M.A. Zanger – Mentor (TU Delft - Algorithmics)

E. Congeduti – Graduation committee member (TU Delft - Computer Science & Engineering-Teaching Team)

Faculty

Electrical Engineering, Mathematics and Computer Science

Keywords

Deep Reinforcement Learning, Domain Randomization, Autonomous driving

To reference this document use:

https://resolver.tudelft.nl/uuid:6754132c-38be-4ead-a4e6-0ea6c477dc59

More Info

Publication Year

2023

Language

English

Graduation Date

28-06-2023

Awarding Institution

Delft University of Technology

Project

CSE3000 Research Project

Programme

Computer Science and Engineering

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Deep reinforcement learning has been an active research topic in recent years and is expanding into autonomous driving. Because autonomous driving is likely to involve people, such as daily commuters, the system must perform well enough in real-life environments that it puts no one at risk. Approaches such as domain randomization aim to ease the transition from simulation to real life. This paper uses OpenAI's CarRacing-v2 environment and the CARLA simulator to investigate the effect of domain randomization on the training efficiency and robustness of a Deep Q-Network algorithm for autonomous driving. The results show lower training efficiency and higher variance during training in both environments. The CARLA results also indicate an overestimation during training. In robustness testing, visual domain randomization in CarRacing-v2 shows no significant influence on robustness, whereas dynamic domain randomization in CARLA improves robustness at the expense of some reward.

Files

Ege_Bayram_Research_Paper.pdf

(pdf | 0.98 Mb)

License info not available