Toward Reliable Robot Navigation Using Deep Reinforcement Learning

Master Thesis (2022)
Author(s)

T.L. van Rietbergen (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

R. Venkatesha Venkatesha Prasad – Mentor (TU Delft - Embedded Systems)

Amjad Amjad Yousef Majid – Graduation committee member (TU Delft - Embedded Systems)

Robert Babuska – Graduation committee member (TU Delft - Learning & Autonomous Control)

Faculty
Electrical Engineering, Mathematics and Computer Science
Copyright
© 2022 Tomas van Rietbergen
More Info
expand_more
Publication Year
2022
Language
English
Copyright
© 2022 Tomas van Rietbergen
Graduation Date
07-11-2022
Awarding Institution
Delft University of Technology
Programme
['Electrical Engineering | Embedded Systems']
Related content

Youtube channel with demonstration videos related to thesis project.

https://www.youtube.com/user/tomasvr1
Faculty
Electrical Engineering, Mathematics and Computer Science
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Reliable indoor navigation in the presence of dynamic obstacles is an essential capability for mobile robot deployment. Previous work on robot navigation focuses on expanding the network structure and hardware setup leading to more complex and costly systems. The accompanying physical demonstrations are often limited to slow-moving agents and simplistic obstacle configurations. In this thesis, we develop an end-to-end navigation system with a focus on real-world transferability and produce a low-cost and customizable robot platform. Instead of expanding the network structure, we rely on external capabilities such as backward motion, frame stacking, and behavioral reward design to improve performance while preserving transferability. By convention, these methods have been largely disregarded in previous works on deep reinforcement learning (DRL) for unmanned ground vehicle (UGV) navigation. We analyze the effect on performance in simulation of different off-policy algorithms with hyperparameter and reward function configurations. Experimental results show that our agent can achieve state-of-the-art performance in challenging and unseen simulated environments. In addition, physical robot demonstrations show that our system is capable of dealing with fast-moving and unpredictable agents in a real-world environment.

Files

Tomas_Thesis_Final.pdf
(pdf | 9.09 Mb)
License info not available