Investigation into the Effect of Replay Buffer Diversity on Generalizability

None, None

Investigation into the Effect of Replay Buffer Diversity on Generalizability

Master Thesis (2024)

Author(s)

F. Kaubek (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

J.W. Böhmer – Mentor (TU Delft - Sequential Decision Making)

David Tax – Graduation committee member (TU Delft - Pattern Recognition and Bioinformatics)

Faculty

Electrical Engineering, Mathematics and Computer Science

Machine learning Generalization Reinforcement learning

To reference this document use:

https://resolver.tudelft.nl/uuid:bb41c3e3-59e4-4b1e-8075-2065f44110dd

More Info

expand_more

Publication Year

2024

Language

English

Graduation Date

02-08-2024

Awarding Institution

Delft University of Technology

Programme

['Computer Science']

Faculty

Electrical Engineering, Mathematics and Computer Science

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

In reinforcement learning, the ability to generalize to unseen situations is pivotal to an agent’s success. In this thesis, two novel methods that aim to enhance the generalizability of an agent will be introduced. Both of the methods rely on the idea that the diversity of a replay buffer increases an agent’s ability to generalize. The first utilizes the agent’s exploration strategies to reach interesting states. The second aims to reach further using an additional goal-conditioned agent. Both methods demonstrate improved adaptability without relying on domain-specific knowledge and show promising results.

Files

Thesis_FK.pdf

(pdf | 0.62 Mb)

License info not available