Search results | TU Delft Repositories

Searched for: contributor%3A%22Bohmer%2C+Wendelin+%28graduation+committee%29%22

(1 - 1 of 1)

document: Prioritizing states with action sensitive return in experience replay
Keijzer, Alexander (author)
Experience replay for off-policy reinforcement learning has been shown to improve sample efficiency and stabilize training. However, typical uniformly sampled replay includes many irrelevant samples for the agent to reach good performance. We introduce Action Sensitive Experience Replay (ASER), a method to prioritize samples in the replay buffer...
master thesis 2023