Search results | TU Delft Repositories

Searched for: subject:"Reinforcement Learning"

(1 - 1 of 1)

document: Prioritized Experience Replay based on the Wasserstein Metric in Deep Reinforcement Learning: The regularizing effect of modelling return distributions
Greevink, Thijs (author)
This thesis tests the hypothesis that distributional deep reinforcement learning (RL) algorithms outperform expectation-based deep RL because of the regularizing effect of fitting a more complex model. The hypothesis was tested by comparing two variations of the distributional QR-DQN algorithm combined with prioritized...
master thesis 2019