TG
T. Greevink
info
Please Note
<p>This page displays the records of the person named above and is not linked to a unique person identifier. This record may need to be merged to a profile.</p>
1 records found
1
Prioritized Experience Replay based on the Wasserstein Metric in Deep Reinforcement Learning
The regularizing effect of modelling return distributions
This thesis tests the hypothesis that distributional deep reinforcement learning (RL) algorithms get an increased performance over expectation based deep RL because of the regularizing effect of fitting a more complex model. This hypothesis was tested by comparing two variations
...