Search results | TU Delft Repositories

Searched for: subject:"control"

document: Fine-tuning deep RL with gradient-free optimization
de Bruin, T.D. (author), Kober, J. (author), Tuyls, K.P. (author), Babuska, R. (author)
Deep reinforcement learning makes it possible to train control policies that map high-dimensional observations to actions. These methods typically use gradient-based optimization techniques to enable relatively efficient learning, but are notoriously sensitive to hyperparameter choices and do not have good convergence properties. Gradient...
journal article 2020

document: Experience selection in deep reinforcement learning for control
de Bruin, T.D. (author), Kober, J. (author), Tuyls, K.P. (author), Babuska, R. (author)
Experience replay is a technique that allows off-policy reinforcement-learning methods to reuse past experiences. The stability and speed of convergence of reinforcement learning, as well as the eventual performance of the learned policy, are strongly dependent on the experiences being replayed. Which experiences are replayed depends on two...
journal article 2018

document: Improved deep reinforcement learning for robotics through distribution-based experience retention
de Bruin, T.D. (author), Kober, J. (author), Tuyls, K.P. (author), Babuska, R. (author)
Recent years have seen a growing interest in the use of deep neural networks as function approximators in reinforcement learning. In this paper, an experience replay method is proposed that ensures that the distribution of the experiences used for training is between that of the policy and a uniform distribution. Through experiments on a...
conference paper 2016