Search results | TU Delft Repositories

Searched for: collection:ir

(1 - 1 of 1)

Fine-tuning deep RL with gradient-free optimization
de Bruin, T.D. (author), Kober, J. (author), Tuyls, Karl (author), Babuska, R. (author)
Deep reinforcement learning makes it possible to train control policies that map high-dimensional observations to actions. These methods typically use gradient-based optimization techniques to enable relatively efficient learning, but are notoriously sensitive to hyperparameter choices and do not have good convergence properties. Gradient...
journal article 2020