Searched for: subject%3A%22Reinforcement%255C%252BLearning%22
(1 - 5 of 5)
document
de Bruin, T.D. (author)
The arrival of intelligent, general-purpose robots that can learn to perform new tasks autonomously has been promised for a long time now. Deep reinforcement learning, which combines reinforcement learning with deep neural network function approximation, has the potential to enable robots to learn to perform a wide range of new tasks while...
doctoral thesis 2020
document
de Bruin, T.D. (author), Kober, J. (author), Tuyls, Karl (author), Babuska, R. (author)
Deep reinforcement learning makes it possible to train control policies that map high-dimensional observations to actions. These methods typically use gradient-based optimization techniques to enable relatively efficient learning, but are notoriously sensitive to hyperparameter choices and do not have good convergence properties. Gradient...
journal article 2020
document
Kulhánek, J. (author), Derner, Erik (author), de Bruin, T.D. (author), Babuska, R. (author)
Deep reinforcement learning (RL) has been successfully applied to a variety of game-like environments. However, the application of deep RL to visual navigation with realistic environments is a challenging task. We propose a novel learning architecture capable of navigating an agent, e.g. a mobile robot, to a target given by an image. To...
conference paper 2019
document
Buşoniu, Lucian (author), de Bruin, T.D. (author), Tolić, Domagoj (author), Kober, J. (author), Palunko, Ivana (author)
Reinforcement learning (RL) offers powerful algorithms to search for optimal controllers of systems with nonlinear, possibly stochastic dynamics that are unknown or highly uncertain. This review mainly covers artificial-intelligence approaches to RL, from the viewpoint of the control engineer. We explain how approximate representations of the...
review 2018
document
de Bruin, T.D. (author), Kober, J. (author), Tuyls, K.P. (author), Babuska, R. (author)
Experience replay is a technique that allows off-policy reinforcement-learning methods to reuse past experiences. The stability and speed of convergence of reinforcement learning, as well as the eventual performance of the learned policy, are strongly dependent on the experiences being replayed. Which experiences are replayed depends on two...
journal article 2018
Searched for: subject%3A%22Reinforcement%255C%252BLearning%22
(1 - 5 of 5)