Searched for: +
(1 - 2 of 2)
document
Moerland, T.M. (author), Broekens, D.J. (author), Jonker, C.M. (author)
In this paper we study how to learn stochastic, multimodal transition dynamics in reinforcement learning (RL) tasks. We focus on evaluating transition function estimation, while we defer planning over this model to future work. Stochasticity is a fundamental property of many task environments. However, discriminative function approximators have...
conference paper 2017
document
Moerland, T.M. (author), Broekens, D.J. (author), Jonker, C.M. (author)
This paper studies directed exploration for reinforcement learning agents by tracking uncertainty about the value of each available action. We identify two sources of uncertainty that are relevant for exploration. The first originates from limited data (parametric uncertainty), while the second originates from the distribution of the returns ...
conference paper 2017