Searched for: subject%3A%22Value%255C%252BIteration%22
(1 - 1 of 1)
document
Kubalík, Jiří (author), Alibekov, Eduard (author), Babuska, R. (author)
Model-based reinforcement learning (RL) algorithms can be used to derive optimal control laws for nonlinear dynamic systems. With continuous-valued state and input variables, RL algorithms have to rely on function approximators to represent the value function and policy mappings. This paper addresses the problem of finding a smooth policy...
journal article 2017