Search results | TU Delft Repositories

Searched for: subject%3A%22Value%255C%252BIteration%22

(1 - 1 of 1)

document: Optimal control via reinforcement learning with symbolic policy approximation
Kubalík, Jiří (author), Alibekov, Eduard (author), Babuska, R. (author)
Model-based reinforcement learning (RL) algorithms can be used to derive optimal control laws for nonlinear dynamic systems. With continuous-valued state and input variables, RL algorithms have to rely on function approximators to represent the value function and policy mappings. This paper addresses the problem of finding a smooth policy...
journal article 2017