Search results | TU Delft Repositories

Searched for: collection:ir

document: Policy derivation methods for critic-only reinforcement learning in continuous spaces
Alibekov, Eduard (author), Kubalik, Jiri (author), Babuska, R. (author)
This paper addresses the problem of deriving a policy from the value function in the context of critic-only reinforcement learning (RL) in continuous state and action spaces. With continuous-valued states, RL algorithms have to rely on a numerical approximator to represent the value function. Numerical approximation due to its nature virtually...
journal article 2018
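
As a rough illustration of what "deriving a policy from the value function" can mean in a critic-only setting, the sketch below picks, for a given state, the action from a discretized candidate set that maximizes the one-step lookahead r(x, a) + gamma * V(f(x, a)). This is a generic greedy baseline, not necessarily any of the methods studied in the paper. The toy dynamics `f`, reward `r`, value function `V`, action grid, and discount factor are all hypothetical and chosen only to make the example runnable.

```python
def greedy_action(x, V, f, r, actions, gamma=0.95):
    """Return the candidate action with the highest one-step lookahead value.

    x       -- current state (a float in this toy example)
    V       -- value function approximator, V(x) -> float
    f       -- deterministic transition model, f(x, a) -> next state
    r       -- reward function, r(x, a) -> float
    actions -- finite list of candidate actions (a discretized action space)
    gamma   -- discount factor
    """
    return max(actions, key=lambda a: r(x, a) + gamma * V(f(x, a)))


# Hypothetical toy problem: drive a scalar state toward the origin.
def f(x, a):
    return x + 0.1 * a      # assumed dynamics, step size 0.1


def r(x, a):
    return -x ** 2          # assumed reward: penalize distance from 0


def V(x):
    return -x ** 2          # stand-in for a learned value function


# 21 evenly spaced candidate actions in [-1, 1].
actions = [i / 10 for i in range(-10, 11)]

if __name__ == "__main__":
    for x0 in (1.0, 0.0, -1.0):
        print(x0, greedy_action(x0, V, f, r, actions))
```

With a continuous action space, the quality of such a policy depends on how finely the actions are sampled and on how smooth the approximated value function is, which is the kind of issue the abstract points to.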