Searched for: collection%253Air
(1 - 1 of 1)
document
Alibekov, Eduard (author), Kubalik, Jiri (author), Babuska, R. (author)
This paper addresses the problem of deriving a policy from the value function in the context of critic-only reinforcement learning (RL) in continuous state and action spaces. With continuous-valued states, RL algorithms have to rely on a numerical approximator to represent the value function. Numerical approximation due to its nature virtually...
journal article 2018