Alibekov, Eduard (author), Kubalík, Jiří (author), Babuska, R. (author)

This paper addresses the problem of deriving a policy from the value function in the context of reinforcement learning in continuous state and input spaces. We propose a novel method based on genetic programming to construct a symbolic function, which serves as a proxy to the value function and from which a continuous policy is derived. The...

conference paper 2016