Searched for: subject%3A%22Value%255C%2BIteration%22
(1 - 6 of 6)
document
Mur Uribe, Pol (author)
This thesis introduces a new method, called Mixed Iteration, for controlling Markov Decision Processes when partial information is known about the dynamics of the Markov Decision Process. The algorithm uses sampling to calculate the expectation of partially known dynamics in stochastic environments. Its goal is to lower the number of iterations...
master thesis 2023
document
Fu, Bin (author), Sun, B. (author), Guo, Hang (author), Yang, Tao (author), Fu, Wenxing (author)
The current study presents an online iterative adaptive dynamic programming approach to resolve the zero-sum game (ZSG) for nonlinear continuous-time (CT) systems containing a partially unknown dynamic. The Hamilton-Jacobian-Issacs (HJI) equation is solved along the state trajectory according to the value function approximation and the policy...
conference paper 2023
document
Vitanov, George (author)
This thesis discusses the chemical composition (basicity) control problem of HIsarna, an experimental iron furnace which operates with 30% less CO2 emissions than its traditional blast furnace counterparts. The control challenge is keeping the basicity of the plant in a narrow operating region. A mass balance model of the plant was constructed -...
master thesis 2022
document
Kubalik, Jiri (author), Derner, Erik (author), Zegklitz, Jan (author), Babuska, R. (author)
Reinforcement learning algorithms can solve dynamic decision-making and optimal control problems. With continuous-valued state and input variables, reinforcement learning algorithms must rely on function approximators to represent the value function and policy mappings. Commonly used numerical approximators, such as neural networks or basis...
journal article 2021
document
Verdier, C.F. (author), Babuska, R. (author), Shyrokau, B. (author), Mazo, M. (author)
Control systems designed via learning methods, aiming at quasi-optimal solutions, typically lack stability and performance guarantees. We propose a method to construct a near-optimal control law by means of model-based reinforcement learning and subsequently verifying the reachability and safety of the closed-loop control system through an...
journal article 2019
document
Kubalík, Jiří (author), Alibekov, Eduard (author), Babuska, R. (author)
Model-based reinforcement learning (RL) algorithms can be used to derive optimal control laws for nonlinear dynamic systems. With continuous-valued state and input variables, RL algorithms have to rely on function approximators to represent the value function and policy mappings. This paper addresses the problem of finding a smooth policy...
journal article 2017
Searched for: subject%3A%22Value%255C%2BIteration%22
(1 - 6 of 6)