Searched for: subject:"sample efficiency"
Mur Uribe, Pol (author)
This thesis introduces a new method, called Mixed Iteration, for controlling Markov Decision Processes (MDPs) whose dynamics are only partially known. The algorithm uses sampling to estimate expectations over the partially known dynamics in stochastic environments. Its goal is to lower the number of iterations...
master thesis 2023