P. Mur Uribe

Please Note

This page displays the records of the person named above and is not linked to a unique person identifier. This record may need to be merged to a profile.

Master thesis (1)

1 record found

On the road from Model-Based Dynamical Programming to Model-Free Reinforcement Learning

A sample efficient approach

Master thesis (2023) - P. Mur Uribe, P. Mohajerin Esfahani, M.A. Sharifi Kolarijani

This thesis introduces a new method, called Mixed Iteration, for controlling Markov Decision Processes when only partial information about their dynamics is known. The algorithm uses sampling to calculate the expectation of partially known dynamics in stoc ...