Planning agents have demonstrated superhuman performance in deterministic environments such as chess and Go by combining end-to-end reinforcement learning with powerful tree-based search algorithms. To extend such agents to stochastic or partially observable domains, Stochastic MuZero introduced a framework that models environment uncertainty by splitting each transition into an agent action and a learned stochastic outcome. In this paper, we propose a novel architecture, FlowZero, which builds on this idea but replaces the discrete latent modeling of environment stochasticity with Conditional Normalizing Flows (CNFs). This allows the model to learn a rich, continuous probability distribution over possible future states conditioned on the afterstate. The key advantage of this approach is exact log-likelihood evaluation, which offers more precise density estimation than the evidence lower bound (ELBO) used in Stochastic MuZero. We evaluate the proposed CNF’s capacity both to overfit training data and to generalize to similar and larger data, as well as FlowZero’s ability to perform in a stochastic environment.
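To make the exact-likelihood claim concrete, the following is a minimal sketch of the change-of-variables computation in a conditional normalizing flow, using a single affine (location-scale) transform whose parameters are produced from a context vector standing in for the afterstate. The conditioner weights (`W_mu`, `W_ls`) and dimensions are illustrative assumptions, not the architecture described in the paper; a full CNF would stack many such invertible transforms.

```python
# Hedged sketch: exact log p(x | c) via the change-of-variables formula
# for a single conditional affine flow. All names and weights here are
# hypothetical placeholders, not FlowZero's actual parameterization.
import numpy as np

rng = np.random.default_rng(0)
D, C = 4, 3                           # state dim, context (afterstate) dim
W_mu = rng.normal(size=(C, D))        # conditioner producing the shift
W_ls = rng.normal(size=(C, D)) * 0.1  # conditioner producing the log-scale

def log_likelihood(x, c):
    """Exact log p(x | c): base log-density plus log |det dz/dx|."""
    mu = c @ W_mu                     # conditional shift mu(c)
    log_s = c @ W_ls                  # conditional log-scale log sigma(c)
    z = (x - mu) * np.exp(-log_s)     # invert the affine transform
    log_base = -0.5 * np.sum(z**2 + np.log(2 * np.pi), axis=-1)  # N(0, I) prior
    log_det = -np.sum(log_s, axis=-1)  # Jacobian term of the inverse map
    return log_base + log_det          # exact density, no ELBO gap

x = rng.normal(size=(2, D))           # two candidate next states
c = rng.normal(size=(2, C))           # their conditioning afterstates
print(log_likelihood(x, c))           # one exact log-density per sample
```

Because the transform is invertible with a tractable Jacobian, the density is evaluated exactly rather than bounded from below, which is the contrast with ELBO-based discrete latent models drawn in the abstract.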