Reinforcement learning (RL) agents often achieve impressive results in simulation but can fail catastrophically when facing small deviations at deployment time. In this work, we examine the brittleness of Proximal Policy Optimization (PPO) agents under test-time observation noise and evaluate techniques for improving robustness. We compare four variants: feed-forward PPO; Recurrent PPO, which adds LSTM memory; Noisy-PPO, which is trained with injected observation noise; and Recurrent-Noisy PPO, which combines both. Each variant is evaluated on two benchmarks, the classic CartPole-v1 and the more realistic Highway-env. Performance is measured over 100 episodes per corruption level, using mean return, success rate, and the area under the degradation curve (AUDC) as robustness metrics. Our results show that noise-augmented training yields the largest gains: Noisy-PPO maintains its clean-condition performance even at high noise levels, while recurrence alone offers only modest improvement. In Highway-env, both noise injection and LSTM memory improve returns, indicating that simply adding noise augmentation or recurrence to training can enhance PPO’s robustness to real-world observation uncertainty.
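
Below is a minimal sketch of the two ingredients mentioned above, assuming a Gymnasium-style environment, zero-mean Gaussian observation noise, and a trapezoidal AUDC normalized by the noise range; the paper's exact noise model and AUDC definition may differ, and all names and values here are illustrative.

```python
import gymnasium as gym
import numpy as np


class NoisyObservation(gym.ObservationWrapper):
    """Adds zero-mean Gaussian noise to each observation.

    A wrapper like this can be used both to train a Noisy-PPO variant and
    to corrupt observations at evaluation time.
    """

    def __init__(self, env, sigma):
        super().__init__(env)
        self.sigma = sigma

    def observation(self, obs):
        noise = self.sigma * np.random.standard_normal(obs.shape)
        return (obs + noise).astype(obs.dtype)


def audc(noise_levels, mean_returns):
    """Area under the degradation curve via the trapezoidal rule,
    normalized by the noise range so the result stays on the return scale."""
    noise_levels = np.asarray(noise_levels, dtype=float)
    mean_returns = np.asarray(mean_returns, dtype=float)
    widths = np.diff(noise_levels)
    area = np.sum(0.5 * (mean_returns[1:] + mean_returns[:-1]) * widths)
    return area / (noise_levels[-1] - noise_levels[0])


# Hypothetical usage: mean return over 100 episodes at each noise level
# (numbers are made up for illustration, not results from the paper).
sigmas = [0.0, 0.05, 0.1, 0.2, 0.4]
returns = [500.0, 498.0, 471.0, 388.0, 212.0]
print(f"AUDC: {audc(sigmas, returns):.1f}")
```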