Reinforcement learning agents are trained in well-defined environments and evaluated under the assumption that test-time conditions match those encountered during training. However, even small changes in the environment's dynamics can degrade a policy's performance, all the more so in safety-critical domains. This work investigates the use of Quantile Regression Deep Q-Networks (QR-DQN) to detect environment shifts by analyzing the uncertainty in return predictions. QR-DQN extends deep Q-learning by estimating the entire distribution of future returns through quantile regression. We hypothesize that in deterministic settings, the spread of the return distribution, quantified by the inter-quantile range, can indicate whether environmental changes have taken place. The agent learns low-spread predictions for familiar dynamics, but when deployed in changed environments, the quantile distribution becomes wider. We conduct experiments on the deterministic CartPole-v1 environment by varying the pole length. We show that the quantile spread remains low under small changes but increases sharply as the dynamics diverge further from the training setting. Our results indicate the potential of distributional reinforcement learning to enhance reliability and awareness in deployment scenarios.
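
As a rough illustration of the spread measure described above, the sketch below computes the inter-quantile range of the quantile estimates a QR-DQN head produces for the greedy action. The tensor layout, function name, and the choice of the 25th/75th percentiles are assumptions for illustration, not details fixed by this work.

```python
import torch

def quantile_spread(quantiles: torch.Tensor,
                    low: float = 0.25,
                    high: float = 0.75) -> torch.Tensor:
    """Inter-quantile range of a QR-DQN return distribution for one state.

    quantiles: (num_actions, num_quantiles) tensor of quantile estimates,
    ordered by their quantile fractions tau_i = (2i - 1) / (2N)
    (hypothetical tensor layout).
    """
    num_quantiles = quantiles.shape[-1]
    taus = (2 * torch.arange(num_quantiles) + 1) / (2 * num_quantiles)
    greedy = quantiles.mean(dim=-1).argmax()   # action with highest expected return
    q = quantiles[greedy]
    lo_idx = (taus - low).abs().argmin()       # estimate closest to the 25th percentile
    hi_idx = (taus - high).abs().argmin()      # estimate closest to the 75th percentile
    return q[hi_idx] - q[lo_idx]               # wider spread suggests unfamiliar dynamics
```

In a deployment loop, one would compare this spread against a threshold calibrated on the training environment and flag a possible environment shift when it is exceeded; the calibration procedure itself is left unspecified here.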