Sam Devlin

Conference paper (1)

Journal article (1)

2 records found

Difference rewards policy gradients

Journal article (2022) - Jacopo Castellini (author) , Sam Devlin (author) , FA Oliehoek (author) , Rahul Savani (author)

Policy gradient methods have become one of the most popular classes of algorithms for multi-agent reinforcement learning. A key challenge, however, that is not addressed by many of these methods is multi-agent credit assignment: assessing an agent’s contribution to the overall pe ...

Difference Rewards Policy Gradients

Conference paper (2021) - Jacopo Castellini (author) , FA Oliehoek (author) , Sam Devlin (author) , Rahul Savani (author)