Search results | TU Delft Repositories

Searched for: subject%3A%22reinforcement%255C%252Blearning%22

(1 - 2 of 2)

document: Difference rewards policy gradients
Castellini, Jacopo (author), Devlin, Sam (author), Oliehoek, F.A. (author), Savani, Rahul (author)
Policy gradient methods have become one of the most popular classes of algorithms for multi-agent reinforcement learning. A key challenge, however, that is not addressed by many of these methods is multi-agent credit assignment: assessing an agent’s contribution to the overall performance, which is crucial for learning good policies. We...
journal article 2022

document: Difference Rewards Policy Gradients
Castellini, Jacopo (author), Oliehoek, F.A. (author), Devlin, Sam (author), Savani, Rahul (author)
Policy gradient methods have become one of the most popular classes of algorithms for multi-agent reinforcement learning. A key challenge, however, that is not addressed by many of these methods is multi-agent credit assignment: assessing an agent’s contribution to the overall performance, which is crucial for learning good policies. We propose...
conference paper 2021