Deep coordination graphs

Conference Paper (2020)
Author(s)

Wendelin Böhmer (University of Oxford)

Vitaly Kurin (University of Oxford)

Shimon Whiteson (University of Oxford)

Affiliation
External organisation
More Info
expand_more
Publication Year
2020
Language
English
Affiliation
External organisation
Pages (from-to)
957-968
ISBN (electronic)
9781713821120

Abstract

This paper introduces the deep coordination graph (DCG) for collaborative multi-agent reinforcement learning. DCG strikes a flexible tradeoff between representational capacity and generalization by factoring the joint value function of all agents according to a coordination graph into payoffs between pairs of agents. The value can be maximized by local message passing along the graph, which allows training of the value function end-to-end with Q-learning. Payoff functions are approximated with deep neural networks that employ parameter sharing and low-rank approximations to significantly improve sample efficiency. We show that DCG can solve predatorprey tasks that highlight the relative overgeneralization pathology, as well as challenging StarCraft II micromanagement tasks.

No files available

Metadata only record. There are no files for this record.