Deep coordination graphs


Abstract

This paper introduces the deep coordination graph (DCG) for collaborative multi-agent reinforcement learning. DCG strikes a flexible trade-off between representational capacity and generalization by factoring the joint value function of all agents according to a coordination graph into payoffs between pairs of agents. The value can be maximized by local message passing along the graph, which allows training the value function end-to-end with Q-learning. Payoff functions are approximated with deep neural networks that employ parameter sharing and low-rank approximations to significantly improve sample efficiency. We show that DCG can solve predator-prey tasks that highlight the relative overgeneralization pathology, as well as challenging StarCraft II micromanagement tasks.
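
As a rough sketch of the factorization described above (the notation here is illustrative, and the full formulation in the paper also conditions the payoffs on agent observation histories), a coordination graph with vertices $\mathcal{V}$ (agents) and edges $\mathcal{E}$ represents the joint value as

$$ Q(s, \mathbf{a}) \;\approx\; \frac{1}{|\mathcal{V}|}\sum_{v \in \mathcal{V}} f^{v}(a_v \mid s) \;+\; \frac{1}{|\mathcal{E}|}\sum_{(u,v) \in \mathcal{E}} f^{uv}(a_u, a_v \mid s), $$

where the per-agent utilities $f^{v}$ and pairwise payoffs $f^{uv}$ are outputs of neural networks with shared parameters, and an (approximately) maximizing joint action is obtained by passing messages locally along the edges of the graph.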