Solving Transition-Independent Multi-agent MDPs with Sparse Interactions

None, None; None, None; None, None; None, None; None, None

Solving Transition-Independent Multi-agent MDPs with Sparse Interactions

Conference Paper (2016)

Author(s)

J.C.D. Scharpff (TU Delft - Algorithmics)

Diederik M. Roijers (Universiteit van Amsterdam)

F.A. Oliehoek (Universiteit van Amsterdam, University of Liverpool)

Matthijs Spaan (TU Delft - Algorithmics)

MM de Weerdt (TU Delft - Algorithmics)

Research Group

Algorithmics

Copyright

Markov Decision Process Transition-independent Multi-agent MDPs Reward interactions Conditional Return Graphs

To reference this document use:

https://resolver.tudelft.nl/uuid:a9a91806-a8f2-4c8a-833f-be8bcefbccbb

More Info

expand_more

Publication Year

2016

Language

English

Copyright

Research Group

Algorithmics

Pages (from-to)

3174-3180

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

In cooperative multi-agent sequential decision making under uncertainty, agents must coordinate to find an optimal joint policy that maximises joint value. Typical algorithms exploit additive structure in the value function, but in the fully-observable multi-agent MDP (MMDP) setting such structure is not present. We propose a new optimal solver for transition-independent MMDPs, in which agents can only affect their own state but their reward depends on joint transitions. We represent these de- pendencies compactly in conditional return graphs (CRGs). Using CRGs the value of a joint policy and the bounds on partially specified joint policies can be efficiently computed. We propose CoRe, a novel branch-and-bound policy search algorithm building on CRGs. CoRe typically requires less runtime than the available alternatives and finds solutions to previously unsolvable problems.

Files

10405_Article_Text_13933_1_2_2... (pdf)

(pdf | 0.912 Mb)

License info not available