Bei Peng

Please Note

This page displays the records of the person named above and is not linked to a unique person identifier. This record may need to be merged to a profile.

Conference paper (4)

4 records found

FACMAC: Factored Multi-Agent Centralised Policy Gradients

Conference paper (2021) - Bei Peng, Tabish Rashid, Christian A. Schroeder de Witt, Pierre-Alexandre Kamienny, Philip H.S. Torr, J.W. Böhmer, Shimon Whiteson

We propose FACtored Multi-Agent Centralised policy gradients (FACMAC), a new method for cooperative multi-agent reinforcement learning in both discrete and continuous action spaces. Like MADDPG, a popular multi-agent actor-critic method, our approach uses deep deterministic polic ...

UneVEn: Universal Value Exploration for Multi-Agent Reinforcement Learning

Conference paper (2021) - Tarun Gupta, Anuj Mahajan, Bei Peng, Wendelin Böhmer, Shimon Whiteson

VDN and QMIX are two popular value-based algorithms for cooperative MARL that learn a centralized action value function as a monotonic mixing of per-agent utilities. While this enables easy decentralization of the learned policy, the restricted joint action value function can pre ...

Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning

Conference paper (2021) - Shariq Iqbal, Christian A. Schroeder de Witt, Bei Peng, Wendelin Böhmer, Shimon Whiteson, Fei Sha

Real world multi-agent tasks often involve varying types and quantities of agents and non-agent entities; however, agents within these tasks rarely need to consider all others at all times in order to act effectively. Factored value function approaches have historically leveraged ...

Optimistic Exploration even with a Pessimistic Initialisation

Conference paper (2020) - Tabish Rashid, Bei Peng, Wendelin Böhmer, Shimon Whiteson