Policy Space Response Oracles

A Survey

Conference Paper (2024)
Author(s)

A. Bighashdel (Eindhoven University of Technology, TU Delft - Sequential Decision Making)

Yongzhao Wang (The Alan Turing Institute)

Stephen McAleer (Carnegie Mellon University)

Rahul Savani (The Alan Turing Institute, University of Liverpool)

F.A. Oliehoek (TU Delft - Sequential Decision Making)

Research Group
Sequential Decision Making
DOI related publication
https://doi.org/10.24963/ijcai.2024/880
More Info
expand_more
Publication Year
2024
Language
English
Research Group
Sequential Decision Making
Pages (from-to)
7951-7961
ISBN (electronic)
978-1-956792-04-1
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Game theory provides a mathematical way to study the interaction between multiple decision makers. However, classical game-theoretic analysis is limited in scalability due to the large number of strategies, precluding direct application to more complex scenarios. This survey provides a comprehensive overview of a framework for large games, known as Policy Space Response Oracles (PSRO), which holds promise to improve scalability by focusing attention on sufficient subsets of strategies. We first motivate PSRO and provide historical context. We then focus on the strategy exploration problem for PSRO: the challenge of assembling effective subsets of strategies that still represent the original game well with minimum computational cost. We survey current research directions for enhancing the efficiency of PSRO, and explore the applications of PSRO across various domains. We conclude by discussing open questions and future research.

Files

0880.pdf
(pdf | 0.364 Mb)
License info not available