Policy Space Response Oracles

None, None; None, None; None, None; None, None; None, None

Policy Space Response Oracles

A Survey

Conference Paper (2024)

Author(s)

A. Bighashdel (Eindhoven University of Technology, TU Delft - Sequential Decision Making)

Yongzhao Wang (The Alan Turing Institute)

Stephen McAleer (Carnegie Mellon University)

Rahul Savani (The Alan Turing Institute, University of Liverpool)

Frans Oliehoek (TU Delft - Sequential Decision Making)

Research Group

Sequential Decision Making

DOI related publication

https://doi.org/10.24963/ijcai.2024/880

To reference this document use:

https://resolver.tudelft.nl/uuid:8e85e4b8-5088-4356-b343-91b6a77d7f87

More Info

expand_more

Publication Year

2024

Language

English

Research Group

Sequential Decision Making

Pages (from-to)

7951-7961

ISBN (electronic)

978-1-956792-04-1

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Game theory provides a mathematical way to study the interaction between multiple decision makers. However, classical game-theoretic analysis is limited in scalability due to the large number of strategies, precluding direct application to more complex scenarios. This survey provides a comprehensive overview of a framework for large games, known as Policy Space Response Oracles (PSRO), which holds promise to improve scalability by focusing attention on sufficient subsets of strategies. We first motivate PSRO and provide historical context. We then focus on the strategy exploration problem for PSRO: the challenge of assembling effective subsets of strategies that still represent the original game well with minimum computational cost. We survey current research directions for enhancing the efficiency of PSRO, and explore the applications of PSRO across various domains. We conclude by discussing open questions and future research.

Files

0880.pdf

(pdf | 0.364 Mb)

License info not available