Getting AI to Cooperate: Sharing a Critic in a Video Game

Bachelor Thesis (2023)
Author(s)

J.J.H. Groenendijk (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

Robert Loftin – Mentor (TU Delft - Interactive Intelligence)

Frans Oliehoek – Mentor (TU Delft - Interactive Intelligence)

K Hildebrandt – Graduation committee member (TU Delft - Computer Graphics and Visualisation)

Faculty
Electrical Engineering, Mathematics and Computer Science
Copyright
© 2023 Jelle Groenendijk
Publication Year
2023
Language
English
Graduation Date
28-06-2023
Awarding Institution
Delft University of Technology
Project
CSE3000 Research Project
Programme
Computer Science and Engineering
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

The popular video game "Overcooked" is a good example of a task that requires complex planning and cooperation with other players. It inspired "Overcooked-AI", an environment for evaluating AI agents. This paper adds a centralized critic to the Overcooked-AI environment's implementation of the PPO algorithm and compares it with the decentralized critic approach in terms of cooperation with human-like agents and computational efficiency.

The centralized critic approach produces results similar to those of the decentralized critic approach, both in self-play and when playing with human-like agents. The likely reasons are that the decentralized critic already has access to the entire observation space, and that no hyperparameter tuning was done due to a lack of time.
