Getting AI to Cooperate: Sharing a Critic in a Video Game

Bachelor Thesis (2023)

Author(s)

J.J.H. Groenendijk (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

R.T. Loftin – Mentor (TU Delft - Interactive Intelligence)

Frans A. Oliehoek – Mentor (TU Delft - Interactive Intelligence)

K. Hildebrandt – Graduation committee member (TU Delft - Computer Graphics and Visualisation)

Faculty

Electrical Engineering, Mathematics and Computer Science

Keywords

Artificial Intelligence, Centralized Critic, Video Games

To reference this document use:

https://resolver.tudelft.nl/uuid:bc1c8ced-5329-4a11-86a2-1f5154915d87

Publication Year

2023

Language

English

Graduation Date

28-06-2023

Awarding Institution

Delft University of Technology

Project

CSE3000 Research Project

Programme

Computer Science and Engineering

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

The popular video game "Overcooked" is a good example of a task that requires complex planning and cooperation with other players. The game inspired "Overcooked-AI", an environment for evaluating AI agents. This paper adds a centralized critic to the Overcooked-AI environment's implementation of the PPO algorithm and compares it with the decentralized critic approach in terms of cooperation with human-like agents and computational efficiency.

The centralized critic approach gives results similar to those of the decentralized critic approach, both in self-play and when playing with human-like agents. This is probably because the decentralized critic already has access to the entire observation space, and because no hyperparameter tuning was done due to a lack of time.
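The explanation above can be illustrated with a minimal sketch. It is not the thesis's implementation and does not use the Overcooked-AI code; the function names, the three-feature state, and the assumption that both agents observe the whole state are all hypothetical. It only shows that, under full observability, the input to a centralized critic is longer than the input to a decentralized critic but carries no new information.

```python
def decentralized_critic_input(own_obs):
    """Input to a per-agent critic: only that agent's observation."""
    return list(own_obs)


def centralized_critic_input(all_obs):
    """Input to a shared critic: the observations of every agent, concatenated."""
    joint = []
    for obs in all_obs:
        joint.extend(obs)
    return joint


# Assumption: a made-up, fully observable state with three features.
state = [0.2, -1.0, 0.5]
# Assumption: each agent's observation encodes the whole state.
obs_agent_0 = list(state)
obs_agent_1 = list(state)

dec_in = decentralized_critic_input(obs_agent_0)
cen_in = centralized_critic_input([obs_agent_0, obs_agent_1])

# Feature values the centralized input contains that the decentralized one lacks.
extra_information = set(cen_in) - set(dec_in)

print(len(dec_in), len(cen_in), extra_information)
```

In this toy setting the centralized input is twice as long as the decentralized one, yet the set of extra feature values is empty, which is consistent with the two approaches producing similar results.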

Files

Final_Bachelor_Thesis_Jelle_Co... (pdf)

(pdf | 0.499 MB)

License info not available