Coupled and Model-based cooperative planning in Overcooked AI

Bachelor Thesis (2022)
Author(s)

N. van Veen (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

R.T. Loftin – Mentor (TU Delft - Interactive Intelligence)

Frans Oliehoek – Mentor (TU Delft - Interactive Intelligence)

S.E. Verwer – Graduation committee member (TU Delft - Cyber Security)

Faculty
Electrical Engineering, Mathematics and Computer Science
Copyright
© 2022 Nils van Veen
More Info
expand_more
Publication Year
2022
Language
English
Copyright
© 2022 Nils van Veen
Graduation Date
27-06-2022
Awarding Institution
Delft University of Technology
Project
['CSE3000 Research Project', 'Cooperative AI for Overcooked!']
Programme
['Computer Science and Engineering']
Faculty
Electrical Engineering, Mathematics and Computer Science
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

In the field of cooperative AI, an environment is created called Overcooked AI based on the popular Overcooked game. Originally the environment is used to study deep reinforcement learning, on the other hand it also allows for cooperative planning methods of which the paper will focus on. These methods include coupled based planning with replanning and model-based planning. This research paper attempts to reproduce the results the Overcooked AI environment developers obtained and to improve the Coupled Planning algorithm to gain higher results. In particular, experiments were performed against themselves and a human model for the planning methods, and an improved coupled planning algorithm, in which the failures are handled by deviating from optimal play, under different game steps. And a study on collision failures is performed. The results concluded that extrapolation of results are sub-optimal and that collision failures can be significantly reduced by handling collision differently; walking into the opposite direction.

Files

Bsc_thesis_nils_van_veen.pdf
(pdf | 0.574 Mb)
License info not available