Training a Negotiating Agent through Self-Play

Abstract

Recent developments in applying reinforcement learning to cooperative environments, such as negotiation, have raised an important question: how well can a negotiating agent be trained through self-play? Self-play has previously been applied successfully in other settings, such as the games of chess and Go. This paper explores the use of self-play in training a negotiating agent and determines whether an agent can be trained successfully through self-play alone. The experimental results show that a training stage using self-play can match or even exceed an approach using a set of training opponents. Using multiple self-play opponents introduces more variance during training and further improves average utility. In addition, combining self-play with training opponents yields a hybrid approach that outperforms either technique on its own.
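To make the self-play idea concrete, the sketch below sets up a toy single-issue price negotiation in which each agent follows a time-based concession strategy, and a pool of strategies stands in for the "multiple self-play opponents" mentioned above. The protocol, parameter names, and rates are illustrative assumptions, not the paper's actual setup.

```python
# Minimal self-play sketch (illustrative, not the paper's method):
# a single-issue price negotiation between a seller and a buyer who
# each concede linearly over time at a fixed rate.

def negotiate(seller_rate, buyer_rate, deadline=20):
    """Run one negotiation; return (seller_utility, buyer_utility)."""
    for t in range(1, deadline + 1):
        demand = max(0.0, 1.0 - seller_rate * t)  # seller concedes downward
        offer = min(1.0, buyer_rate * t)          # buyer concedes upward
        if offer >= demand:                       # offers crossed: agreement
            price = (offer + demand) / 2
            return price, 1.0 - price
    return 0.0, 0.0                               # deadline reached: no deal

# A pool of concession rates plays the role of the self-play opponents
# (e.g. frozen copies of earlier versions of the agent).
pool = [0.02, 0.05, 0.1, 0.2]

def mean_seller_utility(rate):
    """Average utility of a seller strategy against the whole pool."""
    return sum(negotiate(rate, b)[0] for b in pool) / len(pool)

# Self-play "training" step: keep the strategy that performs best
# against the pool of past selves.
best_rate = max(pool, key=mean_seller_utility)
```

Evaluating each candidate against the whole pool, rather than a single fixed opponent, is what introduces the extra variance during training that the abstract credits with improving average utility.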