Learning a Latent Representation of the Opponent in Automated Negotiation
R. Gaghi (TU Delft - Electrical Engineering, Mathematics and Computer Science)
Pradeep Kumar Murukannaiah – Mentor (TU Delft - Interactive Intelligence)
B.M. Renting – Mentor (TU Delft - Interactive Intelligence)
X. Zhang – Graduation committee member (TU Delft - Pattern Recognition and Bioinformatics)
Abstract
This paper introduces a strategy for learning opponent parameters in automated negotiation and reusing them in future negotiation sessions. The goal is to maximize the agent's utility while performing consistently across various negotiation scenarios. While a number of reinforcement learning approaches in the field have used Q-learning, this paper uses the newer Proximal Policy Optimization algorithm. Machine learning has been applied to modeling opponents, classifying opponents, and learning strategies, but there have been few attempts to store and reuse this information. In an experimental setup, it is shown that this approach outperforms a baseline in terms of individual utility.
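To illustrate the kind of setup the abstract describes, the sketch below trains a Proximal Policy Optimization agent on a toy single-issue negotiation environment whose observation includes a latent opponent parameter (here, a hypothetical concession rate). This is a minimal illustration, not the authors' implementation: the environment, its reward, and the `opp_concession` parameter are all assumptions made for the example, and the code uses gymnasium and stable-baselines3 rather than any specific negotiation platform.

```python
# Minimal sketch (not the paper's implementation): a toy negotiation
# environment whose observation exposes an assumed latent opponent
# parameter, trained with PPO from stable-baselines3.
import numpy as np
import gymnasium as gym
from gymnasium import spaces
from stable_baselines3 import PPO


class ToyNegotiationEnv(gym.Env):
    """Single-issue negotiation: the agent demands a utility in [0, 1]."""

    def __init__(self, rounds: int = 20):
        super().__init__()
        self.rounds = rounds
        # Observation: [round fraction, opponent's last offer,
        #               hypothetical latent opponent concession rate]
        self.observation_space = spaces.Box(0.0, 1.0, shape=(3,), dtype=np.float32)
        # Action: the utility the agent demands for itself this round.
        self.action_space = spaces.Box(0.0, 1.0, shape=(1,), dtype=np.float32)

    def reset(self, *, seed=None, options=None):
        super().reset(seed=seed)
        self.t = 0
        # Hypothetical latent parameter: how quickly this opponent concedes.
        self.opp_concession = self.np_random.uniform(0.2, 0.8)
        self.opp_offer = 0.0
        return self._obs(), {}

    def _obs(self):
        return np.array(
            [self.t / self.rounds, self.opp_offer, self.opp_concession],
            dtype=np.float32,
        )

    def step(self, action):
        self.t += 1
        # The simulated opponent concedes toward the agent over time.
        self.opp_offer = self.opp_concession * self.t / self.rounds
        demand = float(action[0])
        accepted = demand <= self.opp_offer
        done = accepted or self.t >= self.rounds
        reward = demand if accepted else 0.0  # utility only on agreement
        return self._obs(), reward, done, False, {}


env = ToyNegotiationEnv()
model = PPO("MlpPolicy", env, verbose=0)
model.learn(total_timesteps=10_000)
```

In a setting closer to the paper's, the third observation component would be replaced by a representation learned from earlier sessions with the same opponent, allowing the policy to condition its concession behavior on stored opponent information.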