Learning a Latent Representation of the Opponent in Automated Negotiation

Gaghi, Radu

Learning a Latent Representation of the Opponent in Automated Negotiation

Title

Learning a Latent Representation of the Opponent in Automated Negotiation

Author

Gaghi, Radu (TU Delft Electrical Engineering, Mathematics and Computer Science)

Contributor

Murukannaiah, P.K. (mentor)
Renting, B.M. (mentor)
Zhang, X. (graduation committee)

Degree granting institution

Delft University of Technology

Programme

Computer Science and Engineering

Project

CSE3000 Research Project

Date

2022-06-23

Abstract

This paper introduces a strategy for learning opponent parameters in automated negotiation and using them for future negotiation sessions. The goal is to maximize the agent’s utility while being consistent in its performance over various negotiation scenarios. While a number of reinforcement learning approaches in the field have used Q-learning, this paper uses the newer Proximal Policy Optimization algorithm. Machine learning has been used in opponent modeling, classifying opponents, and learning strategies, but there have been few attempts to store and re-use this information. In an experimental setup, it is shown that this approach outperforms a baseline in terms of individual utility.

Subject

Automated negotiation
Machine learning
Reinforcement Learning
Opponent Modelling

To reference this document use:

http://resolver.tudelft.nl/uuid:376b66f3-09cd-4de3-bb67-7f58b0f86288

Part of collection

Student theses

Document type

bachelor thesis

Rights

Files

PDF

Learning_a_Latent_Represe ... iation.pdf

336.21 KB

Close viewer