Print Email Facebook Twitter Learning a Latent Representation of the Opponent in Automated Negotiation Title Learning a Latent Representation of the Opponent in Automated Negotiation Author Gaghi, Radu (TU Delft Electrical Engineering, Mathematics and Computer Science) Contributor Murukannaiah, P.K. (mentor) Renting, B.M. (mentor) Zhang, X. (graduation committee) Degree granting institution Delft University of Technology Programme Computer Science and Engineering Project CSE3000 Research Project Date 2022-06-23 Abstract This paper introduces a strategy for learning opponent parameters in automated negotiation and using them for future negotiation sessions. The goal is to maximize the agent’s utility while being consistent in its performance over various negotiation scenarios. While a number of reinforcement learning approaches in the field have used Q-learning, this paper uses the newer Proximal Policy Optimization algorithm. Machine learning has been used in opponent modeling, classifying opponents, and learning strategies, but there have been few attempts to store and re-use this information. In an experimental setup, it is shown that this approach outperforms a baseline in terms of individual utility. Subject Automated negotiationMachine learningReinforcement LearningOpponent Modelling To reference this document use: http://resolver.tudelft.nl/uuid:376b66f3-09cd-4de3-bb67-7f58b0f86288 Part of collection Student theses Document type bachelor thesis Rights © 2022 Radu Gaghi Files PDF Learning_a_Latent_Represe ... iation.pdf 336.21 KB Close viewer /islandora/object/uuid:376b66f3-09cd-4de3-bb67-7f58b0f86288/datastream/OBJ/view