Multi-Agent Reinforcement Learning for Portfolio Management

None, None

Multi-Agent Reinforcement Learning for Portfolio Management

Master Thesis (2024)

Author(s)

M. Choi (TU Delft - Mechanical Engineering)

Contributor(s)

Cosimo Lieu – Mentor (TU Delft - Learning & Autonomous Control)

M.M. Celikok – Mentor (TU Delft - Sequential Decision Making)

Mike Chen – Mentor (Robeco B.V.)

Clint Howard – Mentor (Robeco B.V.)

Danny Huang – Mentor (Robeco B.V.)

J. Alonso-Mora – Graduation committee member (TU Delft - Learning & Autonomous Control)

Faculty

Mechanical Engineering

Reinforcement Learning Multi-agent reinforcement learning Multi-agent system Deep Learning Sequential decision making Portfolio Management

To reference this document use:

https://resolver.tudelft.nl/uuid:24e31756-51d3-48d6-ad2d-6b2498bd4246

More Info

expand_more

Publication Year

2024

Language

English

Graduation Date

29-08-2024

Awarding Institution

Delft University of Technology

Programme

['Mechanical Engineering | Vehicle Engineering | Cognitive Robotics']

Abstract

Reinforcement learning (RL) is a powerful tool where the agents – or “robots” can learn from the environment based on their actions. Reinforcement learning approaches were found successful in combining predicting stock returns and portfolio allocation. Diversification is a critical element for achieving high portfolio returns with a lower level of risk. This work explores the application of reinforcement learning and multi-agent reinforcement learning (MARL) in portfolio management, emphasizing the applicability and increased portfolio performance via strategy diversification. A key contribution of this work is the development and evaluation of a MARL environment incorporating novel diversity measures, correlation, and total variation distance. The findings reveal that while fine-tuned single-agent RL models can demonstrate strong performance, roughly tuned MARL models with diverse agents reflecting a "portfolio of portfolios" paradigm show improved action diversification and portfolio performances. The work also highlights the critical role of long-term robustness testing, algorithm- and problem-specific hyperparameter optimization, and the challenges of adapting MARL methods to financial contexts. This work contributes to the RL research in portfolio management by exploring the use of MARL in portfolio management and discussing the limitations and future work directions.

Files

THESIS_MC_MARL_FOR_PM_AUG2024.... (pdf)

(pdf | 5.03 Mb)

License info not available