Feature Engineering in Reinforcement Learning for Algorithmic Trading

None, None

Feature Engineering in Reinforcement Learning for Algorithmic Trading

Investigating the Effects of State Representation on Trading Agent Performance in the Forex Market

Bachelor Thesis (2025)

Author(s)

F.A. van Oosterhout (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

N. Yorke-Smith – Mentor (TU Delft - Algorithmics)

Antonis Papapantoleon – Mentor (TU Delft - Applied Probability)

M.A. Sharifi Kolarijani – Mentor (TU Delft - Team Amin Sharifi Kolarijani)

Julia Olkhovskaya – Graduation committee member (TU Delft - Sequential Decision Making)

Faculty

Electrical Engineering, Mathematics and Computer Science

Reinforcement Learning Feature Engineering Artificial Intelligence Algorithmic Trading Deep Q-Network Quantitative Finance Forex Trading

To reference this document use:

https://resolver.tudelft.nl/uuid:b50b1185-38d6-4385-a8a5-40ddc67e5567

More Info

expand_more

Publication Year

2025

Language

English

Graduation Date

24-06-2025

Awarding Institution

Delft University of Technology

Project

['CSE3000 Research Project']

Programme

['Computer Science and Engineering']

Faculty

Electrical Engineering, Mathematics and Computer Science

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

This study explores how different features impact a Reinforcement Learning agent's performance in forex trading. Using a Deep Q-Network (DQN) agent and EUR/USD data from 2022-2024, we found that performance is highly sensitive to the information provided. Key findings show that for feature types like momentum and volatility, a single indicator outperformed a combination of them, as the latter tended to introduce noise. Including information about the agent's own status, such as its current trade duration, was beneficial. Counter-intuitively, providing more historical data consistently worsened performance, leading to overfitting where the agent memorized training data rather than learning general strategies. The main conclusion is that creating an effective state representation is a trade-off; the complexity of the input data must match the learning algorithm's ability to process it without overfitting.

Files

RP_G77_Final_Paper_Finn_V3.pdf

(pdf | 4.71 Mb)

License info not available