Reinforcement Learning for the Dynamic Berth Allocation Problem

Evaluating the Robustness of a Graph Neural Network Agent under Uncertainty of Estimated Time of Arrival

Bachelor Thesis (2026)
Author(s)

L.H.M. Geurts (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

C. March Moya – Mentor (TU Delft - Electrical Engineering, Mathematics and Computer Science)

N. Yorke-Smith – Graduation committee member (TU Delft - Electrical Engineering, Mathematics and Computer Science)

J.W. Böhmer – Graduation committee member (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Faculty
Electrical Engineering, Mathematics and Computer Science
More Info
expand_more
Publication Year
2026
Language
English
Graduation Date
23-06-2026
Awarding Institution
Delft University of Technology
Project
CSE3000 Research Project
Programme
Computer Science and Engineering
Faculty
Electrical Engineering, Mathematics and Computer Science
Downloads counter
7
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

The Dynamic Berth Allocation Problem is a port
scheduling problem where vessels arrive dynamically over time and must be assigned to a berth. A pre-trained Graph Neural Network (GNN) based
reinforcement learning approach solves the problem efficiently but is dependent on estimated times of arrival [11]. However, vessels predominantly arrive later than estimated. this introduces unwanted uncertainty. We evaluate a pre-trained GNN agent under controlled ETA perturbations to create an information gap between the estimated and actual arrival times, using the original full information setting as a reference. The agent is evaluated using a factorial experimental setup over different deviation levels and instances to evaluate robustness under ETA uncertainty. The results show that the agent is robust to optimistic ETA deviations, with only limited performance degradation even at
larger introduced deviations. The robustness appears to mainly stem from the reactive nature of the agent and its associated scheduling process, which
limits the influence of ETA deviations on the decision making process.

Files

License info not available