Reinforcement Learning for the Dynamic Berth Allocation Problem

None, None

Reinforcement Learning for the Dynamic Berth Allocation Problem

Evaluating the Robustness of a Graph Neural Network Agent under Uncertainty of Estimated Time of Arrival

Bachelor Thesis (2026)

Author(s)

L.H.M. Geurts (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

C. March Moya – Mentor (TU Delft - Electrical Engineering, Mathematics and Computer Science)

N. Yorke-Smith – Graduation committee member (TU Delft - Electrical Engineering, Mathematics and Computer Science)

J.W. Böhmer – Graduation committee member (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Faculty

Electrical Engineering, Mathematics and Computer Science

Reinforcement learning ETA Berth Allocation Problem Graph Neural Networks (GNN) Dynamic Berth Allocation Problem ETA Uncertainty

To reference this document use

https://resolver.tudelft.nl/uuid:224eb7de-0947-4c8e-8509-8f9396daa4af

More Info

expand_more

Publication Year

2026

Language

English

Graduation Date

23-06-2026

Awarding Institution

Delft University of Technology

Project

CSE3000 Research Project

Programme

Computer Science and Engineering

Faculty

Electrical Engineering, Mathematics and Computer Science

Downloads counter

7

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

The Dynamic Berth Allocation Problem is a port
scheduling problem where vessels arrive dynamically over time and must be assigned to a berth. A pre-trained Graph Neural Network (GNN) based
reinforcement learning approach solves the problem efficiently but is dependent on estimated times of arrival [11]. However, vessels predominantly arrive later than estimated. this introduces unwanted uncertainty. We evaluate a pre-trained GNN agent under controlled ETA perturbations to create an information gap between the estimated and actual arrival times, using the original full information setting as a reference. The agent is evaluated using a factorial experimental setup over different deviation levels and instances to evaluate robustness under ETA uncertainty. The results show that the agent is robust to optimistic ETA deviations, with only limited performance degradation even at
larger introduced deviations. The robustness appears to mainly stem from the reactive nature of the agent and its associated scheduling process, which
limits the influence of ETA deviations on the decision making process.

Files

Paper_Research_Project_Lasse_G... (pdf)

(pdf | 0.286 Mb)

License info not available