M.M. de Weerdt | TU Delft Repository

Introducing flexibility in any-start-time safe interval path planning

A case study on the Dutch railway network

Master thesis (2025) - E.A. Kemmeren (author) , M.M. de Weerdt (mentor) , I.K. Hanou (graduation committee member) , R.M.P. Goverde (graduation committee member)

During the daily operation of the railway network, ProRail is responsible for handling delays and planning ad hoc train movements. Train handling documents aid the traffic controllers in common situations. But when multiple trains are delayed, and these documents do not apply, th ...

Dominance-Aware Generation of Near-Optimal Alternatives in Energy System Models

Master thesis (2025) - L.W.L. van de Laar (author) , M.M. de Weerdt (mentor) , G.A. Morales España (mentor) , G. Neustroev (mentor) , José L. Rueda (graduation committee member)

Optimization models are widely used in energy system planning to identify cost-effective investment strategies. However, relying solely on a single optimal solution can be misleading, as it fails to account for model uncertainty, competing objectives, and stakeholder preferences. ...

Optimization models are widely used in energy system planning to identify cost-effective investment strategies. However, relying solely on a single optimal solution can be misleading, as it fails to account for model uncertainty, competing objectives, and stakeholder preferences. To address this, near-optimal alternatives, solutions that are close in cost to the optimum but structurally different, are increasingly used to support robust and flexible decision-making.

This thesis explores the generation and evaluation of near-optimal alternatives within energy systems, with a focus on improving the decision relevance of the generated alternatives. This thesis introduces a unified analytical framework, formalizing existing Modeling to Generate Alternatives (MGA) methods using weight vector formulations. This formulation enables a clearer comparison of different techniques that generate these alternatives. This analysis highlights the limitations of current evaluation metrics, particularly their inability to distinguish decision-relevant alternatives from decision-irrelevant ones.

To overcome this gap, the thesis proposes a novel evaluation metric based on dominance relations from multi-objective optimization. This metric identifies non-dominated alternatives, those not strictly worse than any other across all decision variables, as decision-relevant. The thesis introduces a new method that uses Directionally Weighted Variables to generate alternatives aligned with this dominance criterion.

The proposed approach is evaluated using a stylized energy investment model and benchmarked against existing MGA techniques. Results show that traditional methods tend to generate fewer non-dominated alternatives, while the new method generates more non-dominated alternatives within the near-optimal space. This work contributes a new perspective on alternative generation, bridging the gap between mathematical optimality and practical decision support.

Mechanism Design for Virtual Power Plants with Strategic Agents

Master thesis (2025) - W.J.M. Verschuren (author) , H. Xie (mentor) , M.M. de Weerdt (mentor) , Jochen Cremer (mentor) , M. Cvetkovic (graduation committee member)

As power systems increasingly rely on renewable energy, grid services traditionally supplied by central plants must increasingly be sourced from distributed energy resources (DERs). Virtual power plants (VPPs) aggregate DERs to act as a single entity, but coordination is complica ...

As power systems increasingly rely on renewable energy, grid services traditionally supplied by central plants must increasingly be sourced from distributed energy resources (DERs). Virtual power plants (VPPs) aggregate DERs to act as a single entity, but coordination is complicated by information asymmetry, possibly resulting in strategic behaviour. This thesis studies how we can design a mechanism for a commercial VPP, having to satisfy a fixed commitment, while optimising the revenue from the VPP operator.
We first develop a tractable, multi-period VPP model with linear costs, local and temporal constraints for DERs and soft system-wide commitments enforced via deviation penalties. On top of this model we design and compare four mechanisms: first-price sealed bid (FPSB), uniform pricing, Vickrey–Clarke–Groves (VCG) and Arrow–d’Aspremont– Gerard-Varet (AGV). We evaluate them on revenue optimality, weak budget balance, incentive compatibility, individual rationality and scalability. Furthermore, we investigate how the composition of a VPP’s portfolio could inform mechanism design choices. FPSB realises payments equalling costs under truthful reports and remains competitive for small strategic fractions, but overpayment grows with the share of strategic agents and with cost dispersion. Uniform pricing is comparatively insensitive to the strategic fraction but highly sensitive to cost dispersion, often leading to large overpayments. VCG is strategy-proof and insensitive to strategic behaviour, yet externality payments increase with cost dispersion and raise total payouts. AGV keeps the payment-to-cost ratio near or below one by relying on expected externalities and scaling, improving operator viability but potentially violating individual rationality in instances. These results yielded the following guidelines regarding the suitability of mechanisms. FPSB for low strategic participation, uniform pricing for homogeneous portfolios, VCG when truthfulness is vital and external funding is possible, and AGV when operator viability is the hard constraint with safeguards for individual rationality.F

Heuristics for Multivalued Decision Diagrams in Branch & Bound

Master thesis (2025) - J.K.K. Tjong (author) , M.M. de Weerdt (mentor) , W.-J. van Hoeve (graduation committee member) , Yukihiro Murakami (graduation committee member)

Decision diagrams have steadily become more prominent in the field of combinatorial optimization, being able to outperform the state-of-the-art in e.g. scheduling problems[13]. They have proven even more capable with the introduction of methods such as decision diagram-based Bran ...

Decision diagrams have steadily become more prominent in the field of combinatorial optimization, being able to outperform the state-of-the-art in e.g. scheduling problems[13]. They have proven even more capable with the introduction of methods such as decision diagram-based Branch & Bound. Often, layer-based binary decision diagram (BDD) encodings are used to encode the problem domain, where a layered structure determines the decisions and decision variables. Recently, state-based multivalued decision diagram (MDD) encodings have also been considered. These do not rely on a layered structure, as instead, each state makes its decisions on the variables independently of other states. This allows for more flexible decision-making as states do not have to compromise with other states on the decisions they make.

This thesis compares these newer state-based MDDs with the commonly used layer-based BDDs, while also introducing and evaluating heuristics for the state-based MDDs. These heuristics include a new beam restriction heuristic that limits the branching factor of the MDDs, a dynamic variable ordering strategy adapted for the new state-based context, and a new local bound for the maximum independent set problem (MISP).

The experiments of this thesis show that the state-based MDDs generally outperform the layer-based BDDs in both runtime and search tree size. Only for some instances with low graph densities, the state-based MDDs were slower. This is resolved with the newly introduced beam restrictions, which can significantly lower the runtime. This speedup is also shown for the graph coloring problem, although the application of the beam restrictions is not as straightforward there as with the MISP. Both the dynamic variable ordering and the new local bound for the MISP show great promise in increasing the efficiency of the search, but are both held back by the additional overhead they introduce. Fortunately, these two techniques can share the added overhead while gaining the combined benefits, resulting in great performance for the MISP when both methods are used together.

CP for Scheduling under Uncertainty

A Comparative Study of STNUs against Proactive and Reactive Approaches

Bachelor thesis (2025) - M.C. Steeghs (author) , M.M. de Weerdt (mentor) , K.C. van den Houten (mentor) , L.R. Planken (mentor) , J.A. Baaijens (graduation committee member)

This report investigates the effectiveness of Simple Temporal Networks with Uncer- tainty (STNUs) for solving the Stochastic Flexible Job-Shop Scheduling Problem with Sequence-Dependent Setup Times (SFJSP-SDST), comparing it against proactive and reactive Constraint Programming ( ...

Algorithms for dynamic scheduling in manufacturing, towards digital factories: Improving Deadline Feasibility and Responsiveness via Temporal Networks

Bachelor thesis (2025) - I. Hedea (author) , L.R. Planken (mentor) , K.C. van den Houten (mentor) , M.M. de Weerdt (mentor) , J.A. Baaijens (graduation committee member)

Modern manufacturing systems must meet hard delivery deadlines while coping with stochastic task durations caused by process noise, equipment variability, and human intervention. Traditional deterministic schedules break down when reality deviates from nominal plans, triggering c ...

Algorithms for dynamic scheduling in manufacturing, towards digital factories

Flexible Job Shop Scheduling Problems (FJSPs) with generalized time-lags and no-wait constraints

Bachelor thesis (2025) - B. Paramon (author) , M.M. de Weerdt (mentor) , L.R. Planken (mentor) , K.C. van den Houten (mentor) , J.A. Baaijens (graduation committee member)

This study investigates scheduling strategies for the stochastic duration flexible job-shop problem with no-wait and general time lags constraints (FJSP/NW-GTL). Progress in Constraint Programming (CP) and temporal-networks has renewed interest in assessing the strengths and limi ...

Comparing Dynamic Scheduling Algorithms for Multi-Mode RCPSP/max under Uncertainty

A Comparative Analysis on the Proactive, Reactive, and STNU algorithms with Generalised Time-Lags and No-Wait Constraints

Bachelor thesis (2025) - J.G. Meerovici Goryn (author) , L.R. Planken (mentor) , K.C. van den Houten (mentor) , M.M. de Weerdt (mentor) , J.A. Baaijens (graduation committee member)

This study investigates the performance of three dynamic scheduling approaches—proactive, reactive, and STNU-based—for solving the Multi-Mode Resource-Constrained Project Scheduling Problem with maximal time-lags and no-wait constraints (MMRCPSP/max) in uncertain environments. T ...

Self-Supervised Learning with Formal Guarantees for Energy Systems Optimization

Primal-Dual Solutions, Objective Bounds, and Benders Cuts

Master thesis (2025) - B.M. Jacobs (author) , M.M. de Weerdt (mentor) , G. Neustroev (mentor) , Simon H. Tindemans (graduation committee member)

The transition towards renewable energy requires long-term energy system planning, which depends on solving constrained optimization (CO) problems. These CO problems are becoming increasingly complex, particularly due to the variability introduced by renewable energy sources. Tra ...

The transition towards renewable energy requires long-term energy system planning, which depends on solving constrained optimization (CO) problems. These CO problems are becoming increasingly complex, particularly due to the variability introduced by renewable energy sources. Traditional optimization methods struggle to scale with this growing model complexity. In contrast, machine learning approaches allow faster solution computation, but offer only approximate solutions with no guarantees on solution quality, thereby limiting reliability and interpretability in planning applications.

This research addresses these limitations by exploring the use of neural networks to predict feasible solution pairs for the primal and dual formulations of the CO problem, enabling approximate solutions accompanied by bounds on suboptimality. Self-supervised primal-dual learning (PDL) is adapted and extended to produce paired feasible solutions for both the primal and dual formulations of an economic dispatch problem. Feasibility in the primal network is enforced through differentiable repair and completion layers, including novel domain-specific extensions that adjust supply-demand balancing priorities, which were found to be essential for predictive accuracy. This then exposes a structural limitation of PDL: while repair and completion layers are essential for primal learning, they prevent the learning of meaningful dual predictions. To address this, the dual network is trained using a modified loss function that directly optimizes the dual problem, enabling the use of a completion layer adopted from the literature to ensure dual feasibility. An additional novel classification-based layer that incorporates prior knowledge on the dual variables further improves the dual prediction quality when applied to the economic dispatch problem. Finally, the trained networks are integrated into Benders decomposition, a technique that breaks CO problems into easier, independent problems. This enables a hybrid approach: approximate solutions with bounds on suboptimality can be obtained swiftly, whereas exact solving remains available if necessary, retaining all theoretical convergence and optimality guarantees and potentially reducing computational time.

The proposed framework is evaluated on a generation expansion planning problem, where the economic dispatch subproblems are solved iteratively using the trained networks. The results demonstrate a theoretically grounded and empirically validated proof of concept for producing solutions with quality guarantees using learning-based methods, which is generally applicable to any problem compatible with Benders decomposition where the resulting subproblem admits a conic formulation. However, the results do not show conclusive speed-ups due to a mismatch between the training data and the data encountered during Benders iterations. By demonstrating how neural networks can be used to generate approximate, feasible solutions accompanied by theoretical guarantees on solution quality, this research contributes to the advancement of scalable and reliable constrained optimization methods for energy systems.

Optimization for Production Planning using Probabilistic Simple Temporal Networks

Master thesis (2025) - A.G. Kalandadze (author) , M.M. de Weerdt (mentor) , K.C. van den Houten (mentor)

Production planning in the biomanufacturing sector presents significant challenges due to uncertainties in job durations caused by biological variability, environmental conditions, and raw material quality. Traditional scheduling methods typically fail to adapt to these uncertain ...

Improving group project matchings of TU Delft’s Project Forum

Master thesis (2025) - P.A. Louchtch (author) , M.M. de Weerdt (mentor) , M.A. Migut (graduation committee member)

This thesis investigates improving the group project matching algorithm of TU Delft's Project Forum platform. We formalize the matching problem as a many-to-one, one-sided matching with group formation, where students have preferences over project topics and may wish to pregroup ...

Decision Diagram Focused Learning

Master thesis (2025) - J. Schaap (author) , M.M. de Weerdt (mentor) , J.G.M. van der Linden (mentor) , K. Sidorov (mentor) , T.E.P.M.F. Abeel (graduation committee member)

Decision-Focused Learning (DFL) focuses on a setting where a system gets as input some features and needs to predict coefficients to a downstream optimization problem. Classically, one would apply a two-stage solution, which trains the predictor as a regression task and only uses ...

“From iMage to Market”: Machine-Learning-Empowered Fruit Supply

Doctoral thesis (2025) - J. Wen (author) , M.M. de Weerdt (promotor) , T.E.P.M.F. Abeel (promotor)

Artificial intelligence (AI) has become a widely discussed and transformative technology, with its adoption growing across industries to drive insights and impact. In this thesis, we explore how AI methods and algorithms can facilitate the operation of soft-fruit supply chains, u ...

Artificial intelligence (AI) has become a widely discussed and transformative technology, with its adoption growing across industries to drive insights and impact. In this thesis, we explore how AI methods and algorithms can facilitate the operation of soft-fruit supply chains, using strawberries as a case study.

The thesis begins by presenting the general background and various perspectives from related works on how AI and machine learning (ML) have been applied to address problems in agricultural or horticultural practices. This includes tasks that, while not directly optimizing supply strategies, still contribute to solving broader challenges. In a nutshell, this thesis categorizes the scope of study into three scales: the single-fruit scale, the greenhouse scale, and the market scale. Within each scale, we review the existing research, identify knowledge gaps, and introduce robust and applicable methodologies capable of dealing with real-world conditions.

Since no publicly available datasets met the requirements of the research plan, we established several datasets for research on the soft-fruit supply chain through collecting, annotating, and (pre-)processing data. These newly curated datasets not only support the research presented in this thesis but also lay a foundation for future research from various perspectives. Details about these datasets are introduced in Chapter 2. Moreover, we conceptualize the process of gathering longitudinal observations from growth monitoring images as a multiple object tracking (MOT) task. We named the image collection and their MOT annotations as ``The Growing Strawberries (GSD)''. The computer vision challenge that GSD brings are further benchmarked and discussed in Chapter 3. Following this, the core contributions of the thesis is presented from Chapter 3 to 6, each corresponding to a published paper or one currently under review. Finally, Chapter 7 summarizes the research findings, answering the research questions proposed in Chapter 1 and discussing the overall work of the thesis.

We discuss these contributions for each of the three mentioned scales separately:

At the fruit scale, we designed and analyzed novel methodologies to keep track of the fruit growths and to predict key properties, including both external characteristics like ripeness and internal qualities such as sweetness. For the ripeness, we propose to use appearance properties, mainly the hue, as an objective metric to quantify it. For the sweetness, we trained deep neural networks to perform non-destructive prediction using environmental and image data, individually and integrally.

Our employment of color analysis and ML models provides a non-destructive and generalizable manner that ensures consistency when upstream and downstream parties in a supply chain estimate the properties of fruits. Meanwhile, the models perform comparatively with laboratory benchmarks even under imperfect, outdoor data collection. We further demonstrated the model in a mobile app to further facilitate adoption in the field.

By benchmarking state-of-the-art MOT algorithms on GSD, we illustrated the new challenges that are brought by this use case: first, the MOT objects change appearance during the tracking due to their biological development, and second, sparse frame rates introduce irregular movements from image to image. We showcased how fruit properties, such as ripeness, change over its life cycle. The results not only provide quantitative measurements that describe the fruit's biological development, but also depict the pain points of current MOT algorithms' predictions. In the meantime, by quantifying these changes over the biological development, we also retrieval relevant information and datasets to support predictions of the changes.

At the greenhouse scale, we designed a framework that optimizes the timing of fruit harvesting by integrating the aforementioned quantified changes over biological development, based on sequential demands about the desired quantities to be harvested. Essentially, the framework makes fruit-specific decisions on dates of harvests by leveraging the monitoring data. The decisions are thus made to enhance both current and future demand-fulfillment capabilities. At each stage of this framework, we evaluated various methods and discussed their effectiveness in achieving the stage targets. For example, how to process the infield data to achieve coherent functions about the ripeness development, how to predict future changes, how to include different perspectives in the optimization model, and etc. As the decisions are made for each specific fruit, the work also demonstrates significant potential for integration with mobile apps and harvesting robots. On top of that, the information retrieval function can also serve as a standalone application to provide objective fruit-level quality assessment.

At the market scale, we focus on the portfolio optimization of a grower under a widely applied mechanism of the market system: the majority of demands for harvests are predetermined through advance contracts, which also serves as an a priori condition of the solution proposed at the greenhouse level. The local market, with dynamic prices and demands, can be used to save losses from the difference in contracted demands and the actual yield. To mitigate outlying decision failures, we introduced the ``smart predict-then-optimize (SPO)'' method, which trains models to predict future yield and local market prices.
Our results illustrate that SPO loss primarily affects the bias layer in neural networks, contrasting with models trained using mean squared error (MSE). This difference essentially leads to more conservative estimations in decision-making scenarios, and also motivates and highlights the importance of effective MSE-based pre-training. Additionally, our study reveals how SPO loss makes models interact when multiple neural networks are trained to predict decision parameters with diverse functions. This insight expands the applicability of SPO loss across a broader range of use cases and model architectures, underscoring its contribution to the field of decision-focused learning.

In conclusion, this thesis introduces diverse data-driven methodologies to tackle the distinct tasks involved in optimizing fruit supply, using strawberries as a case study. Central to our approach is the effective utilization of data, which serves as the foundation for solutions that span from fruit-level evaluations to market-level planning. By leveraging analytics of non-destructive data, our solutions provide objective estimations of fruit quality, fostering a more consistent shared understanding between sellers and buyers while reducing potential food waste. Overall, these advancements push the boundaries of AI in supporting decision-making during the supply of soft fruits, particularly for smaller growers. The findings not only empower more efficient and sustainable supply chain operations but also highlight the strong potential for many practical real-world applications.

Optimization under Uncertainty through Problem Reformulations

Doctoral thesis (2025) - G. Veviurko (author) , M.M. de Weerdt (promotor) , J.W. Böhmer (copromotor)

The research in this thesis falls within the realm of optimization under uncertainty, a crucial area in computer science and mathematics with broad applications in power systems, finance, machine learning, healthcare, and more. This thesis presents three main contributions across ...

Decentralized Multi-Agent Conflict Resolution in Path Planning

Master thesis (2024) - M.N. Stroia (author) , K.G. Langendoen (mentor) , M.M. de Weerdt (mentor) , Tom van Groeningen (mentor)

Path finding is an important component in solving a wide array of engineering problems, ranging from video games to real-life applications such as automated warehouse management and autonomous vehicles.
Path finding algorithms are designed to solve complex problems, and in o ...

Path finding is an important component in solving a wide array of engineering problems, ranging from video games to real-life applications such as automated warehouse management and autonomous vehicles.
Path finding algorithms are designed to solve complex problems, and in order to do so, assumptions are necessary to simplify the problems.
While these assumptions are important, using them makes the obtained algorithms less applicable to a real-life scenario, and as such, verifying how lifting some of them would affect the obtained results is worth pursuing.

Two main assumptions were identified and subsequently lifted.
First, classic multi-agent path finding algorithms use a centralized approach, where solutions are computed before execution.
This results in a long computation period followed by execution. Lifting this assumption results in a decentralized approach where agents solve conflicts on the go, while approaching their target.
The second assumption made by state of the art algorithms is that agents participating in a multi-agent path finding problem share a common goal: minimizing a global cost function.
This is not always applicable, as in a real-life scenario participating agents can have selfish goals.
This assumptions has been lifted by allowing agents to negotiate their paths by trading with the other participants to create better solutions for themselves.

The Selfish Localized Pathfinding (SLP) algorithm has been designed to lift these assumptions. It describes a decentralized algorithm that allows participating agents to negotiate their paths, which makes a good candidate for an application closer to real-life.

The SLP algorithm has been tested in order to evaluate its performance, both in terms of its ability to solve a set of test cases, and in terms of the cost incurred by the participating agents.
SLP performed well in varied domains.
SLP solved significantly more cases than Conflict Based Search, a centralized state of the art path finding algorithm.
This comes at the expense of an increase in the path lengths obtained by the algorithm.
This downside is offset by the significant decrease in the time required to solve problems, which can be divided in small clusters due to the decentralized approach of the algorithm.
On the whole, SLP can provide a good alternative to Conflict Based Search.

Decision-Focused Learning for Scheduling Problems with Uncertainty in the Constraints

Master thesis (2024) - A. Marinov (author) , M.M. de Weerdt (mentor) , K.C. van den Houten (mentor) , D.M.J. Tax (coach)

When addressing combinatorial optimization problems, the focus is predominantly on their computational complexity, and it is often forgotten to look at the bigger picture. As a result, it is common to miss critical details which could play a major role in the overall process. One ...

Detecting Patterns in Train Position Data of Trains in Shunting Yards

Analysis of Arrival Time Distributions and Delays

Bachelor thesis (2024) - A.C. Krudde (author) , M.M. de Weerdt (mentor) , I.K. Hanou (mentor) , J. Sun (graduation committee member)

Shunting yards are locations next to train stations that serve as parking places for trains when they are not in operation and often contain facilities for maintenance and cleaning for passenger trains. Planning of the tasks regarding shunting trains involves routing, assignment ...

Learning Patterns in Train Position Data

Classifying locations by identifying station specific patterns

Bachelor thesis (2024) - I.Y. Smilenov (author) , M.M. de Weerdt (mentor) , I.K. Hanou (mentor) , J. Sun (graduation committee member)

Solutions for the Train Unit Shunting Problem are constantly being researched and improved to be- come more efficient and match the needs of train transport in the Netherlands. For this reason, we are exploring new ways to find patterns in the train data to identify where those s ...

Examining Manual Solutions of the Train Unit Shunting Problem to find Train Type Patterns

Bachelor thesis (2024) - M. van Pelt (author) , M.M. de Weerdt (mentor) , I.K. Hanou (mentor) , J. Sun (graduation committee member)

This paper analyses manually realised solutions to the Train Unit Shunting Problem (TUSP) to find patterns in train type. The parking element is most important for the TUSP. Therefore, this research specifically investigates the presence of train type patterns in parking track an ...

Generalisation Ability of Proper Value Equivalence Models in Model-Based Reinforcement Learning

Bachelor thesis (2024) - S. Bratus (author) , J. He (mentor) , M.M. de Weerdt (coach) , F.A. Oliehoek (graduation committee member)

We investigate the generalization performance of predictive models in model-based reinforcement learning when trained using maximum likelihood estimation (MLE) versus proper value equivalence (PVE) loss functions. While the more conventional MLE loss aims to fit models to predict ...