Impact of Considering Artificial Worst-Case Scenarios Within Clustering Algorithms

None, None

Impact of Considering Artificial Worst-Case Scenarios Within Clustering Algorithms

A case study through three newly adapted clustering algorithms

Bachelor Thesis (2026)

Author(s)

R.P.L. Novosel (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

G.A. Morales España – Mentor (TU Delft - Electrical Engineering, Mathematics and Computer Science)

M.B. Elgersma – Mentor (TU Delft - Electrical Engineering, Mathematics and Computer Science)

J.A. Baaijens – Graduation committee member (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Faculty

Electrical Engineering, Mathematics and Computer Science

Temporal aggregation Representative days Extreme-preserving clustering Worst-case scenarios Non-dominated sorting

To reference this document use

https://resolver.tudelft.nl/uuid:652a43ae-3680-4b6c-a262-9e9fd8375701

More Info

expand_more

Publication Year

2026

Language

English

Graduation Date

24-06-2026

Awarding Institution

Delft University of Technology

Project

CSE3000 Research Project

Programme

Computer Science and Engineering

Faculty

Electrical Engineering, Mathematics and Computer Science

Downloads counter

9

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Planning a long-term energy system relies on models that simulate system operation over many years at an hourly level, which is computationally expensive. A common remedy is temporal aggregation: grouping similar time periods and representing each group by one typical period to shrink the dataset the model must process. This speeds up the computation but tends to average away rare yet demanding conditions, such as days with high energy demand and little energy availability. These extreme periods, however, often determine how much capacity the system requires. This paper introduces three adaptations of widely used clustering algorithms that deliberately embed synthetic worst-case periods into the clustering process, ensuring the representative periods do not ignore the most demanding conditions. We evaluate them against four standard baselines (K-Means, K-Medoids, K-Medoids WC (worst-case), and Hull clustering) by measuring how closely each method's investment decisions match those of a benchmark model that uses the full, unaggregated data: a gap we call relative regret. The standard methods often require a large number of representative periods to approach the benchmark, whereas the proposed worst-case method WCA-K-Means reaches near-benchmark decisions with far fewer periods. By capturing the conditions that drive capacity needs without partitioning the data into excessive detail, it represents a full year with a much smaller dataset, giving planners results that closely match a full-resolution model while substantially reducing the computational cost of solving the energy model.

Files

Research_project_5928761_final... (pdf)

(pdf | 1.84 Mb)

License info not available