A. Katsifodimos | TU Delft Repository

Educational Content on YouTube: The Case of Data Systems

Master thesis (2025) - X. Ling (author) , Maria Soledad Pera (mentor) , E.A. Aivaloglou (mentor) , Asterios Katsifodimos (graduation committee member)

The advancement of data systems demands continuous learning, yet traditional educational materials often fall short of meeting evolving learning needs. YouTube has emerged as a widely used platform for informal learning, but its role in data systems education remains underexamine ...

Simulating and Analyzing the Performance of TCP Under Extreme Conditions

Impact of SDN-induced routing changes on TCP BBR

Bachelor thesis (2025) - A.M. Şologon (author) , A. Zapletal (mentor) , Fernando A. Kuipers (mentor) , Asterios Katsifodimos (graduation committee member)

Frequent route changes in modern SDN-based networks are known to severely degrade the performance of TCP Cubic. This degradation is caused by two factors: sudden RTT changes, and packet reordering which Cubic misinterprets as congestion. This research investigates how a modern a ...

Testing the impact of in-transmission bandwidth and delay variation on selected TCP variants

Bachelor thesis (2025) - K. Gniaź (author) , A. Zapletal (mentor) , Fernando A. Kuipers (mentor) , Asterios Katsifodimos (graduation committee member)

The Transmission Control Protocol (TCP) remains the cornerstone of modern network communication, enabling reliable and ordered data delivery across a wide range of network environments. Despite its ubiquity, TCP’s variants’ performance under extreme and highly variable network co ...

Investigating the Impact of ACK Aggregation on TCP Performance using ns-3

Evaluation of Transport and MAC-Layer Aggregation Techniques

Bachelor thesis (2025) - H. Heinczinger (author) , Fernando A. Kuipers (mentor) , A. Zapletal (mentor) , Asterios Katsifodimos (graduation committee member)

Modern TCP congestion control algorithms rely on timely ACK feedback to adjust their parameters. However, some networks deliberately suppress ACKs. This study uses the ns-3 simulator to experiment with the impact of suppressing ACKs on the reverse path on four TCP variants (BBRv3 ...

Simulating and Analyzing the Performance of TCP Under Extreme Conditions

Evaluating the Impact of L4S on TCP Performance

Bachelor thesis (2025) - A. Tabacaru (author) , Fernando A. Kuipers (mentor) , A. Zapletal (mentor) , Asterios Katsifodimos (graduation committee member)

The Low-Latency, Low-Loss, Scalable Throughput (L4S) service aims to support real-time applications by enabling high throughput with sub-millisecond queueing delay. It combines scalable ECN-based congestion control (e.g., TCP Prague) with a Dual-Queue AQM such as DualPI2 to se ...

An experimental evaluation of TCP startup algorithms

How do flow startup mechanisms impact the performance of TCP?

Bachelor thesis (2025) - M. Grigore (author) , Fernando A. Kuipers (mentor) , A. Zapletal (mentor) , Asterios Katsifodimos (graduation committee member)

Most TCP data transfers in the Internet are short. This makes the startup algorithms an important factor that impacts TCP performance. Several startup algorithms have been developed. However, not a lot of research has been conducted into how these behave and interact when used fo ...

Addressing Test Flakiness: Practical Approaches in a Database-Reliant Industrial System

Flaky Tests at Exact

Master thesis (2025) - G.J.B. Vegelien (author) , Arie van Deursen (mentor) , C.E. Brandt (mentor) , Bas Graaf (mentor) , Asterios Katsifodimos (graduation committee member)

In today’s rapidly evolving software landscape, where continuous integration and continuous delivery are paramount, the presence of flaky tests poses a significant obstacle. These tests, exhibiting unpredictable pass/fail behavior, hinder development progress, waste valuable reso ...

Towards Modular Language Semantics of WebDSL: A Case Study of Using Algebraic Effects in Haskell for Language Specification

Master thesis (2024) - A.K. Wolska (author) , A Katsifodimos (graduation committee member) , Jesper Cockx (mentor) , C.B. Poulsen (mentor) , DM Groenewegen (mentor)

WebDSL is a DSL for creating web applications, combining many different aspects and domains of web design in a single language. The dynamic semantics of this language are not defined, despite multiple attempts, abandoned due to complexity of the language and lack of expression of ...

Exploring Test Suite Coverage of Large Language Model–Enhanced Unit Test Generation

A Study on the Ability of Large Language Models to Improve the Understandability of Generated Unit Tests Without Compromising Coverage

Bachelor thesis (2024) - A. Drăgoi (author) , AE Zaidman (mentor) , A. Deljouyi (mentor) , Asterios Katsifodimos (graduation committee member)

Automated software testing is a frequently studied topic in specialized literature. Search-based software testing tools, like EvoSuite, can generate test suites using genetic algorithms without the developer’s input. Large Language Models (LLMs) have recently attracted significan ...

Readability Driven Test Selection

Using Large Language Models to Assign Readability Scores and Rank Auto-Generated Unit Tests

Bachelor thesis (2024) - I. Zaidi (author) , A. Deljouyi (mentor) , AE Zaidman (mentor) , Asterios Katsifodimos (graduation committee member)

Writing tests enhances quality, yet developers often deprioritize writing tests. Existing tools for automatic test generation face challenges in test understandability. This is primarily due to the fact that these tools fail to consider the context, leading to the generation of ...

Reducing LLM Hallucinations with Retrieval Prompt Engineering

Minimising the Need for Re-prompting in Automatic Understandable Test Generation

Bachelor thesis (2024) - A. Mentzelopoulou (author) , A. Deljouyi (mentor) , AE Zaidman (mentor) , Asterios Katsifodimos (graduation committee member)

Automated test generation is the means to produce correct and usable code while maintaining an efficient and effective development process. UTGen is a tool that utilizes a Large Language Model (LLM) to improve the understandability of a test suite generated by a Search-Based Soft ...

Using LLM-Generated Summarizations to Improve the Understandability of Generated Unit Tests

Enhancing Unit Test Understandability: An Evaluation of LLM-Generated Summaries

Bachelor thesis (2024) - N. Djajadi (author) , AE Zaidman (mentor) , A. Deljouyi (mentor) , Asterios Katsifodimos (graduation committee member)

Since software testing is crucial, there has been research on generating test cases automatically. The problem is that the generated test cases can be hard to understand. Multiple factors play a role in understandability and one of them is test summarization, which provides an ov ...

Leveraging E2E Test Context for LLM-Enhanced Test Data and Descriptions

Enhancing Automated Software Testing with Runtime Data Integration

Bachelor thesis (2024) - M.C.A. de Wit (author) , A. Deljouyi (mentor) , AE Zaidman (mentor) , Asterios Katsifodimos (graduation committee member)

Automated software testing plays a critical role in improving software quality and reducing manual testing expenses. However, generating understandable and meaningful unit tests remains challenging, especially with frameworks optimized for coverage like Search-Based Software Test ...

Extending Null Embedding for Deep Neural Network (DNN) Watermarking

Improving the accuracy of the original classification task in piracy-resistant DNN watermarking

Bachelor thesis (2024) - K. ALTINAY (author) , Zekeriya Erkin (mentor) , Devris Isler (mentor) , Asterios Katsifodimos (graduation committee member)

The advancement of Machine Learning (ML) in the last decade has created new business prospects for developers working on ML models. Models that are expensive and time-consuming to design and train can now be outsourced from others to reduce costs using Machine Learning as a servi ...

Watermarking time-series data using DWT

Adapting an existing audio technique to watermark non-medical time series

Bachelor thesis (2024) - M.P. Raave (author) , Zekeriya Erkin (mentor) , Devris Isler (mentor) , Asterios Katsifodimos (graduation committee member)

Data security has become more important over the last few years as data sharing over the world has become trivial. Data ownership therefore becomes critical as data can be very valuable and vulnerable to theft. Watermarking is a technique that can help data owners prove ownership ...

Watermarking of numerical datasets used for ML

A DWT approach for watermarking numerical datasets

Bachelor thesis (2024) - M.C. Crăciun (author) , Zekeriya Erkin (mentor) , Devris Isler (mentor) , Asterios Katsifodimos (graduation committee member)

AI and machine learning have been topics of big interest in the last couple of years, with plenty of applications in many domains. To train these models into useful and desirable tools, a large amount of data is necessary. This data is expensive to collect, becoming one of the mo ...

Cost Estimation for Factorized Machine Learning

Master thesis (2024) - P.H. te Marvelde (author) , R. Hai (mentor) , W. Sun (mentor) , A Katsifodimos (graduation committee member) , S.S. Chakraborty (graduation committee member)

In the realm of machine learning (ML), the need for efficiency in training processes is paramount. The conventional first step in an ML workflow involves collecting data from various sources and merging them into a single table, a process known as materialization, which can introduce inefficiencies caused by redundant data. Factorized ML strives to reduce this by maintaining the original data forms and performing model training on the separate source tables. This approach can lead to significant increases in training efficiency.

However, factorized training does not always reduce cost compared to traditional materialized training. This research tackles this issue by examining the multidimensional cost optimization problem that emerges when deciding between factorized and traditional materialized learning methods. It fills in gaps left by prior research, which focused on CPU-based training, by investigating the cost estimation landscape for factorized ML, with a special emphasis on GPU performance compared to CPUs. The factorized ML framework used in this work is expanded to incorporate GPU training, a topic not explored in previous research. We demonstrate that GPU training exhibits significantly different cost characteristics than CPU training, which has substantial implications for the design of cost models and the optimization of factorized ML.

Through an empirical study, an ML-based cost model is developed that can accurately predict the faster training method for a wide range of scenarios. In an extensive evaluation with real-world datasets, this model achieves an average speedup of 3.8x, versus 0.9x for the state of the art. We also show that it generalizes to scenarios with datasets and hardware settings on which the model is not trained, keeping 82% of training set performance.

Our innovative cost model for factorized ML enables significant time savings in training-intensive scenarios and further underlines the benefits of factorized training. However, effort should be invested into incorporating factorized training into existing ML frameworks so this method of training a model, and our cost model, can be evaluated in a larger set of realistic scenarios.
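The contrast between materialized and factorized training described in the abstract above can be illustrated with a minimal sketch. It is not the thesis's framework or cost model: the tables `S` and `R`, the foreign-key list `fk`, the weights, and both function names are hypothetical, and only a single linear scoring step is shown. The point is that the factorized path computes each `R` row's contribution once instead of once per joined row, yet produces the same scores.

```python
# Hypothetical toy data: a fact table S whose rows reference a dimension table R.
S = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0], [7.0, 8.0]]  # 4 rows, 2 features
fk = [0, 1, 0, 1]                                      # row i of S joins R[fk[i]]
R = [[10.0, 20.0, 30.0], [40.0, 50.0, 60.0]]           # 2 rows, 3 features
w_s = [0.5, -1.0]                                      # weights for S's features
w_r = [0.1, 0.2, 0.3]                                  # weights for R's features


def dot(u, v):
    """Plain dot product of two equal-length lists."""
    return sum(a * b for a, b in zip(u, v))


def materialized_scores(S, fk, R, w_s, w_r):
    """Join first, then score: every joined row repeats its matching R row."""
    joined = [s_row + R[k] for s_row, k in zip(S, fk)]
    w = w_s + w_r
    return [dot(row, w) for row in joined]


def factorized_scores(S, fk, R, w_s, w_r):
    """Score on the source tables: R's part is computed once per R row."""
    r_part = [dot(r_row, w_r) for r_row in R]
    return [dot(s_row, w_s) + r_part[k] for s_row, k in zip(S, fk)]


if __name__ == "__main__":
    print(materialized_scores(S, fk, R, w_s, w_r))
    print(factorized_scores(S, fk, R, w_s, w_r))
```

Whether the factorized path is actually faster depends on factors such as the amount of redundancy in the join and the hardware, which is the decision the abstract's cost model is meant to predict.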

Blind Spot Illumination in LLMs through Data Valuation and Synthetic Sample Generation

Master thesis (2024) - Chun-Chi Chen (author) , Philip Lippmann (mentor) , Jie Yang (mentor) , A Katsifodimos (graduation committee member) , Q. Wang (coach)

Large language models (LMs) are increasingly used in critical tasks, making it important that these models can be trusted. The confidence an LM assigns to its prediction is often used to indicate how much trust can be placed in that prediction. However, a high confidence can be i ...

Polka: A Differentiated Deployment System for Online and Streamed Games, Meta-verses, and Modifiable Virtual Environments

Master thesis (2024) - J.D. Eickhoff (author) , Alexandru Iosup (mentor) , Fernando A. Kuipers (mentor) , A. Katsifodimos (graduation committee member) , J Donkervliet (graduation committee member)

Online gaming is the world’s largest entertainment industry by revenue, and supports over 3 billion consumers worldwide. Many of the world’s most popular online games must manage millions of concurrent players through a single unified service. Achieving performant and scalable online games is challenging. Online games are subject to stringent quality of service requirements, notably extremely low response times, with at most 50 ms being considered acceptable. Unlike many other types of applications, the performance of online games depends to a large degree on the resources available on end-user devices. These devices are typically heterogeneous, limited in compute and network resources, and subject to unpredictable changes in resource availability.

Addressing this challenge, we propose in this work the concept of differentiated deployment, which allows online games to selectively manage and scale online-game systems with fine granularity in response to changes in available resources. We design Polka, a framework for online games which supports differentiated deployment. We then implement PolkaDOTS, an open-source proof of concept of the Polka framework built in an industry standard game development ecosystem.

We evaluate our approach using Dither, a custom-built experiment runner for large scale distributed experiments on online games. We use Dither to perform real-world experiments on a representative Minecraft-like game, Opencraft 2, built on the PolkaDOTS stack, and analyze the impact of various differentiated deployment scenarios. From these experiments, we find that differentiated deployment can decrease performance variability of online-game servers, and decrease the response time experienced by players by up to 32%. Most importantly, we show that differentiated deployment enables novel deployment techniques, including switching from local rendering to cloud-based rendering (i.e., cloud gaming) at runtime.

Building an Event-Driven Timing Simulator for Embedded Hybrid GPU-AI Accelerator

Master thesis (2023) - D. Karatza (author) , S. Wong (mentor) , Asterios Katsifodimos (graduation committee member) , G Keramidas (graduation committee member)

The current trend towards the integration of artificial intelligence (AI) and graphics processing unit (GPU) technologies has resulted in the development of embedded hybrid GPU-AI accelerators, which offer high computational power and energy efficiency. One of the key challenges in designing such accelerators is ensuring their timing correctness, as any timing violation may lead to system failure and incorrect results. To address this challenge, timing simulators have been proposed as a promising solution, as they enable accurate and efficient timing analysis of these complex systems. Nevertheless, such simulators have their limitations: detailed ones, such as cycle-accurate simulators, have high accuracy but exhibit high overhead, while faster approaches, like event-driven simulators, usually cannot achieve high accuracy levels. Therefore, the question arises: How can we implement a timing simulator to achieve a balanced trade-off between accuracy and execution time?

In this context, we introduce NEOX-V, a cutting-edge RISC-V based GPU Processor optimized for GPGPU and AI workloads. However, the current NEOX-V product lacks timing information in its simulator. This has prompted us to use it as a case study to bridge the gap between accuracy and execution time in timing simulators. This is achieved through the implementation of a new feature: a timing simulator that utilizes event-driven modeling.

To assess the proposed simulator's accuracy and effectiveness, we employ a comprehensive validation framework, using diverse workloads and configurations, from simple micro-benchmarks to intricate AI tests. The results demonstrate that the timing simulator achieves an accuracy error below 8% when compared to the RTL equivalent for all applications, with a marginal increase in actual simulation time of only 0.7%. It is worth noting that the timing simulator's utility extends beyond predicting execution time; it also plays a crucial role in verifying the existing design and uncovering its limitations.

Overall, this thesis makes a significant contribution to the field of computer architecture by providing a powerful tool for the design, development, and evaluation of an embedded hybrid GPU-AI accelerator called NEOX-V. It is our hope that this work will inspire further research and development in this exciting and rapidly evolving field.
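The event-driven modeling mentioned in the abstract above can be sketched in a few lines. This is a generic illustration of the technique, not the NEOX-V simulator: the task list, the latencies, and the `simulate` function are hypothetical, and resource contention is deliberately not modeled. The simulator keeps a time-ordered queue of events and jumps straight from one event to the next, rather than stepping through every clock cycle as a cycle-accurate simulator would.

```python
import heapq


def simulate(tasks):
    """Run a toy event-driven timing simulation.

    tasks: list of (issue_time, latency, name) tuples (hypothetical workload).
    Returns (finish_time, log), where log lists (completion_time, name)
    in the order the completions are processed.
    """
    events = []  # min-heap ordered by (timestamp, sequence number)
    for seq, (issue, latency, name) in enumerate(tasks):
        heapq.heappush(events, (issue, seq, "issue", name, latency))
    next_seq = len(tasks)  # unique tie-breaker for events with equal timestamps

    now = 0
    log = []
    while events:
        # Jump directly to the next event; idle cycles in between cost nothing.
        now, _, kind, name, latency = heapq.heappop(events)
        if kind == "issue":
            # Issuing a task schedules its completion `latency` time units later.
            heapq.heappush(events, (now + latency, next_seq, "done", name, 0))
            next_seq += 1
        else:
            log.append((now, name))
    return now, log


if __name__ == "__main__":
    finish, log = simulate([(0, 5, "load"), (2, 1, "add"), (3, 10, "mul")])
    print(finish, log)
```

The work per run scales with the number of events rather than the number of simulated cycles, which is where the speed advantage over cycle-accurate simulation comes from; the accuracy then depends on how well the per-event latencies reflect the real hardware.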