A. van Deursen | TU Delft Repository

EDATA

Energy Debugging And Testing for Android

Conference paper (2025) - Erik Blokland (author), Luis Cruz (author), A. van Deursen (author)

Energy consumption of software is becoming increasingly important in today’s mobile-focused world, but knowledge and techniques with which to measure energy consumption have lagged behind. This paper introduces a methodology for measuring the energy consumption of Android apps at ...

A Qualitative Investigation into LLM-Generated Multilingual Code Comments and Automatic Evaluation Metrics

Conference paper (2025) - J.B. Katzy (author), Yongcheng Huang (author), Gopal Raj Panchu (author), Maksym Ziemlewski (author), Paris Loizides (author), Sander Vermeulen (author), A. van Deursen (author), M. Izadi (author)

Large Language Models are essential coding assistants, yet their training is predominantly English-centric. In this study, we evaluate the performance of code language models in non-English contexts, identifying challenges in their adoption and integration into multilingual workf ...

Sustainable Machine Learning Retraining

Optimizing Energy Efficiency Without Compromising Accuracy

Conference paper (2025) - Lorena Poenaru-Olaru (author), J. Sallou (author), Luis Cruz (author), Jan S. Rellermeyer (author), A. van Deursen (author)

The reliability of machine learning (ML) software systems is heavily influenced by changes in data over time. For that reason, ML systems require regular maintenance, typically based on model retraining. However, retraining requires significant computational demand, which makes i ...

Improving the Reliability of Failure Prediction Models through Concept Drift Monitoring

Conference paper (2025) - L. Poenaru-Olaru (author), Luis Cruz (author), Jan S. Rellermeyer (author), A. van Deursen (author)

Failure prediction models can be significantly beneficial for managing large-scale complex software systems, but their trustworthiness is severely affected by changes in the data over time, also known as concept drift. Thus, monitoring these models against concept drift and retra ...

The Heap: A Contamination-Free Multilingual Code Dataset for Evaluating Large Language Models

Conference paper (2025) - J.B. Katzy (author), R.M. Popescu (author), A. van Deursen (author), M. Izadi (author)

The recent rise in the popularity of large language models has spurred the development of extensive code datasets needed to train them. This has left limited code available for collection and use in the downstream investigation of specific behaviors, or evaluation of large langua ...

Faithful Model Explanations through Energy-Constrained Conformal Counterfactuals

Journal article (2024) - P. Altmeyer (author), Mojtaba Farmanbar (author), A. van Deursen (author), C.C.S. Liem (author)

Counterfactual explanations offer an intuitive and straightforward way to explain black-box models and offer algorithmic recourse to individuals. To address the need for plausible explanations, existing work has primarily relied on surrogate models to learn how the input data is ...

Understanding Concurrency Bugs in Real-World Programs with Kotlin Coroutines

Conference paper (2024) - Bob Brockbernd (author), Nikita Koval (author), A. van Deursen (author), Burcu Kulahcioglu Ozkan (author)

The Kotlin language has recently become prominent for developing both Android and server-side applications. These programs are typically designed to be fast and responsive, with asynchrony and concurrency at their core. To enable developers to write asynchronous and concurrent code s ...

On the Difficulty of Identifying Incident-Inducing Changes

Conference paper (2024) - E. Kapel (author), Luis Cruz (author), D. Spinellis (author), A. van Deursen (author)

Effective change management is crucial for businesses heavily reliant on software and services to minimise incidents induced by changes. Unfortunately, in practice it is often difficult to effectively use artificial intelligence for IT Operations (AIOps) to enhance service manage ...

Data vs. Model Machine Learning Fairness Testing

An Empirical Study

Conference paper (2024) - A. Shome (author), Luis Cruz (author), A. van Deursen (author)

Although several fairness definitions and bias mitigation techniques exist in the literature, all existing solutions evaluate fairness of Machine Learning (ML) systems after the training stage. In this paper, we take the first steps towards evaluating a more holistic approach by ...

Is Your Anomaly Detector Ready for Change? Adapting AIOps Solutions to the Real World

Conference paper (2024) - L. Poenaru-Olaru (author), Natalia Karpova (author), Luis Cruz (author), Jan S. Rellermeyer (author), A. van Deursen (author)

Anomaly detection techniques are essential in automating the monitoring of IT systems and operations. These techniques imply that machine learning algorithms are trained on operational data corresponding to a specific period of time and that they are continuously evaluated on new ...

A Transformer-Based Approach for Smart Invocation of Automatic Code Completion

Conference paper (2024) - A.D. de Moor (author), A. van Deursen (author), M. Izadi (author)

Transformer-based language models are highly effective for code completion, with much research dedicated to enhancing the content of these completions. Despite their effectiveness, these models come with high operational costs and can be intrusive, especially when they suggest to ...

Enhancing Incident Management

Insights from a Case Study at ING

Conference paper (2024) - Eileen Kapel (author), Luis Cruz (author), D. Spinellis (author), A. van Deursen (author)

An incident management process is necessary in businesses that depend strongly on software and services. A proper process is essential to guarantee that incidents are well-handled, especially in a financial software-defined business needing to adhere to guidelines and regulations ...

CheckMate: Evaluating Checkpointing Protocols for Streaming Dataflows

Conference paper (2024) - G. Siachamis (author), K. Psarakis (author), M. Fragkoulis (author), A. van Deursen (author), Paris Carbone (author), A. Katsifodimos (author)

Stream processing in the last decade has seen broad adoption in both commercial and research settings. One key element for this success is the ability of modern stream processors to handle failures while ensuring exactly-once processing guarantees. At the moment of writing, virtu ...

Context-Aware Automated Sprint Plan Generation for Agile Software Development

Conference paper (2024) - E. Kula (author), A. van Deursen (author), G. Gousios (author)

Sprint planning is essential for the successful execution of agile software projects. While various prioritization criteria influence the selection of user stories for sprint planning, their relative importance remains largely unexplored, especially across different project conte ...

Evaluating Stream Processing Autoscalers

Conference paper (2024) - G. Siachamis (author), G.C. Christodoulou (author), K. Psarakis (author), M. Fragkoulis (author), A. van Deursen (author), A. Katsifodimos (author)

While the concept of large-scale stream processing is very popular nowadays, efficient dynamic allocation of resources is still an open issue in the area. The database research community has yet to evaluate different autoscaling techniques for stream processing engines under a ro ...

Enriching Source Code with Contextual Data for Code Completion Models

An Empirical Study

Conference paper (2023) - Tim van Dam (author), M. Izadi (author), A. van Deursen (author)

Transformer-based pre-trained models have recently achieved great results in solving many software engineering tasks including automatic code completion which is a staple in a developer’s toolkit. While many have striven to improve the code-understanding abilities of such models, ...

Retrain AI Systems Responsibly! Use Sustainable Concept Drift Adaptation Techniques

Conference paper (2023) - L. Poenaru-Olaru (author), J. Sallou (author), Luis Cruz (author), Jan S. Rellermeyer (author), A. van Deursen (author)

Deployed machine learning systems often suffer from accuracy degradation over time generated by constant data shifts, also known as concept drift. Therefore, these systems require regular maintenance, in which the machine learning model needs to be adapted to concept drift. The l ...

STACC: Code Comment Classification using SentenceTransformers

Conference paper (2023) - A. Al-Kaswan (author), M. Izadi (author), A. van Deursen (author)

Code comments are a key resource for information about software artefacts. Depending on the use case, only some types of comments are useful. Thus, automatic approaches to classify these comments have been proposed. In this work, we address this need by proposing STACC, a set o ...

Towards Evaluating Stream Processing Autoscalers

Conference paper (2023) - G. Siachamis (author), Job Kanis (author), Wybe Koper (author), K. Psarakis (author), M. Fragkoulis (author), A. van Deursen (author), A. Katsifodimos (author)

In this work, we evaluate autoscaling solutions for stream processing engines. Although autoscaling has become a mainstream subject of research in the last decade, the database research community has yet to evaluate different autoscaling techniques under a proper benchmarking set ...

Dynamic Prediction of Delays in Software Projects using Delay Patterns and Bayesian Modeling

Conference paper (2023) - E. Kula (author), Eric Greuter (author), A. van Deursen (author), G. Gousios (author)

Modern agile software projects are subject to constant change, making it essential to reassess overall delay risk throughout the project life cycle. Existing effort estimation models are static and not able to incorporate changes occurring during project execution. In this paper, ...