A. van Deursen | TU Delft Repository

A Transformer-Based Approach for Smart Invocation of Automatic Code Completion

Conference paper (2024) - A.D. de Moor (author) , A. Van Van Deursen (author) , Maliheh Izadi (author)

Transformer-based language models are highly effective for code completion, with much research dedicated to enhancing the content of these completions. Despite their effectiveness, these models come with high operational costs and can be intrusive, especially when they suggest to ...

On the Difficulty of Identifying Incident-Inducing Changes

Conference paper (2024) - Eileen Kapel (author) , Luís Cruz (author) , D. Spinellis (author) , Arie Deursen (author)

Effective change management is crucial for businesses heavily reliant on software and services to minimise incidents induced by changes. Unfortunately, in practice it is often difficult to effectively use artificial intelligence for IT Operations (AIOps) to enhance service manage ...

Enhancing Incident Management

Insights from a Case Study at ING

Conference paper (2024) - Eileen Kapel (author) , Luís Cruz (author) , DIomidis Spinellis (author) , Arie van Deursen (author)

An incident management process is necessary in businesses that depend strongly on software and services. A proper process is essential to guarantee that incidents are well-handled, especially in a financial software-defined business needing to adhere to guidelines and regulations ...

Is your anomaly detector ready for change? adapting aiops solutions to the real world

Conference paper (2024) - Lorena Poenaru-Olaru (author) , Natalia Karpova (author) , Luís Cruz (author) , Jan S. Rellermeyer (author) , A. Van Van Deursen (author)

Anomaly detection techniques are essential in automating the monitoring of IT systems and operations. These techniques imply that machine learning algorithms are trained on operational data corresponding to a specific period of time and that they are continuously evaluated on new ...

Context-Aware Automated Sprint Plan Generation for Agile Software Development

Conference paper (2024) - Elvan Kula (author) , Arie van Deursen (author) , G. Gousios (author)

Sprint planning is essential for the successful execution of agile software projects. While various prioritization criteria influence the selection of user stories for sprint planning, their relative importance remains largely unexplored, especially across different project conte ...

Data vs. Model Machine Learning Fairness Testing

An Empirical Study

Conference paper (2024) - A. Shome (author) , Luís Cruz (author) , A. Van Van Deursen (author)

Although several fairness definitions and bias mitigation techniques exist in the literature, all existing solutions evaluate fairness of Machine Learning (ML) systems after the training stage. In this paper, we take the first steps towards evaluating a more holistic approach by ...

Understanding Concurrency Bugs in Real-World Programs with Kotlin Coroutines

Conference paper (2024) - Bob Brockbernd (author) , Nikita Koval (author) , A. Deursen (author) , Burcu Külahçıoğlu Özkan (author)

Kotlin language has recently become prominent for developing both Android and server-side applications. These programs are typically designed to be fast and responsive, with asynchrony and concurrency at their core. To enable developers to write asynchronous and concurrent code s ...

Evaluating Stream Processing Autoscalers

Conference paper (2024) - G. Siachamis (author) , George Christodoulou (author) , K. Psarakis (author) , M. Fragkoulis (author) , A van Deursen (author) , A. Katsifodimos (author)

While the concept of large-scale stream processing is very popular nowadays, efficient dynamic allocation of resources is still an open issue in the area. The database research community has yet to evaluate different autoscaling techniques for stream processing engines under a ro ...

Faithful Model Explanations through Energy-Constrained Conformal Counterfactuals

Journal article (2024) - P. Altmeyer (author) , Mojtaba Farmanbar (author) , A Deursen (author) , Cynthia C.S. Liem (author)

Counterfactual explanations offer an intuitive and straightforward way to explain black-box models and offer algorithmic recourse to individuals. To address the need for plausible explanations, existing work has primarily relied on surrogate models to learn how the input data is ...

CheckMate: Evaluating Checkpointing Protocols for Streaming Dataflows

Conference paper (2024) - G. Siachamis (author) , K. Psarakis (author) , M. Fragkoulis (author) , A. Van Deursen (author) , Paris Carbone (author) , A. Katsifodimos (author)

Stream processing in the last decade has seen broad adoption in both commercial and research settings. One key element for this success is the ability of modern stream processors to handle failures while ensuring exactly-once processing guarantees. At the moment of writing, virtu ...

Maintaining and Monitoring AIOps Models Against Concept Drift

Conference paper (2023) - Lorena Poenaru-Olaru (author) , Luis Cruz (author) , Jan S. Rellermeyer (author) , Arie Van Deursen (author)

AIOps solutions enable faster discovery of failures in operational large-scale systems through machine learning models trained on operation data. These models become outdated during the occurrence of concept drift, a term used to describe shifts in data distributions. In operatio ...

STACC: Code Comment Classification using SentenceTransformers

Conference paper (2023) - Ali Al-Kaswan (author) , Maliheh Izadi (author) , A. Van Van Deursen (author)

Code comments are a key resource for information about software artefacts. Depending on the use case, only some types of comments are useful. Thus, automatic approaches to clas-sify these comments have been proposed. In this work, we address this need by proposing, STACC, a set o ...

Dynamic Prediction of Delays in Software Projects using Delay Patterns and Bayesian Modeling

Conference paper (2023) - E. Kula (author) , Eric Greuter (author) , Arie van Deursen (author) , Georgios Georgios (author)

Modern agile software projects are subject to constant change, making it essential to re-asses overall delay risk throughout the project life cycle. Existing effort estimation models are static and not able to incorporate changes occurring during project execution. In this paper, ...

Uncovering Energy-Efficient Practices in Deep Learning Training

Preliminary Steps Towards Green AI

Conference paper (2023) - Tim Yarally (author) , Luís Cruz (author) , Daniel Feitosa (author) , June Sallou (author) , Arie Van Deursen (author)

Modern AI practices all strive towards the same goal: better results. In the context of deep learning, the term "results"often refers to the achieved accuracy on a competitive problem set. In this paper, we adopt an idea from the emerging field of Green AI to consider energy cons ...

Extending Source Code Pre-Trained Language Models to Summarise Decompiled Binaries

Conference paper (2023) - A. Al-Kaswan (author) , Toufique Ahmed (author) , Maliheh Izadi (author) , Anand Ashok Sawant (author) , Premkumar Devanbu (author) , Arie Deursen (author)

Binary reverse engineering is used to understand and analyse programs for which the source code is unavailable. Decompilers can help, transforming opaque binaries into a more readable source code-like representation. Still, reverse engineering is difficult and costly, involving c ...

Endogenous Macrodynamics in Algorithic Recourse [VIDEO]

Other (2023) - P. Altmeyer (author) , G.J.A. Angela (author) , Aleksander Buszydlik (author) , Karol Dobiczek (author) , Arie Deursen (author) , C.C.S. Liem (author)

Existing work on Counterfactual Explanations (CE) and Algorithmic Recourse (AR) has largely been limited to the static setting and focused on single individuals: given some estimated model, the goal is to find valid counterfactuals for an individual instance that fulfill various ...

Targeted Attack on GPT-Neo for the SATML Language Model Data Extraction Challenge [PRESENTATION]

Other (2023) - Ali Al-Kaswan (author) , Maliheh Izadi (author) , A. Van Van Deursen (author)

Previous work has shown that Large Language Models are susceptible to so-called data extraction attacks. This allows an attacker to extract a sample that was contained in the training data, which has massive privacy implications. The construction of data extraction attacks is cha ...

Generating Class-Level Integration Tests Using Call Site Information

Journal article (2023) - P. Derakhshanfar (author) , Xavier Devroey (author) , Annibale Panichella (author) , AE Zaidman (author) , A. Van Deursen (author)

Search-based approaches have been used in the literature to automate the process of creating unit test cases. However, related work has shown that generated tests with high code coverage could be ineffective, i.e., they may not detect all faults or kill all injected mutants. In t ...

Getting Things Done

The Eelco Way

Conference paper (2023) - A. Van Deursen (author)

Eelco Visser (1966–2022) was a leading member of the department of Software Technology (ST) of the faculty of Electrical Engineering Mathematics, and Computer Science (EEMCS) of Delft University of Technology. He had a profound influence on the educational programs in computer sc ...

Explaining Black-Box Models through Counterfactuals

Conference paper (2023) - P. Altmeyer (author) , Cynthia CS Liem (author) , A. Van Deursen (author)

We present CounterfactualExplanations.jl: a package for generating Counterfactual Explanations (CE) and Algorithmic Recourse (AR) for black-box models in Julia. CE explain how inputs into a model need to change to yield specific model predictions. Explanations that involve realis ...