
C.C.S. Liem

73 records found

Annotation Practices in Societally Impactful Machine Learning Applications

What are these automated systems actually trained on?

This study examines dataset annotation practices in influential NeurIPS research. Datasets employed in highly cited NeurIPS papers were assessed based on criteria concerning their item population, labelling schema, and annotation process. While high-level information, such as the ...
As climate change worsens, the complications brought on by yearly floods create an increasing need for forecasting systems that humanitarian organizations can use to help populations at risk. This research presents a literature review of machine-learning models f ...
Natural disasters frequently cause casualties and property losses. Predicting and mitigating the impact of such threats is crucial to the work of humanitarian organizations. The interactions between hazards are best represented through a multi-hazard approach, and machine learnin ...

Annotation Practices in Societally Impactful Machine Learning Applications

What are these automated systems actually trained on?

The output of machine learning (ML) models can only be as good as the data fed into them. Because of this, it is important to ensure data quality when constructing datasets for ML models. This is especially true of human-labelled data, which can be hard to s ...
Displacement is a focal point of humanitarian aid efforts, since it affects millions of people globally. Mitigating the consequences of forced migration is important for reducing suffering, and one way of doing so is through predicting displacement to prioritise resources in advan ...

Machine Learning for Humanitarian Forecasting: A Survey

Assessing the trustworthiness and real-world feasibility of machine learning models for conflict forecasting

As humanitarian needs increase while donor budgets decrease, anticipatory strategies are essential for effective crisis response. In this context, machine learning (ML) has emerged as a promising tool for crisis forecasting, offering the potential to support timely interventions ...

Behind the Labels: Transparency Pitfalls in Annotation Practices for Societally Impactful ML

A deep dive into annotation transparency and consistency in the CVPR corpus

This study investigates annotation and reporting practices in machine learning (ML) research, focusing on societally impactful applications presented at the IEEE/CVF Computer Vision and Pattern Recognition (CVPR) conferences. By structurally analyzing the 75 most-cited CVPR paper ...

High-impact vision research still rests on datasets whose labels arrive via opaque, rarely documented pipelines. To understand how serious the problem is inside a large venue, we audited 75 TPAMI papers (2009-2024) that rely on or introduce datasets. Each datase ...

Dataset quality within a societally impactful machine learning domain

An overview of data collection and annotation practices of the datasets used by papers published by the ACL

This study gives an overview of the data collection and annotation practices of the datasets used by the most impactful papers published by the Association for Computational Linguistics (ACL). This was achieved by selecting the most highly cited papers published within the ACL ant ...
In musical (jazz) improvisation, musicians who are just starting out can often feel uncomfortable when put on the spot by their fellow players. However, a musician practising or playing leisurely on their own cannot listen to fellow musici ...

Requirements Engineering for Machine Learning

A Study in Behavior-Driven Development

Machine Learning (ML) systems are increasingly used in high-stakes, socially impactful domains, requiring attention to improve explainability and trust. However, current Requirements Engineering (RE) techniques often fail to address these human-centered qualities. This research i ...
Central banks communicate their monetary policy plans to the public through meeting minutes or transcripts. These communications can have immense effects on markets and are often the subjects of studies in the financial literature. The recent advancements in Natural Language Proc ...
Large language models have achieved breakthroughs in many natural language processing tasks. One of their main appeals is the ability to tackle problems that lack sufficient training data to create a dedicated solution. Manga translation is one such task, a still budding and un ...
Counterfactual explanations can be applied to algorithmic recourse, which is concerned with helping individuals in the real world overturn undesirable algorithmic decisions. They aim to provide explanations for opaque machine learning models. Not all generated points are equally f ...
Adversarial Training has emerged as the most reliable technique to make neural networks robust to gradient-based adversarial perturbations on input data. Besides improving model robustness, preliminary evidence presents an interesting consequence of adversarial training -- increa ...
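
As an illustrative aside, a minimal sketch of a gradient-based adversarial training step of the kind the record above refers to, assuming an FGSM-style perturbation and a PyTorch classifier (the function name, epsilon value, and toy data below are assumptions for the example, not taken from the thesis):

import torch
import torch.nn as nn

def adversarial_training_step(model, x, y, optimizer, epsilon=0.1):
    # One update on FGSM-perturbed inputs instead of the clean batch.
    loss_fn = nn.CrossEntropyLoss()

    # Craft the perturbation from the gradient of the loss w.r.t. the inputs.
    x_adv = x.clone().detach().requires_grad_(True)
    loss_fn(model(x_adv), y).backward()
    with torch.no_grad():
        x_adv = x_adv + epsilon * x_adv.grad.sign()

    # Standard optimisation step, but on the adversarial examples.
    optimizer.zero_grad()
    adv_loss = loss_fn(model(x_adv), y)
    adv_loss.backward()
    optimizer.step()
    return adv_loss.item()

# Toy usage: a linear classifier on random data.
model = nn.Linear(20, 3)
optimizer = torch.optim.SGD(model.parameters(), lr=0.01)
x, y = torch.randn(32, 20), torch.randint(0, 3, (32,))
print(adversarial_training_step(model, x, y, optimizer))

The only change from standard training is that the loss is computed on inputs perturbed along the gradient sign rather than on the clean batch.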
In recent years, the need for explainable artificial intelligence (XAI) has become increasingly important as complex black-box models are used in critical applications. While many methods have been developed to interpret these models, there is also potential in enhancing the mode ...
Counterfactual explanations (CEs) can be used to gain useful insights into the behaviour of opaque classification models, allowing users to make an informed decision when trusting such systems. Assuming the CEs of a model are faithful (they accurately represent the inner workings of th ...
Counterfactual Explanations (CE) are essential for understanding the predictions of black-box models by suggesting minimal changes to input features that would alter the output. Despite their importance in Explainable AI (XAI), there is a lack of standardized metrics to assess th ...

A Study on Counterfactual Explanations

Investigating the impact of inter-class distance and data imbalance

Counterfactual explanations (CEs) are emerging as a crucial tool in Explainable AI (XAI) for understanding model decisions. This research investigates the impact of various factors on the quality of CEs generated for classification tasks. We explore how inter-class distance, data ...
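
To make the "minimal change to input features that would alter the output" idea in the counterfactual records above concrete, here is a small illustrative sketch for a linear classifier, where the closest counterfactual has a closed form (the weights, bias, and input below are made up for the example and are not from any of these theses):

import numpy as np

def linear_counterfactual(x, w, b, margin=1e-3):
    # Smallest L2 change to x that flips the decision sign(w @ x + b):
    # project x onto the decision boundary and step a small margin past it.
    score = w @ x + b
    delta = -(score + np.sign(score) * margin) / (w @ w) * w
    return x + delta

w, b = np.array([2.0, -1.0]), -0.5
x = np.array([1.0, 0.5])              # currently classified as positive
x_cf = linear_counterfactual(x, w, b)
print(x_cf, np.sign(w @ x_cf + b))    # nearby point on the other side of the boundary

For non-linear black-box models no such closed form exists, which is why methods for generating CEs and metrics for assessing their quality, as studied in the records above, are needed.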

Developing a monitoring process for IPC Acute Food Insecurity analyses

A case study on Human-Centered AI for humanitarian decision-making

Due to climate change, man-made conflicts, and rising inflation, a growing number of people around the world are struggling to have consistent access to safe and nutritious food. This phenomenon is known as food insecurity (FI). In this thesis, we therefore take the first steps t ...