M. Loog | TU Delft Repository

On Sample-Wise Strict Monotonicity with a Gradient Update

Conference paper (2026) - O. Taylan Turan , Marco Loog , David M.J. Tax

Learning curves describe how the performance of a model evolves with increasing training data. Although more data is generally expected to improve model performance, in practice models can exhibit non-monotonic behavior where additional data leads to performance degradation. Samp ...

Applications and implicit assumptions in dementia risk scores

A scoping review of the LIBRA score

Review (2026) - Wouter M.R. Kant , Wieske K. de Swart , Jim M. Smit , Marco Loog , Jesse H. Krijthe

Dementia risk scores are commonly used tools to estimate the risk of developing Alzheimer's disease and dementia. We lack an overview of what risk scores are used for, what is claimed they ought to be used for, and whether they are suitable for these applications. To address this ...

Generalization performance distributions along learning curves

Journal article (2026) - O. Taylan Turan , Marco Loog , David M.J. Tax

Learning curves show the expected performance with respect to training set size. This is often used to evaluate and compare models, tune hyper-parameters and determine how much data is needed for a specific performance. However, the distributional properties of performance are fr ...

The Vanishing Empirical Variance in Randomly Initialized Deep ReLU Networks

Conference paper (2026) - Michał Grzejdziak-Zdziarski , David M.J. Tax , Marco Loog

Neural networks are typically initialized such that the hidden pre-activations’ theoretical variance remains constant to avoid the vanishing and exploding gradient problem. This condition is necessary to train very deep networks, but numerous analyses show this to be insufficient ...

A comparative study of methods for dynamic survival analysis

Journal article (2025) - Wieske K. de Swart , Marco Loog , Jesse H. Krijthe

Introduction: Dynamic survival analysis has become an effective approach for predicting time-to-event outcomes based on longitudinal data in neurology, cognitive health, and other health-related domains. With advancements in machine learning, several new methods have been introdu ...

Learning Learning Curves

Journal article (2025) - O. Taylan Turan , David M.J. Tax , Tom J. Viering , Marco Loog

Learning curves depict how a model’s expected performance changes with varying training set sizes, unlike training curves, showing a gradient-based model’s performance with respect to training epochs. Extrapolating learning curves can be useful for determining the performance gai ...

An Analysis of Model-Based Reinforcement Learning From Abstracted Observations

Journal article (2023) - R.A.N. Starre , M. Loog , E. Congeduti , F.A. Oliehoek

Many methods for Model-based Reinforcement learning (MBRL) in Markov decision processes (MDPs) provide guarantees for both the accuracy of the model they can deliver and the learning efficiency. At the same time, state abstraction techniques allow for a reduction of the size of a ...

Social Processes

Self-supervised Meta-learning Over Conversational Groups for Forecasting Nonverbal Social Cues

Conference paper (2023) - Chirag Raman , Hayley Hung , Marco Loog

Free-standing social conversations constitute a yet underexplored setting for human behavior forecasting. While the task of predicting pedestrian trajectories has received much recent attention, an intrinsic difference between these settings is how groups form and disband. Eviden ...

The Shape of Learning Curves

A Review

Review (2023) - Tom Viering , Marco Loog

Learning curves provide insight into the dependence of a learner's generalization performance on the training set size. This important tool can be used for model selection, to predict the effect of more training data, and to reduce the computational complexity of model training a ...

LCDB 1.0

An Extensive Learning Curves Database for Classification Tasks

Conference paper (2023) - Felix Mohr , Tom J. Viering , Marco Loog , Jan N. van Rijn

The use of learning curves for decision making in supervised machine learning is standard practice, yet understanding of their behavior is rather limited. To facilitate a deepening of our knowledge, we introduce the Learning Curve Database (LCDB), which contains empirical learnin ...

A View on Model Misspecification in Uncertainty Quantification

Conference paper (2023) - Yuko Kato , David M.J. Tax , Marco Loog

Estimating uncertainty of machine learning models is essential to assess the quality of the predictions that these models provide. However, there are several factors that influence the quality of uncertainty estimates, one of which is the amount of model misspecification. Model m ...

Also for k-means

More data does not imply better performance

Journal article (2023) - Marco Loog , Jesse H. Krijthe , Manuele Bicego

Arguably, a desirable feature of a learner is that its performance gets better with an increasing amount of training data, at least in expectation. This issue has received renewed attention in recent years and some curious and surprising findings have been reported on. In essence ...

Percolate

An Exponential Family JIVE Model to Design DNA-Based Predictors of Drug Response

Conference paper (2023) - Soufiane M.C. Mourragui , Marco Loog , Mirrelijn van Nee , Mark A.van de Wiel , Marcel J.T. Reinders , Lodewyk F.A. Wessels

Motivation: Anti-cancer drugs may elicit resistance or sensitivity through mechanisms which involve several genomic layers. Nevertheless, we have demonstrated that gene expression contains most of the predictive capacity compared to the remaining omic data types. Unfortunately, t ...

Improved Generalization in Semi-Supervised Learning

A Survey of Theoretical Results

Journal article (2022) - Alexander Mey , Marco Loog

Semi-supervised learning is the learning setting in which we have both labeled and unlabeled data at our disposal. This survey covers theoretical results for this setting and maps out the benefits of unlabeled data in classification and regression tasks. Most methods that use unl ...

To Actively Initialize Active Learning

Journal article (2022) - Yazhou Yang , Marco Loog

Though much effort has been spent on designing new active learning algorithms, little attention has been paid to the initialization problem of active learning, i.e., how to find a set of labeled samples which contains at least one instance per category. This work identifies the i ...

Enhancing Classifier Conservativeness and Robustness by Polynomiality

Conference paper (2022) - Ziqi Wang , Marco Loog

We illustrate the detrimental effect, such as overconfident decisions, that exponential behavior can have in methods like classical LDA and logistic regression. We then show how polynomiality can remedy the situation. This, among others, leads purposefully to random-level perform ...

Model-Based Reinforcement Learning with State Abstraction: A Survey

Conference paper (2022) - R.A.N. Starre , M. Loog , F.A. Oliehoek

Model-based reinforcement learning methods are promising since they can increase sample efficiency while simultaneously improving generalizability. Learning can also be made more efficient through state abstraction, which delivers more compact models. Model-based reinforcement le ...

Target Robust Discriminant Analysis

Conference paper (2021) - Wouter M. Kouw , Marco Loog

In practice, the data distribution at test time often differs, to a smaller or larger extent, from that of the original training data. Consequentially, the so-called source classifier, trained on the available labelled data, deteriorates on the test, or target, data. Domain adapt ...

Robust domain-adaptive discriminant analysis

Journal article (2021) - Wouter Kouw , Marco Loog

Consider a domain-adaptive supervised learning setting, where a classifier learns from labeled data in a source domain and unlabeled data in a target domain to predict the corresponding target labels. If the classifier’s assumption on the relationship between domains (e.g. covari ...

A Review of Domain Adaptation without Target Labels

Review (2021) - Wouter M. Kouw , Marco Loog

Domain adaptation has become a prominent problem setting in machine learning and related fields. This review asks the question: How can a classifier learn from a source domain and generalize to a target domain We present a categorization of approaches, divided into, what we refer ...