The multi-armed bandit problem is a sequential learning scenario in which a learning algorithm seeks to accumulate reward by selecting an arm, or action, in each round, given limited initial knowledge. Contextual bandits additionally present a context in every round that informs the bandit algorithm and guides decision-making. While bandit algorithms have been successfully applied in practice, research continues to seek efficient algorithms for high-dimensional contextual bandits with nonparametric, sparsely varying reward functions. One such algorithm is the two-phase SI-BO algorithm, which first runs a subspace learning phase to identify the effective low-dimensional subspace on which the reward function varies, and then runs a Bayesian optimization phase that applies the Gaussian process-based GP-UCB algorithm on the learned subspace. Although SI-BO achieves a regret bound with only weak sub-exponential dependence on the ambient dimension, it is hindered by the high computational cost of Gaussian process regression. Building on the algorithmic framework of SI-BO, this paper investigates the empirical regret performance of Gaussian process-based learning algorithms that incorporate subspace learning. To that end, we introduce a novel algorithm, SI-BKB, which combines the subspace learning of SI-BO with the BKB sketching algorithm, reducing computational complexity while maintaining theoretical guarantees. Using synthetic data, we present a systematic empirical study on linear and nonlinear bandit environments with varying levels of sparsity. The results show that SI-BKB attains regret comparable to SI-BO. They further indicate that misalignment of the learned subspace leads to suboptimal regret during the optimization phase, and that under high sparsity, subspace misalignment can even improve regret.
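To make the second phase concrete, the following is a minimal sketch of GP-UCB restricted to a learned subspace, in the spirit of the optimization phase described above. It is not the paper's implementation: the orthonormal basis \texttt{Q} is assumed to be given (in SI-BO it would come from the subspace learning phase), candidates are a fixed random set, and the kernel, length scale, and confidence parameter \texttt{beta} are illustrative choices.

```python
import numpy as np

def rbf(A, B, ls=0.5):
    """Squared-exponential kernel between row-stacked point sets A and B."""
    d2 = ((A[:, None, :] - B[None, :, :]) ** 2).sum(-1)
    return np.exp(-d2 / (2 * ls ** 2))

def gp_ucb_on_subspace(f, Q, low, high, T=30, beta=2.0, noise=1e-3, seed=0):
    """Run GP-UCB over candidates lying in the span of the basis Q.

    f     : reward function on the ambient space (queried with noise)
    Q     : (D, k) orthonormal basis of the learned effective subspace
            (hypothetical input; SI-BO would estimate it in its first phase)
    low/high : bounds of the low-dimensional candidate box
    Returns the queried ambient points and their noisy rewards.
    """
    rng = np.random.default_rng(seed)
    D, k = Q.shape
    # Sample low-dimensional candidates and lift them to the ambient space.
    Z = rng.uniform(low, high, size=(200, k))
    X = Z @ Q.T
    xs, ys = [], []
    for t in range(T):
        if not xs:
            i = rng.integers(len(X))  # first query: arbitrary candidate
        else:
            Xo, yo = np.array(xs), np.array(ys)
            K = rbf(Xo, Xo) + noise * np.eye(len(Xo))
            Ks = rbf(X, Xo)
            Kinv = np.linalg.inv(K)
            mu = Ks @ Kinv @ yo                      # posterior mean
            var = 1.0 - np.einsum('ij,jk,ik->i', Ks, Kinv, Ks)
            var = np.clip(var, 1e-12, None)          # posterior variance
            i = int(np.argmax(mu + np.sqrt(beta * var)))  # UCB rule
        xs.append(X[i])
        ys.append(f(X[i]) + noise * rng.standard_normal())
    return np.array(xs), np.array(ys)
```

On a sparse reward that depends only on the first two of ten ambient coordinates, passing \texttt{Q = np.eye(10)[:, :2]} lets the optimization run in two dimensions; a misaligned \texttt{Q} would correspond to the subspace-misalignment regime discussed in the results.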
The repository is available at \url{https://github.com/Cheese-1/SparseSequentialLearning}.