J.C. van Gemert | TU Delft Repository

Understanding the Value of Depth: RGB-D Fusion and Pseudo-Depth for Robust Out-of-Distribution Generalisation

An Experimental Journey into How Depth Shapes Generalisation in Vision Models

Master thesis (2026) - Alexandra Neagu , J.C. van Gemert , C.E. Brandt , A.S. Gielisse

Convolutional neural networks (CNNs) trained on RGB images (red, green, blue channels) often exhibit sharp performance degradation under distribution shifts, as they tend to rely on superficial appearance cues such as background or texture. While depth information is known to pro ...

Generative and Regressive Approaches for Periodic Orbit-Based Spacecraft Mission Design in the Circular Restricted Three Body Problem

Master thesis (2026) - J.W.A. Pedra , J.G. De Teixeira da Encarnacao , D.M.J. Tax , J.C. van Gemert , K.J. Cowan

The circular restricted three-body problem is a canonical example of chaotic dynamics and forms the basis of many advanced spacecraft trajectory designs. This thesis investigates whether emerging artificial intelligence based generative and regression methods can reduce computati ...

Low-Rank Ternary Adapters for Fine-Tuning Transformers

Master thesis (2025) - A.D. Manolache , Y. Li , J.C. van Gemert , A. Anand

Parameter-Efficient Fine-Tuning (PEFT) methods for Transformers are designed for floating-point weights. When applied to extremely low-bit models (e.g., ternary {-1,0,1) they convert the base weights to floating point (dequantization) to add the update and then quantize again, wh ...

FlyingDutchman: An Optical Flow Analysis Tool

Master thesis (2025) - P.J.W. Reijalt , A.S. Gielisse , J.C. van Gemert

Much progress in optical flow research has been driven by benchmark datasets. However, these datasets provide only limited feedback on the underlying causes of architectural failures, typically restricted to metrics such as end-point error (EPE), occlusion statistics, and large-d ...

Evacuation Slide Manufacturing Intelligent Vision Quality Inspection System

Master thesis (2025) - K. Dwivedi , J.C. van Gemert , M.M. de Weerdt

Evacuation slides are critical aircraft safety components governed by stringent regulatory standards set by agencies like the Federal Aviation Administration and European Union Aviation Safety Agency. To comply with these standards, the maintenance and repacking of slides, curren ...

Evacuation slides are critical aircraft safety components governed by stringent regulatory standards set by agencies like the Federal Aviation Administration and European Union Aviation Safety Agency. To comply with these standards, the maintenance and repacking of slides, currently performed manually, require operators to follow hundreds of precise steps. This process is labor-intensive, error-prone, and can result in costly delays and safety risks due to human error. Real-time visual inspection systems can help address these challenges, however, a key obstacle for real-world deployment of such systems is the scarcity of benchmark data from aerospace factory operations needed to validate and verify their performance. To enable this, we introduce the first known dataset tailored for evacuation slide inspection, comprising over 14,500 images captured under real and controlled conditions. This data aims to capture slide folding procedures of Embraer AFT evacuation slides, such that developed real-time systems can: (1) estimate the occluded position of the Pressure Relief Valve (PRV), (2) detect context-sensitive foreign objects such as packing clamps, and (3) calculate slide fold dimensions to prevent tolerance stacking errors. From this dataset, five benchmarks were constructed to evaluate performance across the three requirements. Baseline models were developed, including a PRV localization network using LSTMs, a variational autoencoder and object detection pipeline for packing clamp FOD, and a depth and reference-based slide fold measurement calculation method. When tested on our benchmarks, the depth-based measurement estimator showed precision and accuracy, the clamp FOD methodology showed high precision for images taken from specific cameras, however, the PRV position estimation remains a challenge that requires further research. Overall, our results set a foundation for the automation of visual inspection in slide packing and offer benchmarks for future research in this safety-critical inspection task.

Are we SMPLy biased

Identifying ethical biases in Action Recognition

Master thesis (2025) - A. Băltăreţu , J.C. van Gemert , P. Benschop , M. Skrodzki

Human Action Recognition (HAR) models are increasingly deployed in high-stakes environments, yet their fairness across different human appearances has not been analyzed. We introduce a framework for auditing bias in HAR models using synthetic video data, generated with full contr ...

Unsupervised Few-Shot Sample Test-Time Adaptation via Entropy Minimization

Master thesis (2025) - T. Oprescu , J.C. van Gemert , Jorge Abraham Martinez Castaneda

Test-time adaptation methods assume privileged access to model internals: parameters for fine-tuning, statistics for recalibration, or architectural components for modification. This assumption fails when models are deployed as certified systems, encrypted services, or under regu ...

Multitask Learning for Joint Semantic Segmentation and Classification of Ovarian Lesions in Ultrasound Scans

Master thesis (2025) - D.Z. Rogmans , N. Tömen , J.C. van Gemert , R. Guerra Marroquim , J. Dijkstra

Distinguishing between benign and malignant ovarian cysts is a challenging task that depends on subjective visual markers in ultrasound scans. Current manual methods remain prone to costly misdiagnoses and the application of these methods depend heavily on the clinician's level o ...

Spike Time Sensitivity in Spiking Neural Networks

Investigating the Effect of Sample Difficulty in Time-to-First-Spike Coded Spiking Neural Networks

Master thesis (2025) - E. Aydoslu , N. Tömen , O. Booij , J.C. van Gemert , A. Micheli

Spiking neural networks (SNNs) with Time-to-First-Spike (TTFS) coding promise rapid, sparse, and energy-efficient inference. However, the impact of sample difficulty on TTFS dynamics remains underexplored. We investigate (i) how input hardness influences first-spike timing and (i ...

Bridging the Gap: A Real-World Dataset and Evaluation of Optical Flow Models in Large Displacement Scenarios

Bachelor thesis (2025) - M. Timmerije , J.C. van Gemert , A.S. Gielisse , A. Voulimeneas

Optical flow models excel on synthetic benchmarks but can struggle with real-world scenarios involving large displacements, which are critical for applications like autonomous navigation and augmented reality. To address this, we introduce a novel real-world dataset and evaluatio ...

Going Against The Flow

Evaluating Optical Flow Estimation Models on Real-World Non-Rigid Motion

Bachelor thesis (2025) - S. Dahal , A.S. Gielisse , J.C. van Gemert , A. Voulimeneas

Optical flow estimation models are currently trained and evaluated on synthetic datasets. However, the generalizability of these models to real-world applications remains unexplored. This study investigates how well two state-of-the-art optical flow estimation models perform on r ...

Real-world evaluation of Optical Flow on repetitive patterns

Bachelor thesis (2025) - J.B. Klijnsma , J.C. van Gemert , A.S. Gielisse , A. Voulimeneas

Performance of Optical Flow Models on Real-World Occluded Regions

Bachelor thesis (2025) - I.A. Petre , J.C. van Gemert , A.S. Gielisse , A. Voulimeneas

Occlusions are one of the main challenges in optical flow estimation, where parts of the scene are no longer visible between consecutive frames. Several models address this problem, either intrinsically or explicitly, using different strategies. However, most benchmarks rely on s ...

Real-World Evaluation of Optical Flow with Varying Lighting Conditions

Bachelor thesis (2025) - Z. Ge , J.C. van Gemert , A.S. Gielisse , A. Voulimeneas

Optical flow estimation is a core task in computer vision, yet many existing models struggle with lighting-induced appearance changes that are common in real-world scenarios. This work presents a focused evaluation of recent deep learning-based optical flow models under controlle ...

Bringing a Personal Point of View

Evaluating Dynamic 3D Gaussian Splatting for Egocentric Scene Reconstruction

Master thesis (2025) - J. Warchocki , J.C. van Gemert , M. Weinmann

Egocentric video provides a unique view into human perception and interaction, with growing relevance for augmented reality, robotics, and assistive technologies. However, rapid camera motion and complex scene dynamics pose major challenges for 3D reconstruction from this perspec ...

Bringing a Personal Point of View: Evaluating Dynamic 3D Gaussian Splatting for Egocentric Scene Reconstruction

Master thesis (2025) - J. Warchocki , J.C. van Gemert , M. Weinmann

Egocentric video provides a unique view into human perception and interaction, with growing relevance for augmented reality, robotics, and assistive technologies. However, rapid camera motion and complex scene dynamics pose major challenges for 3D reconstruction from this perspec ...

Learning to Count - Algorithmic Counting

Master thesis (2025) - R.A.A. Overwater , J.C. van Gemert , J.F.P. Kooij , Holger Caesar

Visual counting is an important task in computer vision with broad applications in areas such as crowd monitoring, agriculture, and environmental analysis. While deep learning has significantly advanced this field by enabling models to learn robust feature representations, deep l ...

SpeechCAT: Cross-Attentive Transformer for Audio to Motion Generation

Master thesis (2025) - S. Deaconu , X. Zhang , J.C. van Gemert , H. Wang

Audio-to-motion generation is an important task with applications in virtual avatar creation for XR systems and intelligent robot control in daily life scenarios.
Most current motion generation methods depend on a single encoder-decoder architecture to simultaneously model a ...

Automatic Hand Landmark Detection for Leprosy Diagnosis

Comparison of Output Adaptation Techniques for Hand Keypoint Prediction

Bachelor thesis (2025) - M. Tran , T.C. Markhorst , J.C. van Gemert , K. Liang , Z.Y. Lin

Early detection of leprosy, a neglected tropical disease, is crucial to preventing irreversible nerve damage and disability. Analyzing temperature vari- ations in hands using infrared (IR) cameras offers a potential low-cost alternative to existing medical equipment for early det ...

Skin temperature measurement for diagnosing leprosy in Nepal

Automatically measuring localized changes in temperature in the hand using IR-RGB thermography

Bachelor thesis (2025) - D.C. Posthumus , J.C. van Gemert , T.C. Markhorst , Z.Y. Lin , K. Liang

This study investigates sensor technologies for di- agnosing leprosy in Nepal, focussing on skin tem- perature in the hands using contact and non-contact sensors. Leprosy affects the peripheral nervous system, causing thermoregulatory dysfunction de- tectable via localized skin t ...