- document
-
Haarman, Luuk (author)Convolutional Neural Networks (CNNs) benefit from fine-grained details in high-resolution images, but these images are not always easily available as data collection can be expensive or time-consuming. Transfer learning pre-trains models on data from a related domain before fine-tuning on the main domain, and is a common strategy to deal with...master thesis 2023
- document
-
Masouris, Thanos (author)Chess recognition refers to the task of identifying the chess pieces configuration from a chessboard image. Contrary to the predominant approach that aims to solve this task through the pipeline of chessboard detection, square localization, and piece classification, we rely on the power of deep learning models and introduce two novel...master thesis 2023
- document
-
Sharma, Anirvin (author)Image data augmentation has been regarded as a reliable and effective way to increase the data available for training. With the advent and rise of Generative AI, generative data augmentation has been shown to realize even better gains in performance for downstream tasks. However, these performance gains are often the cause of "extra information"...master thesis 2023
- document
-
Lieuw A Soe, Devin (author)This paper studies the effect of integrating color equivariance and invariance into object detection, in particular into the Faster R-CNN architecture. To better understand the influence of this integration, we introduce modifications to the traditional convolutional layers of the standard Faster R-CNN model. By employing group theory in a...master thesis 2023
- document
-
Mullaj, Dajt (author)Deep convolutional neural networks (CNNs) have achieved current state-of-the-art in image denoising, but require large datasets for training. Their performance remains limited on smaller real-noise datasets. In this paper, we investigate robust deep learning denoising using transfer learning. We explore the impact of dataset sizes, CNN parameter...master thesis 2023
- document
-
Markhorst, Thomas (author)In this paper, we combine image denoising and classification, aiming to enhance human perception of noisy images captured by edge devices, like security cameras. Since edge devices have little computational power, we also optimize for efficiency by proposing a novel architecture that integrates the two tasks. Additionally, we alter a Neural...master thesis 2023
- document
-
WANG, HAORAN (author)In contrast to the prevalent focus on real photos in computer vision research, we present a contribution by making the Ot & Sien dataset machine learning-ready for object detection tasks in illustrations. We refer to the new dataset as Ot & Sien++ that is composed of scanned images of children’s book illustrations, thereby venturing into...master thesis 2023
- document
-
Lin, Zhi-Yi (author)Human 3D kinematics estimation involves measuring joint angles and body segment scales to quantify and analyze the mechanics of human movements. It has applications in areas such as injury prevention, disease identification, and sports science. Conventional marker-based motion capture methods are expensive both in terms of financial investment...master thesis 2023
- document
-
Hoonhout, Douwe (author)Video temporal action localization is the task of identifying and localizing specific actions or activities within a video stream. Instead of only classifying which actions occur in the video stream, we aim to detect when an action begins and ends. In this work, we focus on solving this task without any supervision. Existing unsupervised methods...master thesis 2023
- document
-
de Boer, Frans (author)In this paper, we investigate the behaviour of current progress prediction methods on the currently used benchmark datasets. We show that the progress prediction methods can fail to extract useful information from visual data on these datasets. Moreover, when the methods fail to extract visual information, memory-based methods adopt a frame...master thesis 2023
- document
-
Dămăcuş, Alex (author)In temporal action localization, given an input video, the goal is to predict the action that is present in the video, along with its temporal boundaries. Several powerful models have been proposed throughout the years, with transformer-based models achieving state-of-the-art performance in the recent months. Although novel models are becoming...bachelor thesis 2023
- document
-
Wang, Yunhan (author)Temporal Action Localization (TAL) aims to localize the start and end times of actions in untrimmed videos and classify the corresponding action types. TAL plays an important role in understanding video. Existing TAL approaches heavily rely on deep learning and require large-scale data and expensive training processes. Recent advances in...bachelor thesis 2023
- document
-
Warchocki, Jan (author)In temporal action localization, given an input video, the goal is to predict which actions it contains, where they begin and where they end. Training and testing current state-of-the-art, deep learning models is done assuming access to large amounts of data and computational power. Gathering such data is however a challenging task and access to...bachelor thesis 2023
- document
-
Oprescu, Teodor (author)This paper presents an analysis of the data and compute efficiency of the TemporalMaxer deep learning model in the context of temporal action localization (TAL), which involves accurately detecting the start and end times of specific video actions. The study explores the performance and scalability of the TemporalMaxer model under limited...bachelor thesis 2023
- document
-
Misterka, Paul (author)Temporal Action Localization (TAL) is an important problem in computer vision with uses in video surveillance and recommendation, healthcare, entertainment, and human-computer interaction. Being an inherently data-heavy process, TAL has been bound by the availability of computing power, resulting in its slow pace of innovation. This work aims to...bachelor thesis 2023
- document
-
El Coudi El Amrani, Nafie (author)Neural Radiance Fields (NeRFs) have demonstrated remarkable capabilities in photo-realistic 3D reconstruction. NeRFs often take as input posed images where the camera poses come from either off-the-shelf S\textit{f}M or online optimization together with NeRFs. However, we find that both strategies yield suboptimal results in recovering camera...master thesis 2023
- document
-
Li, Xinqi (author)Quantitative cardiac MRI is an increasingly important diagnostic tool for cardiovascular diseases. Yet, it is essential to have correct image registration for good accuracy and precision of quantitative mapping. Registering all baseline images from a quantitative cardiac MRI sequence, however, is nontrivial because the patient is moving, leading...master thesis 2023
- document
-
Kam, Berend (author)<br/>Machine learning algorithms (learners) are typically expected to produce monotone learning curves, meaning that their performance improves as the size of the training dataset increases. However, it is important to note that this behavior is not universally observed. Recently monotonicity of learning curves has gained renewed attention, as...master thesis 2023
- document
-
Gielisse, A.S. (author)Most recent works on optical flow use convex upsampling as the last step to obtain high-resolution flow. In this work, we show and discuss several issues and limitations of this currently widely adopted convex upsampling approach. We propose a series of changes, inspired by the observation that convex upsampling as currently implemented performs...master thesis 2023
- document
-
Gökçe, Tolga (author)An algal bloom is defined as a rapid increase in common algae (phytoplankton) abundance in water bodies and it can occur when a group of certain environmental factors is combined. If the algae populations grow out of control, such algal blooms become problematic and cause damage to the ecosystem, such phenomena are called harmful algal blooms....bachelor thesis 2023