Search results | TU Delft Repositories

document

A step towards understanding why classification helps regression

Pintea, S. (author), Lin, Y. (author), Dijkstra, Jouke (author), van Gemert, J.C. (author)

A number of computer vision deep regression approaches report improved results when adding a classification loss to the regression loss. Here, we explore why this is useful in practice and when it is beneficial. To do so, we start from precisely controlled dataset variations and data samplings and find that the effect of adding a classification...

conference paper 2023

document

Is there progress in activity progress prediction?

de Boer, Frans (author), van Gemert, J.C. (author), Dijkstra, Jouke (author), Pintea, S. (author)

Activity progress prediction aims to estimate what percentage of an activity has been completed. Currently this is done with machine learning approaches, trained and evaluated on complicated and realistic video datasets. The videos in these datasets vary drastically in length and appearance. And some of the activities have unanticipated...

conference paper 2023

document

Objects do not disappear: Video object detection by single-frame object location anticipation

Liu, X. (author), van Gemert, J.C. (author), Nejadasl, Fatemeh Karimi (author), Booij, O. (author), Pintea, S. (author)

Objects in videos are typically characterized by continuous smooth motion. We exploit continuous smooth motion in three ways. 1) Improved accuracy by using object motion as an additional source of supervision, which we obtain by anticipating object locations from a static keyframe. 2) Improved efficiency by only doing the expensive feature...

conference paper 2023

document

Deep Vanishing Point Detection: Geometric priors make dataset variations vanish

Lin, Y. (author), Wiersma, R.T. (author), Pintea, S. (author), Hildebrandt, K.A. (author), Eisemann, E. (author), van Gemert, J.C. (author)

Deep learning has improved vanishing point detection in images. Yet, deep networks require expensive annotated datasets trained on costly hardware and do not generalize to even slightly different domains, and minor problem variants. Here, we address these issues by injecting deep vanishing point detection networks with prior knowledge. This...

conference paper 2022

document

Seismic inversion with deep learning: A proposal for litho-type classification

Pintea, S. (author), Sharma, S. (author), Vossepoel, F.C. (author), van Gemert, J.C. (author), Loog, M. (author), Verschuur, D.J. (author)

This article investigates bypassing the inversion steps involved in a standard litho-type classification pipeline and performing the litho-type classification directly from imaged seismic data. We consider a set of deep learning methods that map the seismic data directly into litho-type classes, trained on two variants of synthetic seismic data:...

journal article 2021

document

Semi-Supervised Lane Detection With Deep Hough Transform

Lin, Y. (author), Pintea, S. (author), van Gemert, J.C. (author)

Current work on lane detection relies on large manually annotated datasets. We reduce the dependency on annotations by leveraging massive cheaply available unlabelled data. We propose a novel loss function exploiting geometric knowledge of lanes in Hough space, where a lane can be identified as a local maximum. By splitting lanes into separate...

conference paper 2021

document

Top-down networks: A coarse-to-fine reimagination of CNNs

Lelekas, Ioannis (author), Tömen, N. (author), Pintea, S. (author), van Gemert, J.C. (author)

Biological vision adopts a coarse-to-fine information processing pathway, from initial visual detection and binding of salient features of a visual scene, to the enhanced and preferential processing given relevant stimuli. On the contrary, CNNs employ a fine-to-coarse processing, moving from local, edge-detecting filters to more global ones...

conference paper 2020

document

Divide and Count: Generic Object Counting by Image Divisions

Stahl, Tobias (author), Pintea, S. (author), van Gemert, J.C. (author)

We propose a general object counting method that does not use any prior category information. We learn from local image divisions to predict global image-level counts without using any form of local annotations. Our method separates the input image into a set of image divisions - each fully covering the image. Each image division is composed...

journal article 2019

document

Hand-tremor frequency estimation in videos

Pintea, S. (author), Zheng, Jian (author), Li, Xilin (author), Bank, Paulina J.M. (author), van Hilten, Jacobus J. (author), van Gemert, J.C. (author)

We focus on the problem of estimating human hand-tremor frequency from input RGB video data. Estimating tremors from video is important for non-invasive monitoring, analyzing and diagnosing patients suffering from motor-disorders such as Parkinson’s disease. We consider two approaches for hand-tremor frequency estimation: (a) a Lagrangian...

conference paper 2019

document

Using Phase Instead of Optical Flow for Action Recognition

Hommos, Omar (author), Pintea, S. (author), Mettes, Pascal S.M. (author), van Gemert, J.C. (author)

Currently, the most common motion representation for action recognition is optical flow. Optical flow is based on particle tracking which adheres to a Lagrangian perspective on dynamics. In contrast to the Lagrangian perspective, the Eulerian model of dynamics does not track, but describes local changes. For video, an Eulerian phase-based...

conference paper 2019

document

Recurrent Knowledge Distillation

Pintea, S. (author), Liu, Yue (author), van Gemert, J.C. (author)

Knowledge distillation compacts deep networks by letting a small student network learn from a large teacher network. The accuracy of knowledge distillation recently benefited from adding residual layers. We propose to reduce the size of the student network even further by recasting multiple residual layers in the teacher network into a single...

conference paper 2018

document

Video Acceleration Magnification

Zhang, Yichao (author), Pintea, S. (author), van Gemert, J.C. (author)

The ability to amplify or reduce subtle image changes over time is useful in contexts such as video editing, medical video analysis, product quality control and sports. In these contexts there is often large motion present which severely distorts current video amplification methods that magnify change linearly. In this work we propose a method...

conference paper 2017

document

Featureless: Bypassing feature extraction in action categorization

Pintea, S. (author), Mettes, Pascal (author), van Gemert, J.C. (author), Smeulders, AWM (author)

This method introduces an efficient manner of learning action categories without the need of feature estimation. The approach starts from low-level values, in a similar style to the successful CNN methods. However, rather than extracting general image features, we learn to predict specific video representations from raw video data. The benefit...

conference paper 2016