Searched for: +
(1 - 13 of 13)
document
Pintea, S. (author), Lin, Y. (author), Dijkstra, Jouke (author), van Gemert, J.C. (author)
A number of computer vision deep regression approaches report improved results when adding a classification loss to the regression loss. Here, we explore why this is useful in practice and when it is beneficial. To do so, we start from precisely controlled dataset variations and data samplings and find that the effect of adding a classification...
conference paper 2023
document
de Boer, Frans (author), van Gemert, J.C. (author), Dijkstra, Jouke (author), Pintea, S. (author)
Activity progress prediction aims to estimate what percentage of an activity has been completed. Currently this is done with machine learning approaches, trained and evaluated on complicated and realistic video datasets. The videos in these datasets vary drastically in length and appearance. And some of the activities have unanticipated...
conference paper 2023
document
Liu, X. (author), van Gemert, J.C. (author), Nejadasl, Fatemeh Karimi (author), Booij, O. (author), Pintea, S. (author)
Objects in videos are typically characterized by continuous smooth motion. We exploit continuous smooth motion in three ways. 1) Improved accuracy by using object motion as an additional source of supervision, which we obtain by anticipating object locations from a static keyframe. 2) Improved efficiency by only doing the expensive feature...
conference paper 2023
document
Lin, Y. (author), Wiersma, R.T. (author), Pintea, S. (author), Hildebrandt, K.A. (author), Eisemann, E. (author), van Gemert, J.C. (author)
Deep learning has improved vanishing point detection in images. Yet, deep networks require expensive annotated datasets trained on costly hardware and do not generalize to even slightly different domains, and minor problem variants. Here, we address these issues by injecting deep vanishing point detection networks with prior knowledge. This...
conference paper 2022
document
Pintea, S. (author), Sharma, S. (author), Vossepoel, F.C. (author), van Gemert, J.C. (author), Loog, M. (author), Verschuur, D.J. (author)
This article investigates bypassing the inversion steps involved in a standard litho-type classification pipeline and performing the litho-type classification directly from imaged seismic data. We consider a set of deep learning methods that map the seismic data directly into litho-type classes, trained on two variants of synthetic seismic data:...
journal article 2021
document
Lin, Y. (author), Pintea, S. (author), van Gemert, J.C. (author)
Current work on lane detection relies on large manually annotated datasets. We reduce the dependency on annotations by leveraging massive cheaply available unlabelled data. We propose a novel loss function exploiting geometric knowledge of lanes in Hough space, where a lane can be identified as a local maximum. By splitting lanes into separate...
conference paper 2021
document
Lelekas, Ioannis (author), Tömen, N. (author), Pintea, S. (author), van Gemert, J.C. (author)
Biological vision adopts a coarse-to-fine information processing pathway, from initial visual detection and binding of salient features of a visual scene, to the enhanced and preferential processing given relevant stimuli. On the contrary, CNNs employ a fine-to-coarse processing, moving from local, edge-detecting filters to more global ones...
conference paper 2020
document
Stahl, Tobias (author), Pintea, S. (author), van Gemert, J.C. (author)
We propose a general object counting method that does not use any prior category information. We learn from local image divisions to predict global image-level counts without using any form of local annotations. Our method separates the input image into a set of image divisions - each fully covering the image. Each image division is composed...
journal article 2019
document
Pintea, S. (author), Zheng, Jian (author), Li, Xilin (author), Bank, Paulina J.M. (author), van Hilten, Jacobus J. (author), van Gemert, J.C. (author)
We focus on the problem of estimating human hand-tremor frequency from input RGB video data. Estimating tremors from video is important for non-invasive monitoring, analyzing and diagnosing patients suffering from motor-disorders such as Parkinson’s disease. We consider two approaches for hand-tremor frequency estimation: (a) a Lagrangian...
conference paper 2019
document
Hommos, Omar (author), Pintea, S. (author), Mettes, Pascal S.M. (author), van Gemert, J.C. (author)
Currently, the most common motion representation for action recognition is optical flow. Optical flow is based on particle tracking which adheres to a Lagrangian perspective on dynamics. In contrast to the Lagrangian perspective, the Eulerian model of dynamics does not track, but describes local changes. For video, an Eulerian phase-based...
conference paper 2019
document
Pintea, S. (author), Liu, Yue (author), van Gemert, J.C. (author)
Knowledge distillation compacts deep networks by letting a small student network learn from a large teacher network. The accuracy of knowledge distillation recently benefited from adding residual layers. We propose to reduce the size of the student network even further by recasting multiple residual layers in the teacher network into a single...
conference paper 2018
document
Zhang, Yichao (author), Pintea, S. (author), van Gemert, J.C. (author)
The ability to amplify or reduce subtle image changes over time is useful in contexts such as video editing, medical video analysis, product quality control and sports. In these contexts there is often large motion present which severely distorts current video amplification methods that magnify change linearly. In this work we propose a method...
conference paper 2017
document
Pintea, S. (author), Mettes, Pascal (author), van Gemert, J.C. (author), Smeulders, AWM (author)
This method introduces an efficient manner of learning action categories without the need of feature estimation. The approach starts from low-level values, in a similar style to the successful CNN methods. However, rather than extracting general image features, we learn to predict specific video representations from raw video data. The benefit...
conference paper 2016
Searched for: +
(1 - 13 of 13)