Featureless

Bypassing feature extraction in action categorization

Conference Paper (2016)

Author(s)

Silvia Pintea (Universiteit van Amsterdam)

Pascal Mettes (Universiteit van Amsterdam)

Jan van Gemert (Universiteit van Amsterdam, TU Delft - Electrical Engineering, Mathematics and Computer Science)

AWM Smeulders (Universiteit van Amsterdam)

Research Group

Pattern Recognition and Bioinformatics

Action recognition, Multiclass Waldboost, Video representations, Feature learning

DOI related publication

https://doi.org/10.1109/ICIP.2016.7532346 Final published version

To reference this document use

https://resolver.tudelft.nl/uuid:c8b86420-aece-4835-a54a-d97457dc8841

Publication Year

2016

Language

English

Pages (from-to)

196-200

Publisher

IEEE

ISBN (print)

978-1-4673-9962-3

ISBN (electronic)

978-1-4673-9961-6

Event

2016 IEEE International Conference on Image Processing (ICIP) (2016-09-25 - 2016-09-28), Phoenix, United States

Collections

Institutional Repository

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

This work introduces an efficient way of learning action categories without the need for feature estimation. The approach starts from low-level values, in a similar style to the successful CNN methods. However, rather than extracting general image features, we learn to predict specific video representations directly from raw video data. The benefit of such an approach is that, at the same computational expense, it can predict 2D video representations as well as 3D, motion-based ones. The proposed model relies on discriminative Waldboost, which we extend to a multiclass formulation for the purpose of learning video representations. The suitability of the proposed approach and its time efficiency are evaluated on the UCF11 action recognition dataset.
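
The abstract does not spell out the classifier, so the following is only a minimal sketch of the sequential early-stopping idea that Waldboost builds on (Wald's sequential test applied to a boosted sum of weak learners). It covers the binary case only and is not the authors' multiclass formulation. The function name `sequential_decision`, the helper `stump`, and all threshold values are hypothetical and chosen purely for illustration.

```python
from typing import Callable, List, Sequence, Tuple

# A weak learner maps a raw input vector to a real-valued vote.
WeakLearner = Callable[[Sequence[float]], float]


def stump(index: int, threshold: float) -> WeakLearner:
    """Hypothetical decision stump on one raw input value: votes +1 or -1."""
    return lambda x: 1.0 if x[index] > threshold else -1.0


def sequential_decision(
    x: Sequence[float],
    weak_learners: List[WeakLearner],
    reject_thresholds: Sequence[float],
    accept_thresholds: Sequence[float],
    final_threshold: float = 0.0,
) -> Tuple[int, int]:
    """Evaluate weak learners one at a time and stop as soon as the
    running score crosses a per-step accept or reject threshold.

    Returns (label, number of weak learners evaluated).
    """
    score = 0.0
    steps = zip(weak_learners, reject_thresholds, accept_thresholds)
    for t, (h, theta_reject, theta_accept) in enumerate(steps, start=1):
        score += h(x)
        if score >= theta_accept:   # confident positive: stop early
            return 1, t
        if score <= theta_reject:   # confident negative: stop early
            return -1, t
    # No early decision: fall back to a plain threshold on the full sum.
    label = 1 if score > final_threshold else -1
    return label, len(weak_learners)


# Illustrative setup: three stumps, identical thresholds at every step.
learners = [stump(0, 0.5), stump(1, 0.5), stump(2, 0.5)]
reject = [-1.5, -1.5, -1.5]
accept = [1.5, 1.5, 1.5]

early_accept = sequential_decision([0.9, 0.9, 0.1], learners, reject, accept)
early_reject = sequential_decision([0.1, 0.1, 0.9], learners, reject, accept)
no_early_stop = sequential_decision([0.9, 0.1, 0.9], learners, reject, accept)
```

In this sketch, inputs that are easy to classify are decided after two weak learners, while ambiguous inputs use all three. That early stopping is the source of the time savings a sequential classifier of this kind can offer.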

Files

1803.06962.pdf

(pdf | 0.622 Mb)

License info not available