t-EVA

Time-Efficient t-SNE Video Annotation

Conference Paper (2021)
Author(s)

Soroosh Poorgholi (Student TU Delft)

Osman Semih Kayhan (TU Delft - Pattern Recognition and Bioinformatics)

Jan C. van Gemert (TU Delft - Pattern Recognition and Bioinformatics)

DOI related publication
https://doi.org/10.1007/978-3-030-68799-1_12 Final published version
More Info
expand_more
Publication Year
2021
Language
English
Pages (from-to)
153-169
Publisher
Springer
ISBN (print)
9783030687984
Event
25th International Conference on Pattern Recognition Workshops, ICPR 2020 (2021-01-10 - 2021-01-15), Virtual, Online, Italy
Downloads counter
151

Abstract

Video understanding has received more attention in the past few years due to the availability of several large-scale video datasets. However, annotating large-scale video datasets are cost-intensive. In this work, we propose a time-efficient video annotation method using spatio-temporal feature similarity and t-SNE dimensionality reduction to speed up the annotation process massively. Placing the same actions from different videos near each other in the two-dimensional space based on feature similarity helps the annotator to group-label video clips. We evaluate our method on two subsets of the ActivityNet (v1.3) and a subset of the Sports-1M dataset. We show that t-EVA (https://github.com/spoorgholi74/t-EVA ) can outperform other video annotation tools while maintaining test accuracy on video classification.