Evaluation of Video Summarization using DSNet and Action Localization Datasets

None, None

Evaluation of Video Summarization using DSNet and Action Localization Datasets

Bachelor Thesis (2021)

Author(s)

D.H.E. Groenewegen (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

O. Strafforello – Mentor (TU Delft - Electrical Engineering, Mathematics and Computer Science)

S Khademi – Graduation committee member (TU Delft - History, Form & Aesthetics)

T. Höllt – Coach (TU Delft - Computer Graphics and Visualisation)

Faculty

Electrical Engineering, Mathematics and Computer Science

Copyright

Deep learning Action localization dataset Video summarization DSNet Supervised learning

To reference this document use:

https://resolver.tudelft.nl/uuid:f463d54d-de06-4ae3-8106-59d2d4e9353d

More Info

expand_more

Publication Year

2021

Language

English

Copyright

Graduation Date

01-07-2021

Awarding Institution

Delft University of Technology

Project

['CSE3000 Research Project']

Programme

['Computer Science and Engineering']

Abstract

In this paper, the DSNet framework used for automatic video summarization gets reviewed when using action localization datasets. The problem facing video summarizations using deep learning techniques is that datasets can be subjective depending on preferences of human annotators, making for noise in the labeling. This paper will look at a anchor-based approach and anchor-free approach which were introduced by the DSNet framework. More specific it will evaluate in experiments using different hyper-parameters if these approaches gain an increased performances when using action localization datasets instead. These results will show the increase in accuracy when using action localization datasets. Moreover it will compare the different approaches, meaning anchor-based and anchor-free, and see if they still have comparable performance with the method.

Files

Final_paper.pdf

(pdf | 0.375 Mb)

License info not available