Evaluation of Video Summarization Using Fully Convolutional Sequence Networks on Action Localization Datasets


Abstract

In the problem of video summarization, the goal is to select a subset of the input frames that conveys the most important information of the input video. Collecting data for this task proves challenging, in part because human annotators disagree on which segments of a video should be considered important for a summary. In this study we analyse a new dataset created with the goal of increasing agreement between human annotators. The dataset has been created with a novel annotation method, which uses existing action localization labels for segmenting the videos. We train a supervised and an unsupervised deep learning framework on popularly used benchmark datasets and on the new dataset. Experimental results show the effectiveness of this novel summary annotation method in improving the agreement between annotators. Analysis reveals some issues with the evaluation of the deep learning framework.