Evaluation of the SUM-GAN-AAE method for Video Summarization

Bachelor thesis (2021)

Authors

G.D. Trevnenski Electrical Engineering, Mathematics and Computer Science

Contributors

O. Strafforello Pattern Recognition and Bioinformatics - (supervisor 1)

S. Khademi History, Form & Aesthetics - Architecture and the Built Environment (supervisor 2)

T. Höllt Computer Graphics and Visualisation - (coach)

Faculty

Electrical Engineering, Mathematics and Computer Science

Video Summarization Unsupervised learning Rank Correlation Action localization

More Info

expand_more

To reference this document use:

http://resolver.tudelft.nl/uuid:77f7bb22-6979-4807-ab84-2636a39e5923

Published Date

01-07-2021

Language

English

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Faculty

Electrical Engineering, Mathematics and Computer Science

Abstract

Video summarization is a task which many researchers have tried to automate with deep learning methods. One of these methods is the SUM-GAN-AAE algorithm developed by Apostolidis et al. which is an unsupervised machine learning method evaluated in this study. The research aims at testing the algorithm's performance on the Breakfast dataset, which is an action localization dataset, and evaluate it with rank correlation coefficients. Parameter optimization was performed to tune the learning rate of the system according to the Breakfast dataset. Then, by using k-fold cross-validation, three metrics were used to evaluate the trained model - F-Score, Kendall's τ and Spearman's ρ. Analysis of the results indicates a high F-Score as reported by the SUM-GAN-AAE paper but low rank correlation coefficients. Moreover, plotting importance scores per frame demonstrates the algorithm's inability to select key frames. The findings suggest that F-Score is not a fitting metric to use in the context of video summarization and the SUM-GAN-AAE algorithm performs poorly not only on action localization datasets but also on video summarization ones such as SumMe.

Files

Research_paper_SUM_GAN_AAE_TU_... (.pdf)

(.pdf | 0.389 Mb)