TemporalMaxer Performance in the Face of Constraint: A Study in Temporal Action Localization

None, None

TemporalMaxer Performance in the Face of Constraint: A Study in Temporal Action Localization

A Comprehensive Analysis on the Adaptability of TemporalMaxer in Resource-Scarce Environments

Bachelor Thesis (2023)

Author(s)

T. Oprescu (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

J.C. van Gemert – Mentor (TU Delft - Pattern Recognition and Bioinformatics)

A. Lengyel – Mentor (TU Delft - Pattern Recognition and Bioinformatics)

R. Bruintjes – Mentor (TU Delft - Pattern Recognition and Bioinformatics)

O. Strafforello – Mentor (TU Delft - Pattern Recognition and Bioinformatics)

Petr Kellnhofer – Graduation committee member (TU Delft - Computer Graphics and Visualisation)

Faculty

Electrical Engineering, Mathematics and Computer Science

Copyright

Deep Learning Computer Vision TAL Temporal Action Localization Data effciency Compute Efficiency TemporalMaxer

To reference this document use:

https://resolver.tudelft.nl/uuid:8708db58-52e1-45ca-9a34-8f71cf0f256b

More Info

expand_more

Publication Year

2023

Language

English

Copyright

Graduation Date

29-06-2023

Awarding Institution

Delft University of Technology

Project

CSE3000 Research Project

Programme

Computer Science and Engineering

Abstract

This paper presents an analysis of the data and compute efficiency of the TemporalMaxer deep learning model in the context of temporal action localization (TAL), which involves accurately detecting the start and end times of specific video actions. The study explores the performance and scalability of the TemporalMaxer model under limited resources and data availability, focusing on factors such as hardware requirements, training time, and data utilization, thus contributing to the advancement of efficient deep learning models for real-world video tasks. Through a literature review of temporal action recognition models, evaluation of learning curves for data efficiency, and development of metrics to assess the compute efficiency, the study provides insights into the performance trade-offs of the TemporalMaxer model. Experiments conducted on the widely used THUMOS dataset further demonstrate the model's generalizability with limited data, achieving significant accuracy performance with only 50% of the training data. Notably, TemporalMaxer exhibits superior compute efficiency by significantly reducing the number of Multiply-Accumulate operations (MACs) compared to other state-of-the-art models. However, alternative models like TriDet and TadTR outperform TemporalMaxer in training time-constrained scenarios. These findings shed light on the model's practical applicability in resource-constrained environments, offering insights for further optimization and study.

Files

Teodor_Oprescu_Bachelor_Thesis... (pdf)

(pdf | 1.28 Mb)

License info not available