Print Email Facebook Twitter Efficient Video Action Recognition Title Efficient Video Action Recognition: How well does TriDet perform and generalize in a limited compute power and data setting? Author Dămăcuş, Alex (TU Delft Electrical Engineering, Mathematics and Computer Science) Contributor van Gemert, J.C. (mentor) Strafforello, O. (mentor) Bruintjes, R. (mentor) Lengyel, A. (mentor) Kellnhofer, P. (graduation committee) Degree granting institution Delft University of Technology Corporate name Delft University of Technology Programme Computer Science and Engineering Project CSE3000 Research Project Date 2023-06-29 Abstract In temporal action localization, given an input video, the goal is to predict the action that is present in the video, along with its temporal boundaries. Several powerful models have been proposed throughout the years, with transformer-based models achieving state-of-the-art performance in the recent months. Although novel models are becoming more and more accurate, authors rarely study how limited training data or computation power environments affect the performance of their model. This study is carried out on TriDet, a transformer-based temporal action localization model that achieves state-of-the-art performance on two different benchmarks. It evaluates the model’s behavior in a limited training data and computation power environment. It is found that TriDet achieves close to state-of-the-art performance when only 60% of the training data or approximately 90 action instances per class are used. It is also notable that inference time, memory usage, multiply-accumulate operations and GPU utilization scale linearly along with the length of the tensor that is passed to the model. These findings, combined with TriDet’s mean training time of 11 minutes on the THUMOS’14 dataset can be used to determine the model’s hypothetical behavior when run in lower computation power environments. Subject temporal action localizationComputer VisionTriDetTAL To reference this document use: http://resolver.tudelft.nl/uuid:c520548c-f30c-49bc-9058-e31952a15eb2 Part of collection Student theses Document type bachelor thesis Rights © 2023 Alex Dămăcuş Files PDF Alexandru_Damacus_Final_Paper.pdf 3.12 MB Close viewer /islandora/object/uuid:c520548c-f30c-49bc-9058-e31952a15eb2/datastream/OBJ/view