Print Email Facebook Twitter Group Equivariant Video Action Recognition Title Group Equivariant Video Action Recognition: Making action-recognition networks equivariant to temporal direction and discrete spatial rotations Author Basu, Debadeep (TU Delft Electrical Engineering, Mathematics and Computer Science) Contributor van Gemert, J.C. (mentor) Lofi, C. (graduation committee) Strafforello, O. (graduation committee) Degree granting institution Delft University of Technology Programme Computer Science Date 2021-12-22 Abstract This work applies the theory of group equivariance to the domain of video action recognition replacing standard 3Dconvolutions with group convolutions which are equivariant to temporal direction, and multiples of 90-degree spatial rotations. We propose a temporal direction symmetry group T2 and extend the standard planar rotations group to three dimensions to form a 3D group that is equivariant to discrete 90-degree spatial rotations. We analyse the efficacy of using these 3D-G-CNNs as drop-in replacements in 3D networks by evaluating synthesized datasets containing handwritten MNIST digits moving over a black background, as well as popular action recognition datasets UCF-101and HMDB-51, and comparing the results against the performance of the standard 3D CNNs on the datasets. Subject Action RecognitionGroup equivarianceComputer VisionDeep Learning To reference this document use: http://resolver.tudelft.nl/uuid:d69313b9-adb0-4440-9b27-24bb2e30bf96 Part of collection Student theses Document type master thesis Rights © 2021 Debadeep Basu Files PDF Group_Equivariant_Video_A ... 089107.pdf 7.55 MB Close viewer /islandora/object/uuid:d69313b9-adb0-4440-9b27-24bb2e30bf96/datastream/OBJ/view