Motion representations for privacy-aware cross-domain action recognition

None, None; None, None; None, None; None, None

Motion representations for privacy-aware cross-domain action recognition

Journal Article (2026)

Author(s)

P. Benschop (National Policelab AI & Model-Driven Decisions Lab, TU Delft - Electrical Engineering, Mathematics and Computer Science)

J.C. van Gemert (TU Delft - Electrical Engineering, Mathematics and Computer Science)

J.P. Mense (National Policelab AI & Model-Driven Decisions Lab)

J.H.G. Dauwels (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Research Group

Signal Processing Systems

Action recognition Privacy Domain adaptation Optical flow Cross-domain

DOI related publication

https://doi.org/10.3389/fimag.2026.1846329 Final published version

To reference this document use

https://resolver.tudelft.nl/uuid:f273edb4-0992-4a7e-9014-16e7d1f5dee0

More Info

expand_more

Publication Year

2026

Language

English

Research Group

Signal Processing Systems

Journal title

Frontiers in Imaging

Volume number

5

Downloads counter

11

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Video captured for action recognition often contains sensitive appearance cues such as faces, skin color, and clothing. Models trained on such data may exploit these cues rather than the underlying motion, raising privacy concerns in real-world deployment. In this work, we study action recognition under a motion-focused constraint: the model receives only motion representations that capture pixel displacement over time, while reducing appearance cues that expose identity or scene context. We focus on motion-history images and optical flow as learning-free representations that reduce identifiable appearance information while retaining action recognition accuracy. Our motion I3D model achieves approximately 31% and 52% zero-shot top-1 accuracy on HMDB-51 and UCF-101, respectively, outperforming non-CLIP direct-transfer baselines trained on Kinetics-400 despite operating without any appearance input. In 16-shot adaptation, the same model reaches 52% and 83% top-1 accuracy. In the domain adaptation setting on TP-HMDB↔TP-UCF, our motion-focused models achieve higher action recognition accuracy than prior privacy-preserving methods. Sensitive attribute predictability is reduced relative to RGB by a comparable margin, without requiring a learned privacy filter. On PA-HMDB51, optical flow is the strongest motion representation for privacy preservation, approaching chance level for skin-color prediction and remaining below RGB on most privacy attributes, indicating that motion representations retain useful action information while exposing less personal information.

Files

Fimag-5-1846329.pdf

(pdf | 0.819 Mb)