PUNet

None, None; None, None; None, None

PUNet

Temporal Action Proposal Generation With Positive Unlabeled Learning Using Key Frame Annotations

Conference Paper (2021)

Author(s)

Noor ul Sehr Zia (Student TU Delft)

Osman Semih Kayhan (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Jan van Gemert (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Research Group

Pattern Recognition and Bioinformatics

Proposal Generation Action Localization Positive-Unlabeled Learning

DOI related publication

https://doi.org/10.1109/ICIP42928.2021.9506012 Final published version

To reference this document use

https://resolver.tudelft.nl/uuid:65c551db-46c8-42a6-8227-d299f9697f30

More Info

expand_more

Publication Year

2021

Language

English

Research Group

Pattern Recognition and Bioinformatics

Article number

9506012

Pages (from-to)

2598-2602

ISBN (print)

978-1-6654-3102-6

ISBN (electronic)

978-1-6654-4115-5

Event

2021 IEEE International Conference on Image Processing (ICIP) (2021-09-19 - 2021-09-22), Virtual at Anchorage, United States

Downloads counter

153

Abstract

Popular approaches to classifying action segments in long, realistic, untrimmed videos start with high quality action proposals. Current action proposal methods based on deep learning are trained on labeled video segments. Obtaining annotated segments for untrimmed videos is time consuming, expensive and error-prone as annotated temporal action boundaries are imprecise, subjective and inconsistent. By embracing this uncertainty we explore to significantly speed up temporal annotations by using just a single key frame label for each action instance instead of the inherently imprecise start and end frames. To tackle the class imbalance by using only a single frame, we evaluate an extremely simple Positive-Unlabeled algorithm (PU-learning). We demonstrate on THUMOS’14 and ActivityNet that using a single key frame label give good results while being significantly faster to annotate. In addition, we show that our simple method, PUNet
¹, is data-efficient which further reduces the need for expensive annotations.