EV-Mask-RCNN: Instance Segmentation in Event-based Videos

None, None

EV-Mask-RCNN: Instance Segmentation in Event-based Videos

Bachelor Thesis (2022)

Author(s)

A. Băltăreţu (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

Nergis Tömen – Mentor (TU Delft - Pattern Recognition and Bioinformatics)

Ombretta Strafforello – Mentor (TU Delft - Pattern Recognition and Bioinformatics)

X. Liu – Mentor (TU Delft - Pattern Recognition and Bioinformatics)

Luciano Siebert – Graduation committee member (TU Delft - Interactive Intelligence)

Faculty

Electrical Engineering, Mathematics and Computer Science

Copyright

Event Cameras Instance Segmentation Dynamic Vision Sensor Mask R-CNN

To reference this document use:

https://resolver.tudelft.nl/uuid:5ed706db-509c-4a0c-96e4-64ad2be7ddca

More Info

expand_more

Publication Year

2022

Language

English

Copyright

Graduation Date

22-06-2022

Awarding Institution

Delft University of Technology

Project

CSE3000 Research Project

Programme

Computer Science and Engineering

Faculty

Electrical Engineering, Mathematics and Computer Science

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Instance segmentation on data from Dynamic Vision Sensors (DVS) is an important computer vision task that needs to be tackled in order to push the research forward on these types of inputs. This paper aims to show that deep learning based techniques can be used to solve the task of instance segmentation on DVS data. A high performing model was used to solve this task, using event-based data that was transformed into RGB-D images. The chosen model for this work was Mask R-CNN, with an alteration for depth images, because of its high performance on frame based data. The N-MNIST dataset provides the event-based input, and the transformation of such an input is presented in this study. Furthermore, the masks are generated with the help of the MNIST dataset and heuristics are used for placing them at the correct positions. The results are promising and comparable to other results from literature on the task of semantic segmentation.

Files

EV_Mask_RCNN_Instance_Segmenta... (pdf)

(pdf | 2.5 Mb)

License info not available