Unreached potentials of RGB-D segmentation

Master Thesis (2024)
Authors

P. Benschop (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Supervisors

Jan C. van Gemert (TU Delft - Pattern Recognition and Bioinformatics)

A.S. Gielisse (TU Delft - Pattern Recognition and Bioinformatics)

Faculty
Electrical Engineering, Mathematics and Computer Science
Publication Year
2024
Language
English
Graduation Date
11-07-2024
Awarding Institution
Delft University of Technology
Programme
Computer Science
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

It is commonly believed that image recognition based on RGB improves when using RGB-D, i.e., when depth information (distance from the camera) is added. Adding depth should make models more robust to appearance variations in color and lighting, help them recognize shape and spatial relationships, and allow them to ignore irrelevant backgrounds. In this paper we investigate how robust current RGB-D models truly are to changes in appearance, depth, and background: we vary one modality (RGB or depth) at a time and compare RGB-D to RGB-only and depth-only models in a semantic segmentation setting. Experiments show that all investigated RGB-D models have some robustness to variations in color, but can fail severely for unseen variations in lighting, spatial position, and background. Our results show that we need new RGB-D models that can exploit the best of both modalities while remaining robust to changes in a single modality.
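The sketch below illustrates the kind of single-modality robustness protocol the abstract describes, assuming a PyTorch-style setup: one modality (here, RGB brightness) is perturbed while depth is left untouched, and mean IoU is compared across RGB-D, RGB-only, and depth-only segmentation models. The model interfaces, data loader, and perturbation strengths are hypothetical placeholders, not the code used in the thesis.

```python
# Hedged sketch of a single-modality robustness evaluation: scale RGB
# brightness, keep depth fixed, and report mean IoU per model variant.
# All model and loader names below are hypothetical placeholders.
import torch


def mean_iou(pred, target, num_classes):
    """Mean intersection-over-union over classes present in pred or target."""
    ious = []
    for c in range(num_classes):
        pred_c, target_c = (pred == c), (target == c)
        union = (pred_c | target_c).sum().item()
        if union == 0:
            continue  # class absent from both prediction and ground truth
        ious.append((pred_c & target_c).sum().item() / union)
    return sum(ious) / max(len(ious), 1)


@torch.no_grad()
def evaluate(model, loader, num_classes, rgb_scale=1.0):
    """Evaluate a segmentation model while scaling only the RGB input."""
    scores = []
    for rgb, depth, target in loader:           # loader yields RGB, depth, labels
        rgb = (rgb * rgb_scale).clamp(0.0, 1.0)  # perturb RGB; depth stays unchanged
        logits = model(rgb, depth)               # assumed interface: both modalities passed in
        scores.append(mean_iou(logits.argmax(dim=1), target, num_classes))
    return sum(scores) / len(scores)


# Hypothetical usage: sweep a brightness factor and compare the three variants.
# for scale in (1.0, 0.5, 0.25):
#     for name, model in {"rgbd": rgbd_model, "rgb": rgb_model, "depth": depth_model}.items():
#         print(name, scale, evaluate(model, val_loader, num_classes=40, rgb_scale=scale))
```

A robust RGB-D model would be expected to degrade no faster than the RGB-only baseline under such perturbations; the same loop can be repeated with depth perturbed and RGB left intact.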
