Fast and Compact Image Segmentation using Instance Stixels

Journal Article (2022)
Authors

Thomas Hehn (TU Delft - Intelligent Vehicles)

Julian Francisco Pieter Kooij (TU Delft - Intelligent Vehicles)

D. M. Gavrila (TU Delft - Intelligent Vehicles)

Research Group
Intelligent Vehicles
Copyright
© 2022 T.M. Hehn, J.F.P. Kooij, D. Gavrila
To reference this document use:
https://doi.org/10.1109/TIV.2021.3067223
More Info
expand_more
Publication Year
2022
Language
English
Copyright
© 2022 T.M. Hehn, J.F.P. Kooij, D. Gavrila
Research Group
Intelligent Vehicles
Issue number
1
Volume number
7
Pages (from-to)
45-56
DOI:
https://doi.org/10.1109/TIV.2021.3067223
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

State-of-the-art stixel methods fuse dense stereo disparity and semantic class information, e.g. from a Convolutional Neural Network (CNN), into a compact representation of driveable space, obstacles and background. However, they do not explicitly differentiate instances within the same semantic class. We investigate several ways to augment single-frame stixels with instance information, which can be extracted by a CNN from the RGB image input. As a result, our novel Instance Stixels method efficiently computes stixels that account for boundaries of individual objects, and represents instances as grouped stixels that express connectivity. Experiments on the Cityscapes dataset demonstrate that including instance information into the stixel computation itself, rather than as a post-processing step, increases the segmentation performance (i.e. Intersection over Union and Average Precision). This holds especially for overlapping objects of the same class. Furthermore, we show the superiority of our approach in terms of segmentation performance and computational efficiency compared to combining the separate outputs of Semantic Stixels and a state-of-the-art pixel-level CNN. We achieve processing throughput of 28 frames per second on average for 8 pixel wide stixels on images from the Cityscapes dataset at 1792x784 pixels. Our Instance Stixels software is made freely available for non-commercial research purposes.

Files

Fast_and_Compact_Image_Segment... (pdf)
(pdf | 4.89 Mb)
- Embargo expired in 18-09-2020
License info not available