Semantic Scene Completion using Local Deep Implicit Functions on LiDAR Data

Journal Article (2022)
Authors

C.B. Rist (Mercedes-Benz, TU Delft - Intelligent Vehicles)

David Emmerichs (Mercedes-Benz)

Markus Enzweiler (Esslingen University)

Dariu M. Gavrila (TU Delft - Intelligent Vehicles)

Research Group
Intelligent Vehicles
Copyright
© 2022 C.B. Rist, David Emmerichs, Markus Enzweiler, D. Gavrila
To reference this document use:
https://doi.org/10.1109/TPAMI.2021.3095302
More Info
expand_more
Publication Year
2022
Language
English
Copyright
© 2022 C.B. Rist, David Emmerichs, Markus Enzweiler, D. Gavrila
Research Group
Intelligent Vehicles
Issue number
10
Volume number
44
Pages (from-to)
7205-7218
DOI:
https://doi.org/10.1109/TPAMI.2021.3095302
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Semantic scene completion is the task of jointly estimating 3D geometry and semantics of objects and surfaces within a given extent. This is a particularly challenging task on real-world data that is sparse and occluded. We propose a scene segmentation network based on local Deep Implicit Functions as a novel learning-based method for scene completion. Unlike previous work on scene completion, our method produces a continuous scene representation that is not based on voxelization. We encode raw point clouds into a latent space locally and at multiple spatial resolutions. A global scene completion function is subsequently assembled from the localized function patches. We show that this continuous representation is suitable to encode geometric and semantic properties of extensive outdoor scenes without the need for spatial discretization (thus avoiding the trade-off between level of scene detail and the scene extent that can be covered). We train and evaluate our method on semantically annotated LiDAR scans from the Semantic KITTI dataset. Our experiments verify that our method generates a powerful representation that can be decoded into a dense 3D description of a given scene. The performance of our method surpasses the state of the art on the Semantic KITTI Scene Completion Benchmark in terms of geometric completion intersection-over-union (IoU).

Files

Semantic_Scene_Completion_Usin... (pdf)
(pdf | 1.95 Mb)
- Embargo expired in 01-07-2023
License info not available