Semantic segmentation of mobile laser scanning point clouds with long short-term memory networks

Preliminary results

Journal Article (2021)
Author(s)

J. Balado Frías (TU Delft - GIS Technologie, Universidade de Vigo, Vigo)

PJM Van Oosterom (TU Delft - GIS Technologie)

L. Díaz-Vilarino (TU Delft - GIS Technologie, University of Vigo)

P Arias-Sanchez (Universidade de Vigo, Vigo)

Research Group
GIS Technologie
Copyright
© 2021 J. Balado Frías, P.J.M. van Oosterom, L. Díaz-Vilarino, P. Arias
DOI related publication
https://doi.org/10.5194/isprs-archives-XLIII-B2-2021-123-2021
More Info
expand_more
Publication Year
2021
Language
English
Copyright
© 2021 J. Balado Frías, P.J.M. van Oosterom, L. Díaz-Vilarino, P. Arias
Research Group
GIS Technologie
Issue number
B2-2021
Volume number
43
Pages (from-to)
123-130
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Although point clouds are characterized as a type of unstructured data, timestamp attribute can structure point clouds into scanlines and shape them into a time signal. The present work studies the transformation of the street point cloud into a time signal based on the Z component for the semantic segmentation using Long Short-Term Memory (LSTM) networks. The experiment was conducted on the point cloud of a real case study. Several training sessions were performed changing the Level of Detail of the classification (coarse level with 3 classes and fine level with 11 classes), two levels of network depth and the use of weighting for the improvement of classes with low number of points. The results showed high accuracy, reaching at best 97.3% in the classification with 3 classes (ground, buildings, and objects) and 95.7% with 11 classes. The distribution of the success rates was not the same for all classes. The classes with the highest number of points obtained better results than the others. The application of weighting improved the classes with few points at the expense of the classes with more points. Increasing the number of hidden layers was shown as a preferable alternative to weighting. Given the high success rates and a behaviour of the LSTM consistent with other Neural Networks in point cloud processing, it is concluded that the LSTM is a feasible alternative for the semantic segmentation of point clouds transformed into time signals.