A comparative study of point clouds semantic segmentation using three different neural networks on the railway station dataset

Journal Article (2021)
Authors

Y. A. Lumban-Gaol (Student TU Delft, Geospatial Information Agency)

Z Chen (Student TU Delft)

M. Smit (Student TU Delft)

X. Li (Student TU Delft)

M. A. Erbaşu (Student TU Delft)

E Verbree (TU Delft - Architecture and the Built Environment, GIS Technologie)

Jesús Balado (GIS Technologie)

B.M. Meijers (GIS Technologie)

C.G. Van Der Vaart (ESRI)

Research Group
GIS Technologie
Copyright
© 2021 Y. A. Lumban-Gaol, Z. Chen, M. Smit, X. Li, M. A. Erbaşu, E. Verbree, J. Balado Frías, B.M. Meijers, C.G. van der Vaart
More Info
expand_more
Publication Year
2021
Language
English
Copyright
© 2021 Y. A. Lumban-Gaol, Z. Chen, M. Smit, X. Li, M. A. Erbaşu, E. Verbree, J. Balado Frías, B.M. Meijers, C.G. van der Vaart
Research Group
GIS Technologie
Issue number
B3-2021
Volume number
43
Pages (from-to)
223-228
DOI:
https://doi.org/10.5194/isprs-archives-XLIII-B3-2021-223-2021
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Point cloud data have rich semantic representations and can benefit various applications towards a digital twin. However, they are unordered and anisotropically distributed, thus being unsuitable for a typical Convolutional Neural Networks (CNN) to handle. With the advance of deep learning, several neural networks claim to have solved the point cloud semantic segmentation problem. This paper evaluates three different neural networks for semantic segmentation of point clouds, namely PointNet++, PointCNN and DGCNN. A public indoor scene of the Amersfoort railway station is used as the study area. Unlike the typical indoor scenes and even more from the ubiquitous outdoor ones in currently available datasets, the station consists of objects such as the entrance gates, ticket machines, couches, and garbage cans. For the experiment, we use subsets from the data, remove the noise, evaluate the performance of the selected neural networks. The results indicate an overall accuracy of more than 90% for all the networks but vary in terms of mean class accuracy and mean Intersection over Union (IoU). The misclassification mainly occurs in the classes of couch and garbage can. Several factors that may contribute to the errors are analyzed, such as the quality of the data and the proportion of the number of points per class. The adaptability of the networks is also heavily dependent on the training location: the overall characteristics of the train station make a trained network for one location less suitable for another.