Semantically-Guided 3D Building Facade Reconstruction: A Learning-Based MVS Approach


Abstract

This thesis introduces a Learning-Based Multi-View Semantic Stereo method that addresses the limitations of traditional and learning-based Multi-View Stereo (MVS) techniques in reconstructing reflective and low-textured regions, which are particularly prevalent in 3D models of buildings. Traditional methods lack completeness, while learning-based methods struggle with accuracy. Focusing on enhancing 3D building models, this research integrates semantic information into an existing deep learning architecture for depth prediction, CasMVSNet, to guide the reconstruction process. Three key strategies are employed: first, the incorporation of semantic maps into the network through a multi-modal approach; second, the introduction of a multi-modal refinement module at the end of the CasMVSNet model to improve the initial output depth maps; and third, the introduction of two new loss terms designed to enforce varying degrees of smoothness on specific semantic categories. Experiments on the DTU dataset demonstrate a significant improvement in accuracy at the point-cloud level while maintaining the completeness of the reconstructed models. Validation and generalization experiments on the ETH3D dataset show consistent patterns. This research showcases the potential of integrating semantic guidance into the 3D reconstruction of buildings, advancing the field of computer vision.
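
To make the third strategy concrete, the sketch below illustrates one way a class-weighted depth-smoothness term could be written in PyTorch. It is a minimal illustration under assumed tensor shapes, class labels, and weights; it is not the thesis's exact loss formulation.

```python
import torch


def semantic_smoothness_loss(depth, semantics, class_weights):
    """Penalise depth gradients more strongly inside selected semantic classes.

    depth:         (B, 1, H, W) predicted depth map
    semantics:     (B, H, W)    integer semantic labels
    class_weights: dict mapping label id -> smoothness weight (assumed values)
    """
    # First-order depth gradients along x and y.
    grad_x = torch.abs(depth[:, :, :, 1:] - depth[:, :, :, :-1])
    grad_y = torch.abs(depth[:, :, 1:, :] - depth[:, :, :-1, :])

    # Per-pixel weight map built from the semantic labels.
    weights = torch.zeros_like(depth)
    for label, w in class_weights.items():
        weights = weights + w * (semantics.unsqueeze(1) == label).float()

    loss_x = (weights[:, :, :, 1:] * grad_x).mean()
    loss_y = (weights[:, :, 1:, :] * grad_y).mean()
    return loss_x + loss_y


# Example: enforce stronger smoothness on hypothetical "wall" (1) and
# "window" (2) classes than on the background class (0).
depth = torch.rand(2, 1, 64, 80)
semantics = torch.randint(0, 3, (2, 64, 80))
loss = semantic_smoothness_loss(depth, semantics, {0: 0.1, 1: 1.0, 2: 2.0})
print(loss.item())
```

In this sketch, varying the per-class weights is what enforces "varying degrees of smoothness on specific semantic categories": reflective or low-textured classes such as walls and windows receive larger weights, while other classes are left largely unconstrained.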