DDL-MVS: Depth Discontinuity Learning for Multi-View Stereo Networks

Journal Article (2023)
Author(s)

N. Ibrahimli (TU Delft - Urban Data Science)

Hugo Ledoux (TU Delft - Urban Data Science)

J.F.P. Kooij (TU Delft - Intelligent Vehicles)

L. Nan (TU Delft - Urban Data Science)

Research Group
Urban Data Science
Copyright
© 2023 N. Ibrahimli, H. Ledoux, J.F.P. Kooij, L. Nan
DOI related publication
https://doi.org/10.3390/rs15122970
More Info
expand_more
Publication Year
2023
Language
English
Copyright
© 2023 N. Ibrahimli, H. Ledoux, J.F.P. Kooij, L. Nan
Research Group
Urban Data Science
Issue number
12
Volume number
15
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

We propose an enhancement module called depth discontinuity learning (DDL) for learning-based multi-view stereo (MVS) methods. Traditional methods are known for their accuracy but struggle with completeness. While recent learning-based methods have improved completeness at the cost of accuracy, our DDL approach aims to improve accuracy while retaining completeness in the reconstruction process. To achieve this, we introduce the joint estimation of depth and boundary maps, where the boundary maps are explicitly utilized for further refinement of the depth maps. We validate our idea by integrating it into an existing learning-based MVS pipeline where the reconstruction depends on high-quality depth map estimation. Extensive experiments on various datasets, namely DTU, ETH3D, “Tanks and Temples”, and BlendedMVS, show that our method improves reconstruction quality compared to our baseline, Patchmatchnet. Our ablation study demonstrates that incorporating the proposed DDL significantly reduces the depth map error, for instance, by more than 30% on the DTU dataset, and leads to improved depth map quality in both smooth and boundary regions. Additionally, our qualitative analysis has shown that the reconstructed point cloud exhibits enhanced quality without any significant compromise on completeness. Finally, the experiments reveal that our proposed model and strategies exhibit strong generalization capabilities across the various datasets.