Multi-distribution fitting for multi-view stereo

被引：0

作者：

Jinguang Chen

Zonghua Yu

Lili Ma

Kaibing Zhang

机构：

[1] Xi’an Polytechnic University,School of Computer Science

来源：

Machine Vision and Applications | 2023年 / 34卷

关键词：

Deep learning; Depth estimate; High resolution; Multi-view stereo; Point cloud;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

We propose a multi-view stereo network based on multi-distribution fitting (MDF-Net), which achieves high-resolution depth map prediction with low memory and high efficiency. This method adopts a four-stage cascade structure, which mainly has the following three contributions. First, view cost regularization is proposed to weaken the influence of matching noise on building the cost volume. Second, it is suggested to adaptively calculate the depth refinement interval using multi-distribution fitting (MDF). Gaussian distribution fitting is used to refine and correct depth within a large interval, and then Laplace distribution fitting is used to accurately estimate depth within a small interval. Third, the lightweight image super-resolution network is applied to upsample the depth map in the fourth stage to reduce running time and memory requirements. The experimental results on the DTU dataset indicate that MDF-Net has achieved the most advanced results. It has the lowest memory consumption and running time among the high-resolution reconstruction methods, requiring only approximately 4.29G memory for predicting a depth map with the resolution of 1600 × 1184. In addition, we validate the generalization ability on Tanks and Temples dataset, achieving very competitive performance. The code has been released at https://github.com/zongh5a/MDF-Net.

引用

共 24 条

[1]

Furukawa Y(2010)Accurate, dense, and robust multiview stereopsis IEEE Trans. Pattern Anal. Mach. Intell. 32 1362-1376

[2]

Ponce J(2012)Efficient large-scale multi-view stereo for ultra high-resolution image sets Mach. Vis. Appl. 23 903-920

[3]

Tola E(2023)Visibility-aware multi-view stereo network Int. J. Comput. Vis. 131 199-214

[4]

Strecha C(2016)Large-scale data for multiple-view stereopsis Int. J. Comput. Vis. 120 153-168

[5]

Fua P(2017)Tanks and temples: benchmarking large-scale scene reconstruction ACM Trans. Graph. 36 1-13

[6]

Zhang J(2020)Real-time scene text detection with differentiable binarization AAAI 34 11474-11481

[7]

Yao Y(undefined)undefined undefined undefined undefined-undefined

[8]

Li S(undefined)undefined undefined undefined undefined-undefined

[9]

Luo Z(undefined)undefined undefined undefined undefined-undefined

[10]

Fang T(undefined)undefined undefined undefined undefined-undefined

← 1 2 3 →