MFNet: Multi-level fusion aware feature pyramid based multi-view stereo network for 3D reconstruction

被引：0

作者：

Youcheng Cai

Lin Li

Dong Wang

Xiaoping Liu

机构：

[1] Hefei University of Technology,The School of Computer Science and Information Engineering

[2] Ministry of Education,The Engineering Research Center of Safety Critical Industrial Measurement and Control Technology

来源：

Applied Intelligence | 2023年 / 53卷

关键词：

Multi-view stereo; Multi-level fusions; Feature pyramid; Group-wise correlation;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

We present an efficient multi-view stereo (MVS) network for 3D reconstruction from multi-view images. While the existing state-of-the-art methods have achieved satisfactory results, the accuracy and scalability remain an open problem due to unreliable dense matching and memory-consuming cost volume regularization. To this end, we propose a multi-level fusion aware feature pyramid based multi-view stereo network (MFNet) for reliable depth inference. First, we adopt a coarse-to-fine strategy that achieves high-resolution depth estimation based on the coarse depth map. This strategy gradually narrows the depth search interval by using the prior information from the previous stage, which dramatically reduces memory consumption. Second, we conduct multi-level fusions to construct the feature pyramid such that the different level features receive information from each other, thus enabling rich multi-level feature representations. Finally, the group-wise correlation similarity measure is introduced to replace the variance-based approach used in previous works for cost volume construction, resulting in a lightweight and effective cost volume representation. Experimental results on the DTU, Tanks & Temples, and BlendedMVS benchmark datasets show that MFNet achieves better results than the state-of-the-art methods.

引用

页码：4289 / 4301

页数：12

共 33 条

[1] Tola E(2012)Efficient large-scale multi-view stereo for ultra high-resolution image sets Mach Vis Appl 23 903-920
[2] Strecha C(2010)Accurate, dense, and robust multiview stereopsis IEEE Trans Pattern Anal Mach Intell 32 1362-1376
[3] Fua P(2016)Large-scale data for multiple-view stereopsis Int J Comput Vis 120 153-168
[4] Furukawa Y(2017)Tanks and temples: benchmarking large-scale scene reconstruction ACM Trans Graph 36 1-13
[5] Aanaes H(2011)Multiview stereo and silhouette consistency via convex functionals over convex domains IEEE Trans Pattern Anal Mach Intell 33 1161-1174
[6] Jensen RR(2016)Detail-preserving and content-aware variational multi-view stereo reconstruction IEEE Transactions on Image Processing 25 864-877
[7] Vogiatzis G(2021)Image robust recognition based on feature-entropy-oriented differential fusion capsule network Appl Intell 51 1108-1117
[8] Tola E(2021)Deep learning in multi-object detection and tracking: state of the art Appl Intell 51 6400-6429
[9] Dahl AB(2020)surfacenet+: an end-to-end 3d neural network for very sparse multi-view stereopsis IEEE Trans Pattern Anal Mach Intell 43 4078-4093
[10] Knapitsch A(undefined)undefined undefined undefined undefined-undefined

← 1 2 3 4 →