EMO-MVS: Error-Aware Multi-Scale Iterative Variable Optimizer for Efficient Multi-View Stereo

Citations: 12
Authors
Zhou, Huizhou [1]
Zhao, Haoliang [2]
Wang, Qi [1,2]
Lei, Liang [1,3]
Hao, Gefei [2]
Xu, Yusheng [4]
Ye, Zhen [4]
Affiliations
[1] Guangdong Univ Technol, Sch Phys & Optoelect Engn, Guangzhou 510000, Peoples R China
[2] Guizhou Univ, State Key Lab Publ Big Data, Guiyang 550000, Peoples R China
[3] Guangdong Prov Key Lab Informat Photon Technol, Guangzhou 510000, Peoples R China
[4] Tongji Univ, Coll Surveying & Geoinformat, Shanghai 200000, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
multi-view stereo; 3D reconstruction; depth estimation; stereo vision; RECONSTRUCTION;
DOI
10.3390/rs14236085
Chinese Library Classification
X [Environmental Science, Safety Science];
Discipline classification code
08 ; 0830 ;
Abstract
Efficient dense reconstruction of objects or scenes has substantial practical implications and can be applied to various 3D tasks (for example, robotics and autonomous driving). However, because of the expensive hardware required and the overall complexity of all-around scenarios, efficient dense reconstruction using lightweight multi-view stereo methods has received much attention from researchers. The technological challenge of efficient dense reconstruction is maintaining low memory usage while rapidly and reliably acquiring depth maps. Most current efficient multi-view stereo (MVS) methods perform poorly in efficient dense reconstruction, mainly because of weak generalization and unrefined object edges in the depth maps. To this end, we propose EMO-MVS, which aims to accomplish multi-view stereo tasks with high efficiency, meaning low memory consumption, high accuracy, and excellent generalization performance. In detail, we first propose an iterative variable optimizer to accurately estimate depth changes. Then, we design a multi-level absorption unit that expands the receptive field and efficiently generates an initial depth map. In addition, we propose an error-aware enhancement module that enhances the initial depth map by optimizing the projection error between multiple views. We conducted extensive experiments on the challenging Tanks and Temples and DTU datasets, and also performed a complete visualization comparison on the BlendedMVS validation set (which contains many aerial scene images), achieving promising performance on all datasets. Among lightweight MVS methods with low memory consumption and fast inference speed, our F-score on the online Tanks and Temples intermediate benchmark is the highest, which shows that our method is the most competitive in balancing performance and computational cost.
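The abstract reports an F-score on the Tanks and Temples benchmark. As a rough illustration only, the sketch below shows how an F-score of this kind is commonly computed from point-to-point distances: precision is the fraction of reconstructed points within a distance threshold of the ground truth, recall is the fraction of ground-truth points within that threshold of the reconstruction, and the F-score is their harmonic mean. The function name `f_score`, its inputs, and the threshold value are hypothetical and chosen for illustration; this is not the benchmark's evaluation code or the authors' implementation.

```python
from typing import Sequence, Tuple


def f_score(
    dist_pred_to_gt: Sequence[float],
    dist_gt_to_pred: Sequence[float],
    threshold: float,
) -> Tuple[float, float, float]:
    """Return (precision, recall, F-score) for precomputed nearest-neighbor distances.

    dist_pred_to_gt: distance from each reconstructed point to its nearest
        ground-truth point (assumed to be precomputed elsewhere).
    dist_gt_to_pred: distance from each ground-truth point to its nearest
        reconstructed point.
    threshold: distance below which a point counts as correctly matched.
    """
    if not dist_pred_to_gt or not dist_gt_to_pred:
        return 0.0, 0.0, 0.0

    # Precision: share of reconstructed points that lie close to the ground truth.
    precision = sum(d < threshold for d in dist_pred_to_gt) / len(dist_pred_to_gt)
    # Recall: share of ground-truth points that are covered by the reconstruction.
    recall = sum(d < threshold for d in dist_gt_to_pred) / len(dist_gt_to_pred)

    if precision + recall == 0.0:
        return precision, recall, 0.0

    # Harmonic mean of precision and recall.
    f = 2.0 * precision * recall / (precision + recall)
    return precision, recall, f


if __name__ == "__main__":
    # Toy distances (illustrative values, not results from the paper).
    p, r, f = f_score([0.1, 0.2, 0.9, 0.05], [0.1, 0.6], threshold=0.5)
    print(f"precision={p:.2f} recall={r:.2f} f_score={f:.2f}")
```

Because the harmonic mean is dominated by the smaller of the two terms, a reconstruction must be both accurate and complete to score well, which is why the metric suits a comparison of performance against computational cost.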
Pages: 19