Adaptive depth estimation for pyramid multi-view stereo

被引:13
作者
Liao, Jie [1 ]
Fu, Yanping [2 ]
Yan, Qingan [3 ]
Luo, Fei [1 ]
Xiao, Chunxia [1 ]
机构
[1] Wuhan Univ, Sch Comp Sci, Wuhan, Peoples R China
[2] Anhui Univ, Sch Comp Sci & Technol, Hefei, Peoples R China
[3] JD Com, Silicon Valley Res Ctr Multimedia Software, Beijing, Peoples R China
来源
COMPUTERS & GRAPHICS-UK | 2021年 / 97卷
关键词
3D Reconstruction; Multi-View Stereo; Deep Learning; RECONSTRUCTION;
D O I
10.1016/j.cag.2021.04.016
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In this paper, we propose a Multi-View Stereo (MVS) network which can perform efficient high-resolution depth estimation with low memory consumption. Classical learning-based MVS approaches typically construct 3D cost volumes to regress depth information, making the output resolution rather limited as the memory consumption grows cubically with the input resolution. Although recent approaches have made significant progress in scalability by introducing the coarse-to-fine fashion or sequential cost map regularization, the memory consumption still grows quadratically with input resolution and is not friendly for commodity GPU. Observing that the surfaces of most objects in real world are locally smooth, we assume that most of the depth hypotheses upsampled from a well-estimated depth map are accurate. Based on the assumption, we propose a pyramid MVS network based on the adaptive depth estimation, which gradually refines and upsamples the depth map to the desired resolution. Instead of estimating depth hypotheses for all pixels in the depth map, our method only performs prediction at adaptively selected locations, alleviating excessive computation on well-estimated positions. To estimate depth hypotheses for sparse selected locations, we propose the lightweight pixelwise depth estimation network, which can estimate depth value for each selected location independently. Experiments demonstrate that our method can generate results comparable with the state-of-the-art learning-based methods while reconstructing more geometric details and consuming less GPU memory. (c) 2021 Elsevier Ltd. All rights reserved.
引用
收藏
页码:268 / 278
页数:11
相关论文
共 33 条
[1]   Large-Scale Data for Multiple-View Stereopsis [J].
Aanaes, Henrik ;
Jensen, Rasmus Ramsbol ;
Vogiatzis, George ;
Tola, Engin ;
Dahl, Anders Bjorholm .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2016, 120 (02) :153-168
[2]   Building Rome in a Day [J].
Agarwal, Sameer ;
Snavely, Noah ;
Simon, Ian ;
Seitz, Steven M. ;
Szeliski, Richard .
2009 IEEE 12TH INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2009, :72-79
[3]  
Campbell NDF, 2008, LECT NOTES COMPUT SC, V5302, P766, DOI 10.1007/978-3-540-88682-2_58
[4]   Point-Based Multi-View Stereo Network [J].
Chen, Rui ;
Han, Songfang ;
Xu, Jing ;
Su, Hao .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :1538-1547
[5]   A space-sweep approach to true multi-image matching [J].
Collins, RT .
1996 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, PROCEEDINGS, 1996, :358-363
[6]   Accurate, Dense, and Robust Multiview Stereopsis [J].
Furukawa, Yasutaka ;
Ponce, Jean .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2010, 32 (08) :1362-1376
[7]   Massively Parallel Multiview Stereopsis by Surface Normal Diffusion [J].
Galliani, Silvano ;
Lasinger, Katrin ;
Schindler, Konrad .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :873-881
[8]  
Gu X., 2020, CVPR
[9]   DeepMVS: Learning Multi-view Stereopsis [J].
Huang, Po-Han ;
Matzen, Kevin ;
Kopf, Johannes ;
Ahuja, Narendra ;
Huang, Jia-Bin .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :2821-2830
[10]   Large Scale Multi-view Stereopsis Evaluation [J].
Jensen, Rasmus ;
Dahl, Anders ;
Vogiatzis, George ;
Tola, Engin ;
Aanaes, Henrik .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :406-413