Point-Based Multi-View Stereo Network

被引:272
作者
Chen, Rui [1 ,3 ]
Han, Songfang [2 ,3 ]
Xu, Jing [1 ]
Su, Hao [3 ]
机构
[1] Tsinghua Univ, Beijing, Peoples R China
[2] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China
[3] Univ Calif San Diego, San Diego, CA 92103 USA
来源
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019) | 2019年
关键词
D O I
10.1109/ICCV.2019.00162
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We introduce Point-MVSNet, a novel point-based deep framework for multi-view stereo (MVS). Distinct from existing cost volume approaches, our method directly processes the target scene as point clouds. More specifically, our method predicts the depth in a coarse-to-fine manner. We first generate a coarse depth map, convert it into a point cloud and refine the point cloud iteratively by estimating the residual between the depth of the current iteration and that of the ground truth. Our network leverages 3D geometry priors and 2D texture information jointly and effectively by fusing them into a feature-augmented point cloud, and processes the point cloud to estimate the 3D flow for each point. This point-based architecture allows higher accuracy, more computational efficiency and more flexibility than cost-volume-based counterparts. Experimental results show that our approach achieves a significant improvement in reconstruction quality compared with state-of-the-art methods on the DTU and the Tanks and Temples dataset.
引用
收藏
页码:1538 / 1547
页数:10
相关论文
共 31 条
[21]  
Qi C. R., 2017, ARXIV170602413CS
[22]  
Qi Charles R., 2016, ARXIV161200593CS
[23]   OctNet: Learning Deep 3D Representations at High Resolutions [J].
Riegler, Gernot ;
Ulusoy, Ali Osman ;
Geiger, Andreas .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6620-6629
[24]  
Tang Chengzhou, 2018, BA NET DENSE BUNDLE
[25]   Octree Generating Networks: Efficient Convolutional Architectures for High-resolution 3D Outputs [J].
Tatarchenko, Maxim ;
Dosovitskiy, Alexey ;
Brox, Thomas .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :2107-2115
[26]   Efficient large-scale multi-view stereo for ultra high-resolution image sets [J].
Tola, Engin ;
Strecha, Christoph ;
Fua, Pascal .
MACHINE VISION AND APPLICATIONS, 2012, 23 (05) :903-920
[27]   Multiview stereo via volumetric graph-cuts and occlusion robust photo-consistency [J].
Vogiatzis, George ;
Hernandez, Carlos ;
Torr, Philip H. S. ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2007, 29 (12) :2241-2246
[28]   O-CNN: Octree-based Convolutional Neural Networks for 3D Shape Analysis [J].
Wang, Peng-Shuai ;
Liu, Yang ;
Guo, Yu-Xiao ;
Sun, Chun-Yu ;
Tong, Xin .
ACM TRANSACTIONS ON GRAPHICS, 2017, 36 (04)
[29]  
Wang Yue, 2018, ARXIV180107829CS
[30]  
Yao Yao, 2018, ARXIV180402505CS