SMVNet: Deep Learning Architectures for Accurate and Robust Multi-View Stereopsis

被引:0
作者
Yao, Shizeng [1 ]
Wang, Yangyang [1 ]
AliAkbarpour, Hadi [1 ]
Seetharaman, Guna [2 ]
Rao, Raghuveer [2 ]
Palaniappan, Kannappan [1 ]
机构
[1] Univ Missouri, Dept EECS, Columbia, MO 65211 USA
[2] US Naval Res Lab, Washington, DC USA
来源
2020 IEEE APPLIED IMAGERY PATTERN RECOGNITION WORKSHOP (AIPR): TRUSTED COMPUTING, PRIVACY, AND SECURING MULTIMEDIA | 2020年
关键词
Computer Vision; MVS; Deep Learning; City-Scale; RECONSTRUCTION;
D O I
10.1109/AIPR50011.2020.9425188
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We describe Spatial Voxel-Net (SVNet) and Multi-View Voxel-Net (MVNet), a cascade of two novel deep learning architectures for calibrated multi-view stereopsis that reconstructs complicated outdoor 3D models accurately. Both networks use a sequence of RGB images based on ordered camera poses in a coarse-to-fine fashion. SVNet extracts summarized features and analyzes the spatial relationship among a block of 3D voxels using 3D convolutions, then predicts block-level occupancy information. MVNet then receives the occupancy information together with RGB images to predict the final voxel-level occupancy information. SMVNet is an end-to-end trainable network, which can reconstruct complex outdoor 3D models and be applied to largescale datasets in a parallel fashion without the need of estimating or fusing multiple depth maps, typical of other approaches. We evaluated SMVNet on the complex outdoor Tanks and Temples dataset, in which outperformed two well-known state-of-the-art MVS algorithms.
引用
收藏
页数:6
相关论文
共 34 条
[1]  
[Anonymous], 2006, 2006 IEEE COMP SOC C
[2]  
[Anonymous], 2006, VISION MODELLING VIS
[3]  
Bradley D., 2008, Canada Report on Bioenergy 2008, P1
[4]   Flux tensor constrained geodesic active contours with sensor fusion for persistent object tracking [J].
Bunyak, Filiz ;
Palaniappan, Kannappan ;
Nath, Sumit Kumar ;
Seetharaman, Gunasekaran .
Journal of Multimedia, 2007, 2 (04) :20-33
[5]  
Eigen D, 2014, ADV NEUR IN, V27
[6]   Silhouette and stereo fusion for 3D object modeling [J].
Esteban, CH ;
Schmitt, F .
COMPUTER VISION AND IMAGE UNDERSTANDING, 2004, 96 (03) :367-392
[7]   Variational principles, surface evolution, PDE's, level set methods, and the stereo problem [J].
Faugeras, O ;
Keriven, R .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 1998, 7 (03) :336-344
[8]   DeepStereo: Learning to Predict New Views from the World's Imagery [J].
Flynn, John ;
Neulander, Ivan ;
Philbin, James ;
Snavely, Noah .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :5515-5524
[9]   Accurate, Dense, and Robust Multiview Stereopsis [J].
Furukawa, Yasutaka ;
Ponce, Jean .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2010, 32 (08) :1362-1376
[10]   Towards Internet-scale Multi-view Stereo [J].
Furukawa, Yasutaka ;
Curless, Brian ;
Seitz, Steven M. ;
Szeliski, Richard .
2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, :1434-1441