SMVNet: Deep Learning Architectures for Accurate and Robust Multi-View Stereopsis

被引：0

作者：

Yao, Shizeng ^{[1
]}

Wang, Yangyang ^{[1
]}

AliAkbarpour, Hadi ^{[1
]}

Seetharaman, Guna ^{[2
]}

Rao, Raghuveer ^{[2
]}

Palaniappan, Kannappan ^{[1
]}

机构：

[1] Univ Missouri, Dept EECS, Columbia, MO 65211 USA

[2] US Naval Res Lab, Washington, DC USA

来源：

2020 IEEE APPLIED IMAGERY PATTERN RECOGNITION WORKSHOP (AIPR): TRUSTED COMPUTING, PRIVACY, AND SECURING MULTIMEDIA | 2020年

关键词：

Computer Vision; MVS; Deep Learning; City-Scale; RECONSTRUCTION;

D O I：

10.1109/AIPR50011.2020.9425188

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We describe Spatial Voxel-Net (SVNet) and Multi-View Voxel-Net (MVNet), a cascade of two novel deep learning architectures for calibrated multi-view stereopsis that reconstructs complicated outdoor 3D models accurately. Both networks use a sequence of RGB images based on ordered camera poses in a coarse-to-fine fashion. SVNet extracts summarized features and analyzes the spatial relationship among a block of 3D voxels using 3D convolutions, then predicts block-level occupancy information. MVNet then receives the occupancy information together with RGB images to predict the final voxel-level occupancy information. SMVNet is an end-to-end trainable network, which can reconstruct complex outdoor 3D models and be applied to largescale datasets in a parallel fashion without the need of estimating or fusing multiple depth maps, typical of other approaches. We evaluated SMVNet on the complex outdoor Tanks and Temples dataset, in which outperformed two well-known state-of-the-art MVS algorithms.

引用

页数：6

共 34 条

[1]

[Anonymous], 2006, 2006 IEEE COMP SOC C

[2]

[Anonymous], 2006, VISION MODELLING VIS

[3]

Bradley D., 2008, Canada Report on Bioenergy 2008, P1

[4] Flux tensor constrained geodesic active contours with sensor fusion for persistent object tracking [J].