DS-MVSNet: Unsupervised Multi-view Stereo via Depth Synthesis

Cited by: 10

Authors:

Li, Jingliang [1]

Lu, Zhengda [1]

Wang, Yiqun [2,3]

Wang, Ying [1]

Xiao, Jun [1]

Affiliations:

[1] University of Chinese Academy of Sciences, School of Artificial Intelligence, Beijing, People's Republic of China

[2] Chongqing University, College of Computer Science, Chongqing, People's Republic of China

[3] KAUST, Thuwal, Saudi Arabia

Source:

PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022 | 2022

Funding:

National Natural Science Foundation of China;

Keywords:

multi-view stereo; 3D reconstruction; depth estimation; surface reconstruction;

DOI:

10.1145/3503161.3548352

Chinese Library Classification (CLC) number:

TP39 [Computer applications];

Discipline classification code:

081203; 0835;

Abstract:

In recent years, supervised and unsupervised learning-based MVS methods have achieved excellent performance compared with traditional methods. However, these methods use the probability volume computed by cost volume regularization only to predict reference depths, which leaves much of the information in the probability volume unexploited. Furthermore, unsupervised methods usually rely on two-step training or additional inputs, which complicates the training procedure. In this paper, we propose DS-MVSNet, an end-to-end unsupervised MVS architecture with source depth synthesis. To mine the information in the probability volume, we synthesize the source depths by splatting the probability volume and depth hypotheses onto the source views. We also propose adaptive Gaussian sampling and an improved adaptive bins sampling approach that improve the accuracy of the depth hypotheses. In addition, we use the source depths to render the reference images and propose a depth consistency loss and a depth smoothness loss. These losses provide additional guidance based on photometric and geometric consistency across views, without additional inputs. Finally, we conduct a series of experiments on the DTU and Tanks & Temples datasets that demonstrate the efficiency and robustness of DS-MVSNet compared with state-of-the-art methods.
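As a rough illustration of two ideas the abstract mentions, the sketch below shows, for a single pixel, how a depth can be read from a probability distribution over depth hypotheses as its probability-weighted mean, and how a new set of hypotheses might be placed using a Gaussian fitted to that distribution. This is not the authors' implementation: the function names, the quantile-based placement rule, and the `min_std` floor are all assumptions made for illustration, and the paper's adaptive Gaussian sampling may differ in its details.

```python
from statistics import NormalDist


def expected_depth(probs, depths):
    """Probability-weighted mean of the depth hypotheses (soft-argmax)."""
    total = sum(probs)
    return sum(p * d for p, d in zip(probs, depths)) / total


def depth_std(probs, depths):
    """Standard deviation of the per-pixel depth distribution."""
    total = sum(probs)
    mean = expected_depth(probs, depths)
    var = sum(p * (d - mean) ** 2 for p, d in zip(probs, depths)) / total
    return var ** 0.5


def gaussian_hypotheses(probs, depths, num_samples, min_std=1e-3):
    """Place new hypotheses at equally spaced quantiles of N(mean, std).

    Assumption: quantile placement is a hypothetical stand-in for the
    paper's sampling rule. Hypotheses cluster near the expected depth
    when the distribution is peaked and spread out when it is uncertain.
    """
    mean = expected_depth(probs, depths)
    # A floor on std keeps the Gaussian valid for one-hot distributions.
    std = max(depth_std(probs, depths), min_std)
    dist = NormalDist(mean, std)
    # Quantiles (k + 0.5) / num_samples stay strictly inside (0, 1).
    return [dist.inv_cdf((k + 0.5) / num_samples) for k in range(num_samples)]


if __name__ == "__main__":
    # Hypothetical per-pixel values: five depth hypotheses and their probabilities.
    depths = [400.0, 500.0, 600.0, 700.0, 800.0]
    probs = [0.05, 0.15, 0.60, 0.15, 0.05]
    print(expected_depth(probs, depths))          # close to 600
    print(gaussian_hypotheses(probs, depths, 4))  # four values centred on the mean
```

In a real network these operations would run over whole probability volumes as tensor operations; the scalar version only makes the arithmetic explicit.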

Citation

Pages: 5593-5601

Page count: 9

Reference count: 41

