Prior depth-based multi-view stereo network for online 3D model reconstruction

Cited by: 13

Authors:

Song, Soohwan [1]

Truong, Khang Giang [2]

Kim, Daekyum [3]

Jo, Sungho [2]

Affiliations:

[1] ETRI, Intelligent Robot Res Div, Daejeon 34129, South Korea

[2] Korea Adv Inst Sci & Technol, Sch Comp, Daejeon 34141, South Korea

[3] Harvard Univ, John A Paulson Sch Engn & Appl Sci, Cambridge, MA 02138 USA

Source:

PATTERN RECOGNITION | 2023 / Vol. 136

Funding:

National Research Foundation, Singapore;

Keywords:

Multi-view stereo; Deep learning; Online 3D reconstruction; SLAM;

DOI:

10.1016/j.patcog.2022.109198

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This study addresses the online multi-view stereo (MVS) problem when reconstructing precise 3D models in real time. To solve this problem, most previous studies adopted a motion stereo approach that sequentially estimates depth maps from multiple localized images captured in a local time window. To compute the depth maps quickly, the motion stereo methods process down-sampled images or use a simplified algorithm for cost volume regularization; therefore, they generally produce reconstructed 3D models that are inaccurate. In this paper, we propose a novel online MVS method that accurately reconstructs high-resolution 3D models. This method infers prior depth information based on sequentially estimated depths and leverages it to estimate depth maps more precisely. The method constructs a cost volume by using the prior-depth-based visibility information and then fuses the prior depths into the cost volume. This approach significantly improves the stereo matching performance and completeness of the estimated depths. Extensive experiments showed that the proposed method outperforms other state-of-the-art MVS and motion stereo methods. In particular, it significantly improves the completeness of 3D models. © 2022 Elsevier Ltd. All rights reserved.

Pages: 12
