Prior depth-based multi-view stereo network for online 3D model reconstruction

被引：13

作者：

Song, Soohwan ^{[1
]}

Truong, Khang Giang ^{[2
]}

Kim, Daekyum ^{[3
]}

Jo, Sungho ^{[2
]}

机构：

[1] ETRI, Intelligent Robot Res Div, Daejeon 34129, South Korea

[2] Korea Adv Inst Sci & Technol, Sch Comp, Daejeon 34141, South Korea

[3] Harvard Univ, John A Paulson Sch Engn & Appl Sci, Cambridge, MA 02138 USA

来源：

PATTERN RECOGNITION | 2023年 / 136卷

基金：

新加坡国家研究基金会;

关键词：

Multi-view stereo; Deep learning; Online 3D reconstruction; SLAM;

D O I：

10.1016/j.patcog.2022.109198

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This study addresses the online multi-view stereo (MVS) problem when reconstructing precise 3D mod-els in real time. To solve this problem, most previous studies adopted a motion stereo approach that sequentially estimates depth maps from multiple localized images captured in a local time window. To compute the depth maps quickly, the motion stereo methods process down-sampled images or use a simplified algorithm for cost volume regularization; therefore, they generally produce reconstructed 3D models that are inaccurate. In this paper, we propose a novel online MVS method that accurately re-constructs high-resolution 3D models. This method infers prior depth information based on sequentially estimated depths and leverages it to estimate depth maps more precisely. The method constructs a cost volume by using the prior-depth-based visibility information and then fuses the prior depths into the cost volume. This approach significantly improves the stereo matching performance and completeness of the estimated depths. Extensive experiments showed that the proposed method outperforms other state-of-the-art MVS and motion stereo methods. In particular, it significantly improves the completeness of 3D models.(c) 2022 Elsevier Ltd. All rights reserved.

引用

页数：12

共 39 条

[21] ORB-SLAM: A Versatile and Accurate Monocular SLAM System
Mur-Artal, Raul
Montiel, J. M. M.
Tardos, Juan D.
[J]. IEEE TRANSACTIONS ON ROBOTICS, 2015, 31 (05) : 1147 - 1163
[22] Newcombe RA, 2011, IEEE I CONF COMP VIS, P2320, DOI 10.1109/ICCV.2011.6126513
[23] Pizzoli M, 2014, IEEE INT CONF ROBOT, P2609, DOI 10.1109/ICRA.2014.6907233
[24] A semi-supervised approach to space carving
Prakash, Surya
Robles-Kelly, Antonio
[J]. PATTERN RECOGNITION, 2010, 43 (02) : 506 - 518
[25] U-Net: Convolutional Networks for Biomedical Image Segmentation
Ronneberger, Olaf
Fischer, Philipp
Brox, Thomas
[J]. MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION, PT III, 2015, 9351 : 234 - 241
[26] Pixelwise View Selection for Unstructured Multi-View Stereo
Schonberger, Johannes L.
Zheng, Enliang
Frahm, Jan-Michael
Pollefeys, Marc
[J]. COMPUTER VISION - ECCV 2016, PT III, 2016, 9907 : 501 - 518
[27] View Path Planning via Online Multiview Stereo for 3-D Modeling of Large-Scale Structures
Song, Soohwan
Kim, Daekyum
Choi, Sunghee
[J]. IEEE TRANSACTIONS ON ROBOTICS, 2022, 38 (01) : 372 - 390
[28] Tang C., 2018, arXiv
[29] Uncertainty estimation for stereo matching based on evidential deep learning
Wang, Chen
Wang, Xiang
Zhang, Jiawei
Zhang, Liang
Bai, Xiao
Ning, Xin
Zhou, Jun
Hancock, Edwin
[J]. PATTERN RECOGNITION, 2022, 124
[30] Wang FJH, 2021, Arxiv, DOI [arXiv:2112.05126, DOI 10.48550/ARXIV.2112.05126]

← 1 2 3 4 →