Prior depth-based multi-view stereo network for online 3D model reconstruction

被引:13
作者
Song, Soohwan [1 ]
Truong, Khang Giang [2 ]
Kim, Daekyum [3 ]
Jo, Sungho [2 ]
机构
[1] ETRI, Intelligent Robot Res Div, Daejeon 34129, South Korea
[2] Korea Adv Inst Sci & Technol, Sch Comp, Daejeon 34141, South Korea
[3] Harvard Univ, John A Paulson Sch Engn & Appl Sci, Cambridge, MA 02138 USA
基金
新加坡国家研究基金会;
关键词
Multi-view stereo; Deep learning; Online 3D reconstruction; SLAM;
D O I
10.1016/j.patcog.2022.109198
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This study addresses the online multi-view stereo (MVS) problem when reconstructing precise 3D mod-els in real time. To solve this problem, most previous studies adopted a motion stereo approach that sequentially estimates depth maps from multiple localized images captured in a local time window. To compute the depth maps quickly, the motion stereo methods process down-sampled images or use a simplified algorithm for cost volume regularization; therefore, they generally produce reconstructed 3D models that are inaccurate. In this paper, we propose a novel online MVS method that accurately re-constructs high-resolution 3D models. This method infers prior depth information based on sequentially estimated depths and leverages it to estimate depth maps more precisely. The method constructs a cost volume by using the prior-depth-based visibility information and then fuses the prior depths into the cost volume. This approach significantly improves the stereo matching performance and completeness of the estimated depths. Extensive experiments showed that the proposed method outperforms other state-of-the-art MVS and motion stereo methods. In particular, it significantly improves the completeness of 3D models.(c) 2022 Elsevier Ltd. All rights reserved.
引用
收藏
页数:12
相关论文
共 39 条
  • [21] ORB-SLAM: A Versatile and Accurate Monocular SLAM System
    Mur-Artal, Raul
    Montiel, J. M. M.
    Tardos, Juan D.
    [J]. IEEE TRANSACTIONS ON ROBOTICS, 2015, 31 (05) : 1147 - 1163
  • [22] Newcombe RA, 2011, IEEE I CONF COMP VIS, P2320, DOI 10.1109/ICCV.2011.6126513
  • [23] Pizzoli M, 2014, IEEE INT CONF ROBOT, P2609, DOI 10.1109/ICRA.2014.6907233
  • [24] A semi-supervised approach to space carving
    Prakash, Surya
    Robles-Kelly, Antonio
    [J]. PATTERN RECOGNITION, 2010, 43 (02) : 506 - 518
  • [25] U-Net: Convolutional Networks for Biomedical Image Segmentation
    Ronneberger, Olaf
    Fischer, Philipp
    Brox, Thomas
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION, PT III, 2015, 9351 : 234 - 241
  • [26] Pixelwise View Selection for Unstructured Multi-View Stereo
    Schonberger, Johannes L.
    Zheng, Enliang
    Frahm, Jan-Michael
    Pollefeys, Marc
    [J]. COMPUTER VISION - ECCV 2016, PT III, 2016, 9907 : 501 - 518
  • [27] View Path Planning via Online Multiview Stereo for 3-D Modeling of Large-Scale Structures
    Song, Soohwan
    Kim, Daekyum
    Choi, Sunghee
    [J]. IEEE TRANSACTIONS ON ROBOTICS, 2022, 38 (01) : 372 - 390
  • [28] Tang C., 2018, arXiv
  • [29] Uncertainty estimation for stereo matching based on evidential deep learning
    Wang, Chen
    Wang, Xiang
    Zhang, Jiawei
    Zhang, Liang
    Bai, Xiao
    Ning, Xin
    Zhou, Jun
    Hancock, Edwin
    [J]. PATTERN RECOGNITION, 2022, 124
  • [30] Wang FJH, 2021, Arxiv, DOI [arXiv:2112.05126, DOI 10.48550/ARXIV.2112.05126]