Prior depth-based multi-view stereo network for online 3D model reconstruction

被引:13
作者
Song, Soohwan [1 ]
Truong, Khang Giang [2 ]
Kim, Daekyum [3 ]
Jo, Sungho [2 ]
机构
[1] ETRI, Intelligent Robot Res Div, Daejeon 34129, South Korea
[2] Korea Adv Inst Sci & Technol, Sch Comp, Daejeon 34141, South Korea
[3] Harvard Univ, John A Paulson Sch Engn & Appl Sci, Cambridge, MA 02138 USA
基金
新加坡国家研究基金会;
关键词
Multi-view stereo; Deep learning; Online 3D reconstruction; SLAM;
D O I
10.1016/j.patcog.2022.109198
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This study addresses the online multi-view stereo (MVS) problem when reconstructing precise 3D mod-els in real time. To solve this problem, most previous studies adopted a motion stereo approach that sequentially estimates depth maps from multiple localized images captured in a local time window. To compute the depth maps quickly, the motion stereo methods process down-sampled images or use a simplified algorithm for cost volume regularization; therefore, they generally produce reconstructed 3D models that are inaccurate. In this paper, we propose a novel online MVS method that accurately re-constructs high-resolution 3D models. This method infers prior depth information based on sequentially estimated depths and leverages it to estimate depth maps more precisely. The method constructs a cost volume by using the prior-depth-based visibility information and then fuses the prior depths into the cost volume. This approach significantly improves the stereo matching performance and completeness of the estimated depths. Extensive experiments showed that the proposed method outperforms other state-of-the-art MVS and motion stereo methods. In particular, it significantly improves the completeness of 3D models.(c) 2022 Elsevier Ltd. All rights reserved.
引用
收藏
页数:12
相关论文
共 39 条
  • [1] Large-Scale Data for Multiple-View Stereopsis
    Aanaes, Henrik
    Jensen, Rasmus Ramsbol
    Vogiatzis, George
    Tola, Engin
    Dahl, Anders Bjorholm
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2016, 120 (02) : 153 - 168
  • [2] Deep Stereo using Adaptive Thin Volume Representation with Uncertainty Awareness
    Cheng, Shuo
    Xu, Zexiang
    Zhu, Shilin
    Li, Zhuwen
    Li, Li Erran
    Ramamoorthi, Ravi
    Su, Hao
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 2521 - 2531
  • [3] Confidence Propagation through CNNs for Guided Sparse Depth Regression
    Eldesokey, Abdelrahman
    Felsberg, Michael
    Khan, Fahad Shahbaz
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020, 42 (10) : 2423 - 2436
  • [4] Blitz-SLAM: A semantic SLAM in dynamic environments
    Fan, Yingchun
    Zhang, Qichi
    Tang, Yuliang
    Liu, Shaofen
    Han, Hong
    [J]. PATTERN RECOGNITION, 2022, 121
  • [5] Accurate, Dense, and Robust Multiview Stereopsis
    Furukawa, Yasutaka
    Ponce, Jean
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2010, 32 (08) : 1362 - 1376
  • [6] Massively Parallel Multiview Stereopsis by Surface Normal Diffusion
    Galliani, Silvano
    Lasinger, Katrin
    Schindler, Konrad
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 873 - 881
  • [7] 3D modeling of multiple-object scenes from sets of images
    Grum, Matthew
    Bors, Adrian G.
    [J]. PATTERN RECOGNITION, 2014, 47 (01) : 326 - 343
  • [8] Cascade Cost Volume for High-Resolution Multi-View Stereo and Stereo Matching
    Gu, Xiaodong
    Fan, Zhiwen
    Zhu, Siyu
    Dai, Zuozhuo
    Tan, Feitong
    Tan, Ping
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 2492 - 2501
  • [9] Learned Multi-Patch Similarity
    Hartmann, Wilfried
    Galliani, Silvano
    Havlena, Michal
    Van Gool, Luc
    Schindler, Konrad
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 1595 - 1603
  • [10] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778