ARAI-MVSNet: A multi-view stereo depth estimation network with adaptive depth range and depth interval

被引:3
|
作者
Zhang, Song [1 ,2 ,3 ]
Xu, Wenjia [4 ]
Wei, Zhiwei [1 ,2 ]
Zhang, Lili [1 ,2 ]
Wang, Yang [1 ,2 ]
Liu, Junyi [1 ,2 ]
机构
[1] Chinese Acad Sci, Aerosp Informat Res Inst, Beijing 100190, Peoples R China
[2] Chinese Acad Sci, Inst Elect, Key Lab Network Informat Syst Technol NIST, Beijing 100190, Peoples R China
[3] Univ Chinese Acad Sci, Sch Elect Elect & Commun Engn, Beijing 100190, Peoples R China
[4] Beijing Univ Posts & Telecommun, State Key Lab Networking & Switching Technol, Beijing 100876, Peoples R China
关键词
Multi-view stereo; Depth estimation; Adaptive range; Adaptive interval;
D O I
10.1016/j.patcog.2023.109885
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-View Stereo (MVS) is a fundamental problem in geometric computer vision which aims to reconstruct a scene using multi-view images with known camera parameters. However, the mainstream approaches represent the scene with a fixed all-pixel depth range and equal depth interval partition, which will result in inadequate utilization of depth planes and imprecise depth estimation. In this paper, we present a novel multi-stage coarse to-fine framework to achieve adaptive all-pixel depth range and depth interval. We predict a coarse depth map in the first stage, then an Adaptive Depth Range Prediction module is proposed in the second stage to zoom in the scene by leveraging the reference image and the obtained depth map in the first stage and predict a more accurate all-pixel depth range for the following stages. In the third and fourth stages, we propose an Adaptive Depth Interval Adjustment module to achieve adaptive variable interval partition for pixel-wise depth range. The depth interval distribution in this module is normalized by Z-score, which can allocate dense depth hypothesis planes around the potential ground truth depth value and vice versa to achieve more accurate depth estimation. Extensive experiments on four widely used benchmark datasets (DTU, TnT, BlendedMVS, ETH 3D) demonstrate that our model achieves state-of-the-art performance and yields competitive generalization ability. Particularly, our method achieves the highest Acc and Overall on the DTU dataset, while attaining the highest Recall and F1-score on the Tanks and Temples intermediate and advanced dataset. Moreover, our method also achieves the lowest e1 and e3 on the BlendedMVS dataset and the highest Acc and F1-score on the ETH 3D dataset, surpassing all listed methods. Project website: https://github.com/zs670980918/ARAI-MVSNet
引用
收藏
页数:10
相关论文
共 50 条
  • [31] ICV-Net: An identity cost volume network for multi-view stereo depth inference
    He, Pengpeng
    Wang, Yueju
    Wen, Yangsen
    Hu, Yong
    He, Wei
    PATTERN RECOGNITION, 2025, 162
  • [32] REAL-TIME UNSUPERVISED MULTI-VIEW DEPTH ESTIMATION NETWORK FOR VIRTUAL VIEW SYNTHESIS
    Qiu, Ke
    Gu, Song
    Liu, Shiyi
    Lai, Yawen
    Cai, Yangang
    Wang, Ronggang
    2021 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2021,
  • [33] Depth-Guided Optimization of Neural Radiance Fields for Indoor Multi-View Stereo
    Wei, Yi
    Liu, Shaohui
    Zhou, Jie
    Lu, Jiwen
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (09) : 10835 - 10849
  • [34] Attention-Enhanced Multi-View Stereo with Probabilistic Depth Variance Refinement
    Chan, Sixian
    Qiu, Aofeng
    Pan, Wei
    Mao, Jiafa
    PROCEEDINGS OF THE 2024 27 TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, CSCWD 2024, 2024, : 637 - 642
  • [35] DDL-MVS: Depth Discontinuity Learning for Multi-View Stereo Networks
    Ibrahimli, Nail
    Ledoux, Hugo
    Kooij, Julian F. P.
    Nan, Liangliang
    REMOTE SENSING, 2023, 15 (12)
  • [36] A Novel Depth Recovery Approach from Multi-View Stereo Based Focusing
    Xiao, Zhaolin
    Yang, Heng
    Wang, Qing
    Zhou, Guoqing
    2013 INTERNATIONAL CONFERENCE ON VIRTUAL REALITY AND VISUALIZATION (ICVRV 2013), 2013, : 169 - 176
  • [37] LE-MVSNet: Lightweight Efficient Multi-view Stereo Network
    Kong, Changfei
    Zhang, Ziyi
    Mao, Jiafa
    Chan, Sixian
    Sheng, Weigou
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT VIII, 2023, 14261 : 484 - 497
  • [38] Prioritized multi-view stereo depth map generation using confidence prediction
    Mostegel, Christian
    Fraundorfer, Friedrich
    Bischof, Horst
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2018, 143 : 167 - 180
  • [39] Multi-View Stereo Using Graph Cuts-Based Depth Refinement
    Nair, Nirmal S.
    Nair, Madhu S.
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 1903 - 1907
  • [40] Unstructured Multi-view Depth Estimation Using Mask-Based Multiplane Representation
    Hou, Yuxin
    Solin, Arno
    Kannala, Juho
    IMAGE ANALYSIS, 2019, 11482 : 54 - 66