ARAI-MVSNet: A multi-view stereo depth estimation network with adaptive depth range and depth interval

被引:3
|
作者
Zhang, Song [1 ,2 ,3 ]
Xu, Wenjia [4 ]
Wei, Zhiwei [1 ,2 ]
Zhang, Lili [1 ,2 ]
Wang, Yang [1 ,2 ]
Liu, Junyi [1 ,2 ]
机构
[1] Chinese Acad Sci, Aerosp Informat Res Inst, Beijing 100190, Peoples R China
[2] Chinese Acad Sci, Inst Elect, Key Lab Network Informat Syst Technol NIST, Beijing 100190, Peoples R China
[3] Univ Chinese Acad Sci, Sch Elect Elect & Commun Engn, Beijing 100190, Peoples R China
[4] Beijing Univ Posts & Telecommun, State Key Lab Networking & Switching Technol, Beijing 100876, Peoples R China
关键词
Multi-view stereo; Depth estimation; Adaptive range; Adaptive interval;
D O I
10.1016/j.patcog.2023.109885
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-View Stereo (MVS) is a fundamental problem in geometric computer vision which aims to reconstruct a scene using multi-view images with known camera parameters. However, the mainstream approaches represent the scene with a fixed all-pixel depth range and equal depth interval partition, which will result in inadequate utilization of depth planes and imprecise depth estimation. In this paper, we present a novel multi-stage coarse to-fine framework to achieve adaptive all-pixel depth range and depth interval. We predict a coarse depth map in the first stage, then an Adaptive Depth Range Prediction module is proposed in the second stage to zoom in the scene by leveraging the reference image and the obtained depth map in the first stage and predict a more accurate all-pixel depth range for the following stages. In the third and fourth stages, we propose an Adaptive Depth Interval Adjustment module to achieve adaptive variable interval partition for pixel-wise depth range. The depth interval distribution in this module is normalized by Z-score, which can allocate dense depth hypothesis planes around the potential ground truth depth value and vice versa to achieve more accurate depth estimation. Extensive experiments on four widely used benchmark datasets (DTU, TnT, BlendedMVS, ETH 3D) demonstrate that our model achieves state-of-the-art performance and yields competitive generalization ability. Particularly, our method achieves the highest Acc and Overall on the DTU dataset, while attaining the highest Recall and F1-score on the Tanks and Temples intermediate and advanced dataset. Moreover, our method also achieves the lowest e1 and e3 on the BlendedMVS dataset and the highest Acc and F1-score on the ETH 3D dataset, surpassing all listed methods. Project website: https://github.com/zs670980918/ARAI-MVSNet
引用
收藏
页数:10
相关论文
共 50 条
  • [1] MVSNet: Depth Inference for Unstructured Multi-view Stereo
    Yao, Yao
    Luo, Zixin
    Li, Shiwei
    Fang, Tian
    Quan, Long
    COMPUTER VISION - ECCV 2018, PT VIII, 2018, 11212 : 785 - 801
  • [2] Adaptive depth estimation for pyramid multi-view stereo
    Liao, Jie
    Fu, Yanping
    Yan, Qingan
    Luo, Fei
    Xiao, Chunxia
    COMPUTERS & GRAPHICS-UK, 2021, 97 : 268 - 278
  • [3] Uncertainty Guided Multi-View Stereo Network for Depth Estimation
    Su, Wanjuan
    Xu, Qingshan
    Tao, Wenbing
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (11) : 7796 - 7808
  • [4] IAFMVS: Iterative Depth Estimation with Adaptive Features for Multi-View Stereo
    Zhao, Guyu
    Wei, Huyixin
    He, Hongdou
    NEUROCOMPUTING, 2025, 629
  • [5] DS-MVSNet: Unsupervised Multi-view Stereo via Depth Synthesis
    Li, Jingliang
    Lu, Zhengda
    Wang, Yiqun
    Wang, Ying
    Xiao, Jun
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 5593 - 5601
  • [6] Unsupervised multi-view stereo network based on multi-stage depth estimation
    Qi, Shuai
    Sang, Xinzhu
    Yan, Binbin
    Wang, Peng
    Chen, Duo
    Wang, Huachun
    Ye, Xiaoqian
    IMAGE AND VISION COMPUTING, 2022, 122
  • [7] FADE: Feature Aggregation for Depth Estimation With Multi-View Stereo
    Yang, Hsiao-Chien
    Chen, Po-Heng
    Chen, Kuan-Wen
    Lee, Chen-Yi
    Chen, Yong-Sheng
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 6590 - 6600
  • [8] REVISED DEPTH MAP ESTIMATION FOR MULTI-VIEW STEREO
    Yao, Yao
    Zhu, Hao
    Nie, Yongming
    Ji, Xiaoli
    Cao, Xun
    2014 INTERNATIONAL CONFERENCE ON 3D IMAGING (IC3D), 2014,
  • [9] A cascade network with adaptive depth hypotheses estimation for multi-view stereo and image three-dimensional reconstruction
    Wang, Dong
    Liu, Zhong
    Yue, Haosong
    Wu, Xingming
    Chen, Weihai
    2024 IEEE 19TH CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS, ICIEA 2024, 2024,
  • [10] Depth Estimation in Multi-View Stereo Based on Image Pyramid
    Xu, Hanfei
    Cai, Yangang
    Wang, Ronggang
    PROCEEDINGS OF 2018 THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ARTIFICIAL INTELLIGENCE (CSAI 2018) / 2018 THE 10TH INTERNATIONAL CONFERENCE ON INFORMATION AND MULTIMEDIA TECHNOLOGY (ICIMT 2018), 2018, : 345 - 349