Rethinking Depth Estimation for Multi-View Stereo: A Unified Representation

被引:76
|
作者
Peng, Rui [1 ]
Wang, Rongjie [2 ]
Wang, Zhenyu [1 ]
Lai, Yawen [1 ]
Wang, Ronggang [1 ,2 ]
机构
[1] Peking Univ, Sch Elect & Comp Engn, Beijing, Peoples R China
[2] Peng Cheng Lab, Shenzhen, Peoples R China
基金
中国国家自然科学基金;
关键词
SURFACE RECONSTRUCTION;
D O I
10.1109/CVPR52688.2022.00845
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Depth estimation is solved as a regression or classification problem in existing learning-based multi-view stereo methods. Although these two representations have recently demonstrated their excellent performance, they still have apparent shortcomings, e.g., regression methods tend to overfit due to the indirect learning cost volume, and classification methods cannot directly infer the exact depth due to its discrete prediction. In this paper, we propose a novel representation, termed Unification, to unify the advantages of regression and classification. It can directly constrain the cost volume like classification methods, but also realize the sub-pixel depth prediction like regression methods. To excavate the potential of unification, we design a new loss function named Unified Focal Loss, which is more uniform and reasonable to combat the challenge of sample imbalance. Combining these two unburdened modules, we present a coarse-to-fine framework, that we call UniMVSNet. The results of ranking first on both DTU and Tanks and Temples benchmarks verify that our model not only performs the best but also has the best generalization ability.
引用
收藏
页码:8635 / 8644
页数:10
相关论文
共 50 条
  • [41] Multi-View Stereo: A Tutorial
    Furukawa, Yasutaka
    Hernandez, Carlos
    FOUNDATIONS AND TRENDS IN COMPUTER GRAPHICS AND VISION, 2013, 9 (1-2): : 1 - 148
  • [42] Deep Multi-view Depth Estimation with Predicted Uncertainty
    Tong Ke
    Tien Do
    Khiem Vuong
    Sartipi, Kourosh
    Roumeliotis, Stergios, I
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 9235 - 9241
  • [43] Monocular depth estimation with multi-view attention autoencoder
    Jung, Geunho
    Yoon, Sang Min
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (23) : 33759 - 33770
  • [44] Cost Volume Pyramid Based Depth Inference for Multi-View Stereo
    Yang, Jiayu
    Mao, Wei
    Alvarez, Jose M.
    Liu, Miaomiao
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 4876 - 4885
  • [45] Cost Volume Pyramid Based Depth Inference for Multi-View Stereo
    Yang, Jiayu
    Mao, Wei
    Alvarez, Jose
    Liu, Miaomiao
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (09) : 4748 - 4760
  • [46] Monocular depth estimation with multi-view attention autoencoder
    Geunho Jung
    Sang Min Yoon
    Multimedia Tools and Applications, 2022, 81 : 33759 - 33770
  • [47] PDE-based multi-view depth estimation
    Strecha, C
    Van Gool, L
    FIRST INTERNATIONAL SYMPOSIUM ON 3D DATA PROCESSING VISUALIZATION AND TRANSMISSION, 2002, : 416 - 425
  • [48] BEVStereo: Enhancing Depth Estimation in Multi-View 3D Object Detection with Temporal Stereo
    Li, Yinhao
    Bao, Han
    Ge, Zheng
    Yang, Jinrong
    Sun, Jianjian
    Li, Zeming
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 2, 2023, : 1486 - 1494
  • [49] Context-Aware Multi-view Stereo Network for Efficient Edge-Preserving Depth Estimation
    Su, Wanjuan
    Tao, Wenbing
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2025,
  • [50] Multi-View Stereo Representation Revist: Region-Aware MVSNet
    Zhang, Yisu
    Zhu, Jianke
    Lin, Lixiang
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 17376 - 17385