Monocular Depth Estimation: a Review of the 2022 State of the Art

被引：5

作者：

Ehret, Thibaud ^{[1
]}

机构：

[1] Univ Paris Saclay, Ctr Borelli, ENS Paris Saclay, Gif Sur Yvette, France

来源：

IMAGE PROCESSING ON LINE | 2023年 / 13卷

关键词：

depth; monocular depth estimation; deep learning; comparison; cutdepth; adabins; 3d; midas; dpt;

D O I：

10.5201/ipol.2023.459

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

We compare five monocular depth estimation methods based on deep learning. This comparison focuses on how well methods generalize rather than a quantitative comparison on a specific dataset. This study shows that while monocular depth estimation methods work well on images similar to training images, they often show artifacts when applied on images out of the training distribution. We evaluate the different methods with images similar to training data and images with unusual point of views (e.g. top-down) or paintings. The readers are invited to judge by themselves about the advantages and drawbacks of all methods by submitting their own images to the online demo associated with the present paper. Source Code The source codes and documentation for the algorithms presented in this paper are available from the web page of this article(1). Usage instructions are included in the README.md file of each archive. The original implementations of the methods are available at the following links: MiDaS and DPT methods(2), Adabins method(3), GLPDepth method(4), 3DShape method(5). This is an MLBriefs article, the source codes have not been reviewed!

引用

页码：38 / 56

页数：19

共 29 条

[1] AdaBins: Depth Estimation Using Adaptive Bins [J].

Bhat, Shariq Farooq ;

Alhashim, Ibraheem ;

Wonka, Peter .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :4008-4017

[2] A Naturalistic Open Source Movie for Optical Flow Evaluation [J].

Butler, Daniel J. ;

Wulff, Jonas ;

Stanley, Garrett B. ;

Black, Michael J. .

COMPUTER VISION - ECCV 2012, PT VI, 2012, 7577 :611-625

[3]

Chen W, 2016, ADV NEUR IN, V29

[4] Investigating Neural Architectures by Synthetic Dataset Design [J].

Courtois, Adrien ;

Morel, Jean-Michel ;

Arias, Pablo .

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, :4886-4895

[5]

Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848

[6]

Eigen D, 2014, ADV NEUR IN, V27

[7] A Point Set Generation Network for 3D Object Reconstruction from a Single Image [J].

Fan, Haoqiang ;

Su, Hao ;

Guibas, Leonidas .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :2463-2471

[8] Deep Ordinal Regression Network for Monocular Depth Estimation [J].

Fu, Huan ;

Gong, Mingming ;

Wang, Chaohui ;

Batmanghelich, Kayhan ;

Tao, Dacheng .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :2002-2011

[9]

Geiger A, 2012, PROC CVPR IEEE, P3354, DOI 10.1109/CVPR.2012.6248074

[10] Deep Residual Learning for Image Recognition [J].

He, Kaiming ;

Zhang, Xiangyu ;

Ren, Shaoqing ;

Sun, Jian .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778

← 1 2 3 →