Towards Real-Time Monocular Depth Estimation for Robotics: A Survey

Cited by: 69
Authors
Dong, Xingshuai [1]
Garratt, Matthew A. [1]
Anavatti, Sreenatha G. [1]
Abbass, Hussein A. [1]
Affiliations
[1] Univ New South Wales, Sch Engn & Informat Technol, Canberra, ACT 2612, Australia
Keywords
Estimation; Feature extraction; Robots; Cameras; Structure from motion; Three-dimensional displays; Task analysis; Monocular depth estimation; single image depth estimation; depth prediction; robotics; survey; OBSTACLE DETECTION; PREDICTION; NETWORK;
DOI
10.1109/TITS.2022.3160741
Chinese Library Classification number
TU [Building Science];
Discipline classification code
0813;
Abstract
As an essential component of many autonomous driving and robotic activities such as ego-motion estimation, obstacle avoidance and scene understanding, monocular depth estimation (MDE) has attracted great attention from the computer vision and robotics communities. Over the past decades, a large number of methods have been developed. To the best of our knowledge, however, there is no comprehensive survey of MDE. This paper aims to bridge this gap by reviewing 197 relevant articles published between 1970 and 2021. In particular, we provide a comprehensive survey of MDE covering various methods, introduce the popular performance evaluation metrics and summarize publicly available datasets. We also summarize available open-source implementations of some representative methods and compare their performance. Furthermore, we review the application of MDE in some important robotic tasks. Finally, we conclude this paper by presenting some promising directions for future research. This survey is expected to help readers navigate this research field.
Pages: 16940-16961
Page count: 22