Depth Estimation from Monocular Images Using Dilated Convolution and Uncertainty Learning

被引:1
作者
Ma, Haojie [1 ,2 ]
Ding, Yinzhang [1 ,2 ]
Wang, Lianghao [1 ,2 ,3 ]
Zhang, Ming [1 ,2 ]
Li, Dongxiao [1 ,2 ]
机构
[1] Zhejiang Univ, Coll Informat Sci & Elect Engn, Hangzhou 310027, Zhejiang, Peoples R China
[2] Zhejiang Prov Key Lab Informat Proc Commun & Netw, Hangzhou 310027, Zhejiang, Peoples R China
[3] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing, Jiangsu, Peoples R China
来源
ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2018, PT II | 2018年 / 11165卷
关键词
Depth estimation; Dilated convolution; Convolutional neural network; Uncertainty;
D O I
10.1007/978-3-030-00767-6_2
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Depth cues are vital in many challenging computer vision tasks. In this paper, we address the problem of dense depth prediction from a single RGB image. Compared with stereo depth estimation, sensing the depth of a scene from monocular images is much more difficult and ambiguous because the epipolar geometry constraints cannot be exploited. In addition, the value of the scale is often unknown in monocular depth prediction. To facilitate an accurate single-view depth prediction, we introduce dilated convolution to capture multi-scale contextual information and then present a deep convolutional neural network. To improve the robustness of the system, we estimate the uncertainty of noisy data by modelling such uncertainty in a new loss function. The experiment results show that the proposed approach outperforms the previous state-of-the-art methods in depth estimation tasks.
引用
收藏
页码:13 / 23
页数:11
相关论文
共 26 条
[1]  
[Anonymous], 2015, PROC CVPR IEEE
[2]  
[Anonymous], 2016, ABS160602147 CORR
[3]  
[Anonymous], 2012, PROC CVPR IEEE
[4]  
[Anonymous], INT C NEUR INF PROC
[5]  
[Anonymous], INT C LEARN REPR CAR
[6]  
Bertasius G, 2015, PROC CVPR IEEE, P4380, DOI 10.1109/CVPR.2015.7299067
[7]  
Eigen D, 2014, ADV NEUR IN, V27
[8]   Predicting Depth, Surface Normals and Semantic Labels with a Common Multi-Scale Convolutional Architecture [J].
Eigen, David ;
Fergus, Rob .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :2650-2658
[9]   Unsupervised CNN for Single View Depth Estimation: Geometry to the Rescue [J].
Garg, Ravi ;
VijayKumar, B. G. ;
Carneiro, Gustavo ;
Reid, Ian .
COMPUTER VISION - ECCV 2016, PT VIII, 2016, 9912 :740-756
[10]   Unsupervised Monocular Depth Estimation with Left-Right Consistency [J].
Godard, Clement ;
Mac Aodha, Oisin ;
Brostow, Gabriel J. .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6602-6611