A Two-Streamed Network for Estimating Fine-Scaled Depth Maps from Single RGB Images

被引:165
作者
Li, Jun [1 ,2 ]
Klein, Reinhard [1 ]
Yao, Angela [1 ]
机构
[1] Univ Bonn, Bonn, Germany
[2] Natl Univ Def Technol, Changsha, Hunan, Peoples R China
来源
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) | 2017年
关键词
D O I
10.1109/ICCV.2017.365
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Estimating depth from a single RGB image is an ill-posed and inherently ambiguous problem. State-of-the-art deep learning methods can now estimate accurate 2D depth maps, but when the maps are projected into 3D, they lack local detail and are often highly distorted. We propose a fast-to-train two-streamed CNN that predicts depth and depth gradients, which are then fused together into an accurate and detailed depth map. We also define a novel set loss over multiple images; by regularizing the estimation between a common set of images, the network is less prone to over-fitting and achieves better accuracy than competing methods. Experiments on the NYU Depth v2 dataset shows that our depth predictions are competitive with state-of-the-art and lead to faithful 3D projections.
引用
收藏
页码:3392 / 3400
页数:9
相关论文
共 30 条
[1]  
[Anonymous], ICCV
[2]  
[Anonymous], 2016, NIPS
[3]  
[Anonymous], CVPR
[4]  
[Anonymous], 2015, ICCV
[5]  
[Anonymous], EURASIP J WIRELESS C
[6]  
[Anonymous], 2015, ICCV
[7]  
[Anonymous], ECCV
[8]  
[Anonymous], 2016, ARXIV160600915
[9]  
[Anonymous], 2016, ECCV
[10]  
[Anonymous], 2012, ECCV