Semi-Supervised Deep Learning for Monocular Depth Map Prediction

被引:420
作者
Kuznietsov, Yevhen [1 ]
Stuckle, Jorg [1 ]
Leibe, Bastian [1 ]
机构
[1] Rhein Westfal TH Aachen, Visual Comp Inst, Comp Vis Grp, Aachen, Germany
来源
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017) | 2017年
关键词
D O I
10.1109/CVPR.2017.238
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Supervised deep learning often suffers from the lack of sufficient training data. Specifically in the context of monocular depth map prediction, it is barely possible to determine dense ground truth depth images in realistic dynamic outdoor environments. When using LiDAR sensors, for instance, noise is present in the distance measurements, the calibration between sensors cannot be perfect, and the measurements are typically much sparser than the camera images. In this paper, we propose a novel approach to depth map prediction from monocular images that learns in a semi-supervised way. While we use sparse ground-truth depth for supervised learning, we also enforce our deep network to produce photoconsistent dense depth maps in a stereo setup using a direct image alignment loss. In experiments we demonstrate superior performance in depth map prediction from single images compared to the state-of-the-art methods.
引用
收藏
页码:2215 / 2223
页数:9
相关论文
共 27 条
  • [21] Li B, 2015, PROC CVPR IEEE, P1119, DOI 10.1109/CVPR.2015.7298715
  • [22] Toward Holistic Scene Understanding: Feedback Enabled Cascaded Classification Models
    Li, Congcong
    Kowdle, Adarsh
    Saxena, Ashutosh
    Chen, Tsuhan
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (07) : 1394 - 1408
  • [23] Liu FY, 2015, PROC CVPR IEEE, P5162, DOI 10.1109/CVPR.2015.7299152
  • [24] Discrete-Continuous Depth Estimation from a Single Image
    Liu, Miaomiao
    Salzmann, Mathieu
    He, Xuming
    [J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 716 - 723
  • [25] ImageNet Large Scale Visual Recognition Challenge
    Russakovsky, Olga
    Deng, Jia
    Su, Hao
    Krause, Jonathan
    Satheesh, Sanjeev
    Ma, Sean
    Huang, Zhiheng
    Karpathy, Andrej
    Khosla, Aditya
    Bernstein, Michael
    Berg, Alexander C.
    Fei-Fei, Li
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2015, 115 (03) : 211 - 252
  • [26] 3-d depth reconstruction from a single still image
    Saxena, Ashutosh
    Chung, Sung H.
    Ng, Andrew Y.
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2008, 76 (01) : 53 - 69
  • [27] Wang P, 2015, PROC CVPR IEEE, P2800, DOI 10.1109/CVPR.2015.7298897