Stereo-augmented Depth Completion from a Single RGB-LiDAR image

被引:7
作者
Choi, Keunhoon [1 ]
Jeong, Somi [1 ]
Kim, Youngjung [2 ]
Sohn, Kwanghoon [1 ]
机构
[1] Yonsei Univ, Sch Elect & Elect Engn, Seoul 03722, South Korea
[2] Agcy Def Dev ADD, Daejeon 34060, South Korea
来源
2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021) | 2021年
关键词
D O I
10.1109/ICRA48506.2021.9561557
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Depth completion is an important task in computer vision and robotics applications, which aims at predicting accurate dense depth from a single RGB-LiDAR image. Convolutional neural networks (CNNs) have been widely used for depth completion to learn a mapping function from sparse to dense depth. However, recent methods do not exploit any 3D geometric cues during the inference stage and mainly rely on sophisticated CNN architectures. In this paper, we present a cascade and geometrically inspired learning framework for depth completion, consisting of three stages: view extrapolation, stereo matching, and depth refinement. The first stage extrapolates a virtual (right) view using a single RGB (left) and its LiDAR data. We then mimic the binocular stereo-matching, and as a result, explicitly encode geometric constraints during depth completion. This stage augments the final refinement process by providing additional geometric reasoning. We also introduce a distillation framework based on teacher-student strategy to effectively train our network. Knowledge from a teacher model privileged with real stereo pairs is transferred to the student through feature distillation. Experimental results on KITTI depth completion benchmark demonstrate that the proposed method is superior to state-of-the-art methods.
引用
收藏
页码:13641 / 13647
页数:7
相关论文
共 37 条
[1]  
[Anonymous], 2020, AAAI, DOI DOI 10.1109/BIBM49941.2020.9313492
[2]  
[Anonymous], 2017, 3DV, DOI DOI 10.1109/3DV.2017.00012
[3]   Drop an Octave: Reducing Spatial Redundancy in Convolutional Neural Networks with Octave Convolution [J].
Chen, Yunpeng ;
Fan, Haoqi ;
Xu, Bing ;
Yan, Zhicheng ;
Kalantidis, Yannis ;
Rohrbach, Marcus ;
Yan, Shuicheng ;
Feng, Jiashi .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :3434-3443
[4]  
Cheng X., 2018, ECCV
[5]   Noise-Aware Unsupervised Deep Lidar-Stereo Fusion [J].
Cheng, Xuelian ;
Zhong, Yiran ;
Dai, Yuchao ;
Ji, Pan ;
Li, Hongdong .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :6332-6341
[6]  
Cho J., 2019, ARXIV190410230
[7]  
Durica M, 2017, GLOBALIZATION AND ITS SOCIO-ECONOMIC CONSEQUENCES, PTS I - VI, P378
[8]  
Eldesokey A., 2019, IEEE TPAMI
[9]   Vision meets robotics: The KITTI dataset [J].
Geiger, A. ;
Lenz, P. ;
Stiller, C. ;
Urtasun, R. .
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2013, 32 (11) :1231-1237
[10]  
Hinton Geoffrey, 2015, ARXIV