PENet: Towards Precise and Efficient Image Guided Depth Completion

Cited by: 144

Authors:

Hu, Mu [1]

Wang, Shuling [1]

Li, Bin [1]

Ning, Shiyu [2]

Fan, Li [2]

Gong, Xiaojin [1]

Affiliations:

[1] Zhejiang Univ, Coll Informat Sci & Elect Engn, Hangzhou, Peoples R China

[2] Hisilicon, Huawei Shanghai, Dept Turing Solut, Shanghai, Peoples R China

Source:

2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021) | 2021

Keywords:

DOI:

10.1109/ICRA48506.2021.9561035

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology]

Discipline classification code:

0812

Abstract:

Image guided depth completion is the task of generating a dense depth map from a sparse depth map and a high-quality image. In this task, how to fuse the color and depth modalities plays an important role in achieving good performance. This paper proposes a two-branch backbone that consists of a color-dominant branch and a depth-dominant branch to exploit and fuse the two modalities thoroughly. More specifically, one branch inputs a color image and a sparse depth map to predict a dense depth map. The other branch takes as inputs the sparse depth map and the previously predicted depth map, and outputs a dense depth map as well. The depth maps predicted from the two branches are complementary to each other and therefore they are adaptively fused. In addition, we also propose a simple geometric convolutional layer to encode 3D geometric cues. The geometric encoded backbone conducts the fusion of different modalities at multiple stages, leading to good depth completion results. We further implement a dilated and accelerated CSPN++ to refine the fused depth map efficiently. The proposed full model ranks 1st in the KITTI depth completion online leaderboard at the time of submission. It also infers much faster than most of the top-ranked methods. The code of this work is available at https://github.com/JUGGHM/PENet.ICRA2021.
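The abstract states that the two branch outputs are adaptively fused but does not give the fusion rule. As a minimal sketch of one common way to do this, the snippet below assumes each branch also produces a per-pixel confidence score and blends the two depth maps with a softmax over those scores. The function name `fuse_depth`, the confidence inputs, and the nested-list representation are illustrative assumptions, not taken from this record or the authors' code.

```python
import math


def fuse_depth(depth_a, conf_a, depth_b, conf_b):
    """Blend two dense depth maps pixel by pixel.

    Assumption (not from the source): each branch supplies a confidence
    score per pixel, and the fusion weights are a softmax over the two scores.
    All inputs are nested lists of floats with identical shape.
    """
    fused = []
    for da_row, ca_row, db_row, cb_row in zip(depth_a, conf_a, depth_b, conf_b):
        row = []
        for da, ca, db, cb in zip(da_row, ca_row, db_row, cb_row):
            m = max(ca, cb)          # subtract the max for numerical stability
            wa = math.exp(ca - m)    # unnormalized weight of branch A
            wb = math.exp(cb - m)    # unnormalized weight of branch B
            row.append((wa * da + wb * db) / (wa + wb))
        fused.append(row)
    return fused


if __name__ == "__main__":
    depth_a = [[10.0, 20.0]]
    depth_b = [[12.0, 30.0]]
    conf_a = [[0.0, 5.0]]
    conf_b = [[0.0, -5.0]]
    print(fuse_depth(depth_a, conf_a, depth_b, conf_b))
```

With equal confidences the result is the plain average of the two predictions; when one branch is far more confident at a pixel, the fused value stays close to that branch's prediction.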


Pages: 13656-13662

Page count: 7
