CostDCNet: Cost Volume Based Depth Completion for a Single RGB-D Image

被引:15
作者
Kam, Jaewon [1 ]
Kim, Jungeon [1 ]
Kim, Soongjin [1 ]
Park, Jaesik [1 ]
Lee, Seungyong [1 ]
机构
[1] POSTECH, Pohang Si, South Korea
来源
COMPUTER VISION - ECCV 2022, PT II | 2022年 / 13662卷
关键词
Depth completion; Cost volume; 3D convolution; Single RGB-D image;
D O I
10.1007/978-3-031-20086-1_15
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Successful depth completion from a single RGB-D image requires both extracting plentiful 2D and 3D features and merging these heterogeneous features appropriately. We propose a novel depth completion framework, CostDCNet, based on the cost volume-based depth estimation approach that has been successfully employed for multi-view stereo (MVS). The key to high-quality depth map estimation in the approach is constructing an accurate cost volume. To produce a quality cost volume tailored to single-view depth completion, we present a simple but effective architecture that can fully exploit the 3D information, three options to make an RGB-D feature volume, and per-plane pixel shuffle for efficient volume upsampling. Our CostDCNet framework consists of lightweight deep neural networks (similar to 1.8M parameters), running in real time (similar to 30 ms). Nevertheless, thanks to our simple but effective design, CostDCNet demonstrates depth completion results comparable to or better than the state-of-the-art methods.
引用
收藏
页码:257 / 274
页数:18
相关论文
共 49 条
[1]  
Boulch A., 2020, P ASIAN C COMPUTER V
[2]   SELF-SUPERVISED DEPTH COMPLETION VIA ADAPTIVE SAMPLING AND RELATIVE CONSISTENCY [J].
Cao, Zidong ;
Li, Ang ;
Yuan, Zejian .
2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, :3263-3267
[3]  
Chang AE, 2017, Arxiv, DOI arXiv:1709.06158
[4]  
Chen Y, 2019, IEEE I CONF COMP VIS, P10022, DOI [10.1109/ICCV.2019.01012, 10.1109/iccv.2019.01012]
[5]   Learning Depth with Convolutional Spatial Propagation Network [J].
Cheng, Xinjing ;
Wang, Peng ;
Yang, Ruigang .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020, 42 (10) :2361-2379
[6]   Depth Estimation via Affinity Learned with Convolutional Spatial Propagation Network [J].
Cheng, Xinjing ;
Wang, Peng ;
Yang, Ruigang .
COMPUTER VISION - ECCV 2018, PT XVI, 2018, 11220 :108-125
[7]  
Cheng XJ, 2020, AAAI CONF ARTIF INTE, V34, P10615
[8]   4D Spatio-Temporal ConvNets: Minkowski Convolutional Neural Networks [J].
Choy, Christopher ;
Gwak, JunYoung ;
Savarese, Silvio .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :3070-3079
[9]  
Cicek Ozgun, 2016, Medical Image Computing and Computer-Assisted Intervention - MICCAI 2016. 19th International Conference. Proceedings: LNCS 9901, P424, DOI 10.1007/978-3-319-46723-8_49
[10]   3DMV: Joint 3D-Multi-view Prediction for 3D Semantic Scene Segmentation [J].
Dai, Angela ;
Niessner, Matthias .
COMPUTER VISION - ECCV 2018, PT X, 2018, 11214 :458-474