DeCoTR: Enhancing Depth Completion with 2D and 3D Attentions

被引:0
|
作者
Shi, Yunxiao [1 ]
Singh, Manish Kumar [1 ]
Cai, Hong [1 ]
Porikli, Fatih [1 ]
机构
[1] Qualcomm AI Res, San Diego, CA 92121 USA
关键词
LEARNING DEPTH; NETWORK; VISION;
D O I
10.1109/CVPR52733.2024.01021
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we introduce a novel approach that harnesses both 2D and 3D attentions to enable highly accurate depth completion without requiring iterative spatial propagations. Specifically, we first enhance a baseline convolutional depth completion model by applying attention to 2D features in the bottleneck and skip connections. This effectively improves the performance of this simple network and sets it on par with the latest, complex transformer-based models. Leveraging the initial depths and features from this network, we uplift the 2D features to form a 3D point cloud and construct a 3D point transformer to process it, allowing the model to explicitly learn and exploit 3D geometric features. In addition, we propose normalization techniques to process the point cloud, which improves learning and leads to better accuracy than directly using point transformers off the shelf. Furthermore, we incorporate global attention on downsampled point cloud features, which enables long-range context while still being computationally feasible. We evaluate our method, DeCoTR, on established depth completion benchmarks, including NYU Depth V2 and KITTI, showcasing that it sets new state-of-the-art performance. We further conduct zero-shot evaluations on ScanNet and DDAD benchmarks and demonstrate that DeCoTR has superior generalizability compared to existing approaches.
引用
收藏
页码:10736 / 10746
页数:11
相关论文
共 50 条
  • [21] Depth sculpturing for 2D paintings: A progressive depth map completion framework
    Lin, Yu-Hsun
    Tsai, Ming-Hung
    Wu, Ja-Ling
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2014, 25 (04) : 670 - 678
  • [22] 21/2D or 3D?
    Roth, S
    Küster, B
    Sura, H
    KUNSTSTOFFE-PLAST EUROPE, 2004, 94 (07): : 65 - 67
  • [23] 2D and 3D on demand
    Philippi, Anne
    F & M; Feinwerktechnik, Mikrotechnik, Messtechnik, 1998, 106 (06): : 412 - 414
  • [24] From 2D to 3D
    Steven De Feyter
    Nature Chemistry, 2011, 3 (1) : 14 - 15
  • [25] POINT CLOUD COMPLETION BY MINIMIZING PREDICTION ERRORS IN BOTH 2D AND 3D SPACES
    Mok, Yeongheon
    Kim, Mingi
    Kim, Heegwang
    Paik, Joonki
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 3171 - 3175
  • [26] Combining 2D and 3D Datasets with Object-Conditioned Depth Estimation
    Pauls, Jan-Hendrik
    Fehler, Richard
    Lauer, Martin
    Stiller, Christoph
    2022 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2022, : 1194 - 1200
  • [27] 2D but not 3D: Pictorial-depth deficits in a case of visual agnosia
    Turnbull, OH
    Driver, J
    McCarthy, RA
    CORTEX, 2004, 40 (4-5) : 723 - 738
  • [28] 2D to 3D Image Conversion Based on Classification of Background Depth Profiles
    Lin, Guo-Shiang
    Liu, Han-Wen
    Chen, Wei-Chih
    Lie, Wen-Nung
    Huang, Sheng-Yen
    ADVANCES IN IMAGE AND VIDEO TECHNOLOGY, PT II, 2011, 7088 : 381 - +
  • [29] Motion-Based Depth Estimation for 2D to 3D Video Conversion
    Guo, Fan
    Tang, Jin
    Zou, Beiji
    JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2016, 20 (01) : 13 - 25
  • [30] A Depth Extraction Method Based On Motion and Geometry for 2D to 3D Conversion
    Huang, Xiaojun
    Wang, Lianghao
    Huang, Junjun
    Li, Dongxiao
    Zhang, Ming
    2009 THIRD INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY APPLICATION, VOL 3, PROCEEDINGS, 2009, : 294 - 298