A transformer-CNN parallel network for image guided depth completion

被引:5
|
作者
Li, Tao [1 ]
Dong, Xiucheng [1 ]
Lin, Jie [2 ]
Peng, Yonghong [3 ]
机构
[1] Xihua Univ, Sch Elect Engn & Elect Informat, Chengdu 610039, Peoples R China
[2] Xihua Univ, Sch Aeronaut & Astronaut, Chengdu 610039, Peoples R China
[3] Manchester Metropolitan Univ, Dept Comp & Math, Manchester M1 5GD, England
基金
中国国家自然科学基金;
关键词
Depth completion; Convolutional neural network; Transformer; Token correlation; Conditional random field;
D O I
10.1016/j.patcog.2024.110305
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Image guided depth completion aims to predict a dense depth map from sparse depth measurements and the corresponding single color image. However, most state-of-the-art methods only rely on convolutional neural network (CNN) or transformer. In this paper, we propose a transformer -CNN parallel network (TCPNet) to integrate the advantages of CNN in local detail recovery and transformer in long-range semantic modeling. Specifically, our CNN branch adopts dense connection to strengthen feature propagation. Since the common transformer computes self -attention based on all the tokens in the window, no matter if they are relevant or not, this will inevitably introduce interferences and noises. To improve the self -attention accuracy, we propose a correlation -based transformer to only allow nearest neighbor tokens to participate in the self -attention computation. We also design a multi -scale conditional random field (CRF) module to implement multi -scale high -dimensional filtering for depth refinement. The comprehensive experimental results on KITTI and NYUv2 demonstrate that our method outperforms the state-of-the-art methods.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Hybrid transformer-CNN with boundary-awareness network for 3D medical image segmentation
    He, Jianfei
    Xu, Canhui
    APPLIED INTELLIGENCE, 2023, 53 (23) : 28542 - 28554
  • [22] Dual encoder network with transformer-CNN for multi-organ segmentation
    Hong, Zhifang
    Chen, Mingzhi
    Hu, Weijie
    Yan, Shiyu
    Qu, Aiping
    Chen, Lingna
    Chen, Junxi
    MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2023, 61 (03) : 661 - 671
  • [23] CTRNet: An Automatic Modulation Recognition Based on Transformer-CNN Neural Network
    Zhang, Wenna
    Xue, Kailiang
    Yao, Aiqin
    Sun, Yunqiang
    ELECTRONICS, 2024, 13 (17)
  • [24] Dual encoder network with transformer-CNN for multi-organ segmentation
    Zhifang Hong
    Mingzhi Chen
    Weijie Hu
    Shiyu Yan
    Aiping Qu
    Lingna Chen
    Junxi Chen
    Medical & Biological Engineering & Computing, 2023, 61 : 661 - 671
  • [25] Transformer-CNN hybrid network for improving PET time of flight prediction
    Feng, Xuhui
    Muhashi, Amanjule
    Onishi, Yuya
    Ota, Ryosuke
    Liu, Huafeng
    PHYSICS IN MEDICINE AND BIOLOGY, 2024, 69 (11):
  • [26] Revisiting Sparsity Invariant Convolution: A Network for Image Guided Depth Completion
    Yan, Lin
    Liu, Kai
    Belyaev, Evgeny
    IEEE ACCESS, 2020, 8 : 126323 - 126332
  • [27] TCI-UNet: transformer-CNN interactive module for medical image segmentation
    Bian, Xuan
    Wang, Guanglei
    Li, Yan
    Wang, Hongrui
    BIOMEDICAL OPTICS EXPRESS, 2023, 14 (11) : 5904 - 5920
  • [28] DTCNet: Transformer-CNN Distillation for Super-Resolution of Remote Sensing Image
    Lin, Cong
    Mao, Xin
    Qiu, Chenghao
    Zou, Lilan
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 11117 - 11133
  • [29] TransCNN-HAE: Transformer-CNN Hybrid AutoEncoder for Blind Image Inpainting
    Zhao, Haoru
    Gu, Zhaorui
    Zheng, Bing
    Zheng, Haiyong
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 6813 - 6821
  • [30] An Efficient Latent Style Guided Transformer-CNN Framework for Face Super-Resolution
    Qi, Haoran
    Qiu, Yuwei
    Luo, Xing
    Jin, Zhi
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 1589 - 1599