Discrete Cosine Transform Network for Guided Depth Map Super-Resolution

被引:66
作者
Zhao, Zixiang [1 ,2 ]
Zhang, Jiangshe [1 ]
Xu, Shuang [1 ,3 ]
Lin, Zudi [2 ]
Pfister, Hanspeter [2 ]
机构
[1] Xi An Jiao Tong Univ, Xian, Peoples R China
[2] Harvard Univ, Cambridge, MA 02138 USA
[3] Northwestern Polytech Univ, Xian, Peoples R China
来源
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022) | 2022年
基金
中国国家自然科学基金;
关键词
D O I
10.1109/CVPR52688.2022.00561
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Guided depth super-resolution (GDSR) is an essential topic in multi-modal image processing, which reconstructs high-resolution (HR) depth maps from low-resolution ones collected with suboptimal conditions with the help of HR RGB images of the same scene. To solve the challenges in interpreting the working mechanism, extracting cross-modal features and RGB texture over-transferred, we propose a novel Discrete Cosine Transform Network (DCTNet) to alleviate the problems from three aspects. First, the Discrete Cosine Transform (DCT) module reconstructs the multi-channel HR depth features by using DCT to solve the channel-wise optimization problem derived from the image domain. Second, we introduce a semi-coupled feature extraction module that uses shared convolutional kernels to extract common information and private kernels to extract modality-specific information. Third, we employ an edge attention mechanism to highlight the contours informative for guided upsampling. Extensive quantitative and qualitative evaluations demonstrate the effectiveness of our DCTNet, which outperforms previous state-of-the-art methods with a relatively small number of parameters. The code is available at https:// github.com/Zhaozixiang1228/GDSR- DCTNet.
引用
收藏
页码:5687 / 5697
页数:11
相关论文
共 64 条
  • [1] [Anonymous], 2007, CVPR
  • [2] [Anonymous], 2020, CVPR, DOI DOI 10.1109/CVPR42600.2020.00243
  • [3] [Anonymous], CVPR
  • [4] A database and evaluation methodology for optical flow
    Baker, Simon
    Scharstein, Daniel
    Lewis, J. P.
    Roth, Stefan
    Black, Michael J.
    Szeliski, Richard
    [J]. 2007 IEEE 11TH INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS 1-6, 2007, : 588 - 595
  • [5] Depth-Color Fusion Strategy for 3-D Scene Modeling With Kinect
    Camplani, Massimo
    Mantecon, Tomas
    Salgado, Luis
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2013, 43 (06) : 1560 - 1571
  • [6] Deep Coupled ISTA Network for Multi-Modal Image Super-Resolution
    Deng, Xin
    Dragotti, Pier Luigi
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 (29) : 1683 - 1698
  • [7] Deng Xin, 2021, IEEE T PATTERN ANAL, V43, P3333
  • [8] Diebel J., 2005, P 18 INT C NEUR INF, V18, P291
  • [9] Learning a Deep Convolutional Network for Image Super-Resolution
    Dong, Chao
    Loy, Chen Change
    He, Kaiming
    Tang, Xiaoou
    [J]. COMPUTER VISION - ECCV 2014, PT IV, 2014, 8692 : 184 - 199
  • [10] Image Guided Depth Upsampling using Anisotropic Total Generalized Variation
    Ferstl, David
    Reinbacher, Christian
    Ranftl, Rene
    Ruether, Matthias
    Bischof, Horst
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 993 - 1000