Discrete Cosine Transform Network for Guided Depth Map Super-Resolution

被引：66

作者：

Zhao, Zixiang ^{[1
,2
]}

Zhang, Jiangshe ^{[1
]}

Xu, Shuang ^{[1
,3
]}

Lin, Zudi ^{[2
]}

Pfister, Hanspeter ^{[2
]}

机构：

[1] Xi An Jiao Tong Univ, Xian, Peoples R China

[2] Harvard Univ, Cambridge, MA 02138 USA

[3] Northwestern Polytech Univ, Xian, Peoples R China

来源：

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022) | 2022年

基金：

中国国家自然科学基金;

关键词：

D O I：

10.1109/CVPR52688.2022.00561

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Guided depth super-resolution (GDSR) is an essential topic in multi-modal image processing, which reconstructs high-resolution (HR) depth maps from low-resolution ones collected with suboptimal conditions with the help of HR RGB images of the same scene. To solve the challenges in interpreting the working mechanism, extracting cross-modal features and RGB texture over-transferred, we propose a novel Discrete Cosine Transform Network (DCTNet) to alleviate the problems from three aspects. First, the Discrete Cosine Transform (DCT) module reconstructs the multi-channel HR depth features by using DCT to solve the channel-wise optimization problem derived from the image domain. Second, we introduce a semi-coupled feature extraction module that uses shared convolutional kernels to extract common information and private kernels to extract modality-specific information. Third, we employ an edge attention mechanism to highlight the contours informative for guided upsampling. Extensive quantitative and qualitative evaluations demonstrate the effectiveness of our DCTNet, which outperforms previous state-of-the-art methods with a relatively small number of parameters. The code is available at https:// github.com/Zhaozixiang1228/GDSR- DCTNet.

引用

页码：5687 / 5697

页数：11

共 64 条

[1] [Anonymous], 2007, CVPR
[2] [Anonymous], 2020, CVPR, DOI DOI 10.1109/CVPR42600.2020.00243
[3] [Anonymous], CVPR
[4] A database and evaluation methodology for optical flow
Baker, Simon
Scharstein, Daniel
Lewis, J. P.
Roth, Stefan
Black, Michael J.
Szeliski, Richard
[J]. 2007 IEEE 11TH INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS 1-6, 2007, : 588 - 595
[5] Depth-Color Fusion Strategy for 3-D Scene Modeling With Kinect
Camplani, Massimo
Mantecon, Tomas
Salgado, Luis
[J]. IEEE TRANSACTIONS ON CYBERNETICS, 2013, 43 (06) : 1560 - 1571
[6] Deep Coupled ISTA Network for Multi-Modal Image Super-Resolution
Deng, Xin
Dragotti, Pier Luigi
[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 (29) : 1683 - 1698
[7] Deng Xin, 2021, IEEE T PATTERN ANAL, V43, P3333
[8] Diebel J., 2005, P 18 INT C NEUR INF, V18, P291
[9] Learning a Deep Convolutional Network for Image Super-Resolution
Dong, Chao
Loy, Chen Change
He, Kaiming
Tang, Xiaoou
[J]. COMPUTER VISION - ECCV 2014, PT IV, 2014, 8692 : 184 - 199
[10] Image Guided Depth Upsampling using Anisotropic Total Generalized Variation
Ferstl, David
Reinbacher, Christian
Ranftl, Rene
Ruether, Matthias
Bischof, Horst
[J]. 2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 993 - 1000

← 1 2 3 4 5 6 7 →