Channel Attention based Iterative Residual Learning for Depth Map Super-Resolution

被引:72
作者
Song, Xibin [1 ,2 ]
Dai, Yuchao [3 ]
Zhou, Dingfu [1 ,2 ]
Liu, Liu [5 ,6 ]
Li, Wei [4 ]
Li, Hongdong [5 ,6 ]
Yang, Ruigang [1 ,2 ,7 ]
机构
[1] Baidu Res, Beijing, Peoples R China
[2] Natl Engn Lab Deep Learning Technol & Applicat, Beijing, Peoples R China
[3] Northwestern Polytech Univ, Xian, Peoples R China
[4] Shandong Univ, Jinan, Peoples R China
[5] Australian Natl Univ, Canberra, ACT, Australia
[6] Australian Ctr Robot Vis, Brisbane, Qld, Australia
[7] Univ Kentucky, Lexington, KY USA
来源
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2020年
关键词
IMAGE; NETWORK;
D O I
10.1109/CVPR42600.2020.00567
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite the remarkable progresses made in deep-learning based depth map super-resolution (DSR), how to tackle real-world degradation in low-resolution (LR) depth maps remains a major challenge. Existing DSR model is generally trained and tested on synthetic dataset, which is very different from what would get from a real depth sensor. In this paper, we argue that DSR models trained under this setting are restrictive and not effective in dealing with real-world DSR tasks. We make two contributions in tackling real-world degradation of different depth sensors. First, we propose to classify the generation of LR depth maps into two types: non-linear downsampling with noise and interval downsampling, for which DSR models are learned correspondingly. Second, we propose a new framework for real-world DSR, which consists of four modules : 1) An iterative residual learning module with deep supervision to learn effective high frequency components of depth maps in a coarse-to-fine manner; 2) A channel attention strategy to enhance channels with abundant high frequency components; 3) A multi-stage fusion module to effectively reexploit the results in the coarse-to-fine process; and 4) A depth refinement module to improve the depth map by TGV regularization and input loss. Extensive experiments on benchmarking datasets demonstrate the superiority of our method over current state-of-the-art DSR methods.
引用
收藏
页码:5630 / 5639
页数:10
相关论文
共 52 条
[1]  
[Anonymous], 2019, IEEE INT C COMP VIS
[2]   A database and evaluation methodology for optical flow [J].
Baker, Simon ;
Scharstein, Daniel ;
Lewis, J. P. ;
Roth, Stefan ;
Black, Michael J. ;
Szeliski, Richard .
2007 IEEE 11TH INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS 1-6, 2007, :588-595
[3]  
Barnard Stephen T, 1982, TECHNICAL REPORT
[4]   A Naturalistic Open Source Movie for Optical Flow Evaluation [J].
Butler, Daniel J. ;
Wulff, Jonas ;
Stanley, Garrett B. ;
Black, Michael J. .
COMPUTER VISION - ECCV 2012, PT VI, 2012, 7577 :611-625
[5]   Toward Real-World Single Image Super-Resolution: A New Benchmark and A New Model [J].
Cai, Jianrui ;
Zeng, Hui ;
Yong, Hongwei ;
Cao, Zisheng ;
Zhang, Lei .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :3086-3095
[6]   Second-order Attention Network for Single Image Super-Resolution [J].
Dai, Tao ;
Cai, Jianrui ;
Zhang, Yongbing ;
Xia, Shu-Tao ;
Zhang, Lei .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :11057-11066
[7]   Guided Super-Resolution as Pixel-to-Pixel Transformation [J].
de Lutio, Riccardo ;
D'Aronco, Stefano ;
Wegner, Jan Dirk ;
Schindler, Konrad .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :8828-8836
[8]   Wavelet Domain Style Transfer for an Effective Perception-distortion Tradeoff in Single Image Super-Resolution [J].
Deng, Xin ;
Yang, Ren ;
Xu, Mai ;
Dragotti, Pier Luigi .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :3076-3085
[9]   Learning a Deep Convolutional Network for Image Super-Resolution [J].
Dong, Chao ;
Loy, Chen Change ;
He, Kaiming ;
Tang, Xiaoou .
COMPUTER VISION - ECCV 2014, PT IV, 2014, 8692 :184-199
[10]   Variational Depth Superresolution using Example-Based Edge Representations [J].
Ferstl, David ;
Ruether, Matthias ;
Bischof, Horst .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :513-521