Channel Attention based Iterative Residual Learning for Depth Map Super-Resolution

被引:72
作者
Song, Xibin [1 ,2 ]
Dai, Yuchao [3 ]
Zhou, Dingfu [1 ,2 ]
Liu, Liu [5 ,6 ]
Li, Wei [4 ]
Li, Hongdong [5 ,6 ]
Yang, Ruigang [1 ,2 ,7 ]
机构
[1] Baidu Res, Beijing, Peoples R China
[2] Natl Engn Lab Deep Learning Technol & Applicat, Beijing, Peoples R China
[3] Northwestern Polytech Univ, Xian, Peoples R China
[4] Shandong Univ, Jinan, Peoples R China
[5] Australian Natl Univ, Canberra, ACT, Australia
[6] Australian Ctr Robot Vis, Brisbane, Qld, Australia
[7] Univ Kentucky, Lexington, KY USA
来源
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2020年
关键词
IMAGE; NETWORK;
D O I
10.1109/CVPR42600.2020.00567
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite the remarkable progresses made in deep-learning based depth map super-resolution (DSR), how to tackle real-world degradation in low-resolution (LR) depth maps remains a major challenge. Existing DSR model is generally trained and tested on synthetic dataset, which is very different from what would get from a real depth sensor. In this paper, we argue that DSR models trained under this setting are restrictive and not effective in dealing with real-world DSR tasks. We make two contributions in tackling real-world degradation of different depth sensors. First, we propose to classify the generation of LR depth maps into two types: non-linear downsampling with noise and interval downsampling, for which DSR models are learned correspondingly. Second, we propose a new framework for real-world DSR, which consists of four modules : 1) An iterative residual learning module with deep supervision to learn effective high frequency components of depth maps in a coarse-to-fine manner; 2) A channel attention strategy to enhance channels with abundant high frequency components; 3) A multi-stage fusion module to effectively reexploit the results in the coarse-to-fine process; and 4) A depth refinement module to improve the depth map by TGV regularization and input loss. Extensive experiments on benchmarking datasets demonstrate the superiority of our method over current state-of-the-art DSR methods.
引用
收藏
页码:5630 / 5639
页数:10
相关论文
共 52 条
[31]  
Matsuo K, 2015, PROC CVPR IEEE, P3574, DOI 10.1109/CVPR.2015.7298980
[32]   A MULTIPLE-BASE-LINE STEREO [J].
OKUTOMI, M ;
KANADE, T .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1993, 15 (04) :353-363
[33]   Embedded Block Residual Network: A Recursive Restoration Model for Single-Image Super-Resolution [J].
Qiu, Yajun ;
Wang, Ruxin ;
Tao, Dapeng ;
Cheng, Jun .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :4179-4188
[34]   ATGV-Net: Accurate Depth Super-Resolution [J].
Riegler, Gernot ;
Ruether, Matthias ;
Bischof, Horst .
COMPUTER VISION - ECCV 2016, PT III, 2016, 9907 :268-284
[35]  
Riegler Matthias Ruther Gernot, 2016, P BRIT MACH VIS C BM
[36]   EnhanceNet: Single Image Super-Resolution Through Automated Texture Synthesis [J].
Sajjadi, Mehdi S. M. ;
Schoelkopf, Bernhard ;
Hirsch, Michael .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :4501-4510
[37]   A taxonomy and evaluation of dense two-frame stereo correspondence algorithms [J].
Scharstein, D ;
Szeliski, R .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2002, 47 (1-3) :7-42
[38]   High-Resolution Stereo Datasets with Subpixel-Accurate Ground Truth [J].
Scharstein, Daniel ;
Hirschmuller, Heiko ;
Kitajima, York ;
Krathwohl, Greg ;
Nesic, Nera ;
Wang, Xi ;
Westling, Porter .
PATTERN RECOGNITION, GCPR 2014, 2014, 8753 :31-42
[39]  
Song SR, 2015, PROC CVPR IEEE, P567, DOI 10.1109/CVPR.2015.7298655
[40]   Deeply Supervised Depth Map Super-Resolution as Novel View Synthesis [J].
Song, Xibin ;
Dai, Yuchao ;
Qin, Xueying .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (08) :2323-2336