Channel Attention based Iterative Residual Learning for Depth Map Super-Resolution

被引：72

作者：

Song, Xibin ^{[1
,2
]}

Dai, Yuchao ^{[3
]}

Zhou, Dingfu ^{[1
,2
]}

Liu, Liu ^{[5
,6
]}

Li, Wei ^{[4
]}

Li, Hongdong ^{[5
,6
]}

Yang, Ruigang ^{[1
,2
,7
]}

机构：

[1] Baidu Res, Beijing, Peoples R China

[2] Natl Engn Lab Deep Learning Technol & Applicat, Beijing, Peoples R China

[3] Northwestern Polytech Univ, Xian, Peoples R China

[4] Shandong Univ, Jinan, Peoples R China

[5] Australian Natl Univ, Canberra, ACT, Australia

[6] Australian Ctr Robot Vis, Brisbane, Qld, Australia

[7] Univ Kentucky, Lexington, KY USA

来源：

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2020年

关键词：

IMAGE; NETWORK;

D O I：

10.1109/CVPR42600.2020.00567

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Despite the remarkable progresses made in deep-learning based depth map super-resolution (DSR), how to tackle real-world degradation in low-resolution (LR) depth maps remains a major challenge. Existing DSR model is generally trained and tested on synthetic dataset, which is very different from what would get from a real depth sensor. In this paper, we argue that DSR models trained under this setting are restrictive and not effective in dealing with real-world DSR tasks. We make two contributions in tackling real-world degradation of different depth sensors. First, we propose to classify the generation of LR depth maps into two types: non-linear downsampling with noise and interval downsampling, for which DSR models are learned correspondingly. Second, we propose a new framework for real-world DSR, which consists of four modules : 1) An iterative residual learning module with deep supervision to learn effective high frequency components of depth maps in a coarse-to-fine manner; 2) A channel attention strategy to enhance channels with abundant high frequency components; 3) A multi-stage fusion module to effectively reexploit the results in the coarse-to-fine process; and 4) A depth refinement module to improve the depth map by TGV regularization and input loss. Extensive experiments on benchmarking datasets demonstrate the superiority of our method over current state-of-the-art DSR methods.

引用

页码：5630 / 5639

页数：10

共 52 条

[31]

Matsuo K, 2015, PROC CVPR IEEE, P3574, DOI 10.1109/CVPR.2015.7298980

[32] A MULTIPLE-BASE-LINE STEREO [J].

OKUTOMI, M ;

KANADE, T .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1993, 15 (04) :353-363

[33] Embedded Block Residual Network: A Recursive Restoration Model for Single-Image Super-Resolution [J].

Qiu, Yajun ;

Wang, Ruxin ;

Tao, Dapeng ;

Cheng, Jun .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :4179-4188

[34] ATGV-Net: Accurate Depth Super-Resolution [J].

Riegler, Gernot ;

Ruether, Matthias ;

Bischof, Horst .

COMPUTER VISION - ECCV 2016, PT III, 2016, 9907 :268-284

[35]

Riegler Matthias Ruther Gernot, 2016, P BRIT MACH VIS C BM

[36] EnhanceNet: Single Image Super-Resolution Through Automated Texture Synthesis [J].

Sajjadi, Mehdi S. M. ;

Schoelkopf, Bernhard ;

Hirsch, Michael .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :4501-4510

[37] A taxonomy and evaluation of dense two-frame stereo correspondence algorithms [J].

Scharstein, D ;

Szeliski, R .

INTERNATIONAL JOURNAL OF COMPUTER VISION, 2002, 47 (1-3) :7-42

[38] High-Resolution Stereo Datasets with Subpixel-Accurate Ground Truth [J].

Scharstein, Daniel ;

Hirschmuller, Heiko ;

Kitajima, York ;

Krathwohl, Greg ;

Nesic, Nera ;

Wang, Xi ;

Westling, Porter .

PATTERN RECOGNITION, GCPR 2014, 2014, 8753 :31-42

[39]

Song SR, 2015, PROC CVPR IEEE, P567, DOI 10.1109/CVPR.2015.7298655

[40] Deeply Supervised Depth Map Super-Resolution as Novel View Synthesis [J].

Song, Xibin ;

Dai, Yuchao ;

Qin, Xueying .

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (08) :2323-2336

← 1 2 3 4 5 6 →