An Effective Multi-Task Two-Stage Network with the Cross-Scale Training Strategy for Multi-Scale Image Super Resolution

被引：2

作者：

Yang, Jucheng ^{[1
,3
]}

Wei, Feng ^{[1
,2
]}

Bai, Yaxin ^{[1
]}

Zuo, Meiran ^{[1
]}

Sun, Xiao ^{[1
]}

Chen, Yarui ^{[1
]}

机构：

[1] Tianjin Univ Sci & Technol, Coll Artificial Intelligence, Tianjin 300457, Peoples R China

[2] Tianjin Univ Sci & Technol, Coll Mech Engn, Tianjin 300457, Peoples R China

[3] 9 13th St, Tianjin 300457, Peoples R China

来源：

ELECTRONICS | 2021年 / 10卷 / 19期

关键词：

CNN; per-pixel loss; HVS; EMTCM; multi-task co-optimization; cross-scale training; QUALITY ASSESSMENT;

D O I：

10.3390/electronics10192434

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Convolutional neural networks and the per-pixel loss function have shown their potential to be the best combination for super-resolving severely degraded images. However, there are still challenges, such as the massive number of parameters requiring prohibitive memory and vast computing and storage resources as well as time-consuming training and testing. What is more, the per-pixel loss measured by L2 and the Peak Signal-to-Noise Ratio do not correlate well with human perception of image quality, since L2 simply does not capture the intricate characteristics of human visual systems. To address these issues, we propose an effective two-stage hourglass network with multi-task co-optimization, which enables the entire network to focus on training and testing time and inherent image patterns such as local luminance, contrast, structure and data distribution. Moreover, to avoid overwhelming memory overheads, our model is capable of performing real-time single image multi-scale super-resolution, so it is memory-friendly, meaning that memory space is utilized efficiently. In addition, in order to best use the underlying structure and perception of image quality and the intermediate estimates during the inference process, we introduce a cross-scale training strategy with 2x, 3x and 4x image super-resolution. This effective multi-task two-stage network with the cross-scale strategy for multi-scale image super-resolution is named EMTCM. Quantitative and qualitative experiment results show that the proposed EMTCM network outperforms state-of-the-art methods in recovering high-quality images.

引用

页数：12

共 43 条

[1]

Ajith M., 2020, ARXIV200404093

[2]

Bigdeli SA, 2017, ADV NEUR IN, V30

[3] Attention-Aware Face Hallucination via Deep Reinforcement Learning [J].

Cao, Qingxing ;

Lin, Liang ;

Shi, Yukai ;

Liang, Xiaodan ;

Li, Guanbin .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1656-1664

[4] Accelerating the Super-Resolution Convolutional Neural Network [J].

Dong, Chao ;

Loy, Chen Change ;

Tang, Xiaoou .

COMPUTER VISION - ECCV 2016, PT II, 2016, 9906 :391-407

[5] Image Super-Resolution Using Deep Convolutional Networks [J].

Dong, Chao ;

Loy, Chen Change ;

He, Kaiming ;

Tang, Xiaoou .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (02) :295-307

[6] Learning a Deep Convolutional Network for Image Super-Resolution [J].

Dong, Chao ;

Loy, Chen Change ;

He, Kaiming ;

Tang, Xiaoou .

COMPUTER VISION - ECCV 2014, PT IV, 2014, 8692 :184-199

[7] Nonlocally Centralized Sparse Representation for Image Restoration [J].

Dong, Weisheng ;

Zhang, Lei ;

Shi, Guangming ;

Li, Xin .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 2013, 22 (04) :1618-1628

[8]

Farooq M. A., 2021, C MULTIMEDIA INTERAC, V1376, P79, DOI [10.48550/arXiv.2107.04133, DOI 10.48550/ARXIV.2107.04133]

[9] Closed-loop Matters: Dual Regression Networks for Single Image Super-Resolution [J].

Guo, Yong ;

Chen, Jian ;

Wang, Jingdong ;

Chen, Qi ;

Cao, Jiezhang ;

Deng, Zeshuai ;

Xu, Yanwu ;

Tan, Mingkui .

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :5406-5415

[10]

Haris M., 2018, NEURAL INFORM PROCES

← 1 2 3 4 5 →