Boosting lightweight Single Image Super-resolution via Joint-distillation

Cited by: 10
Authors
Luo, Xiaotong [1 ]
Liang, Qiuyuan [1 ]
Liu, Ding [2 ]
Qu, Yanyun [1 ]
Affiliations
[1] Xiamen Univ, Fujian Key Lab Sensing & Comp Smart City, Sch Informat, Comp Sci Dept, Xiamen, Peoples R China
[2] Bytedance Inc, Mountain View, CA USA
Source
PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021 | 2021
Funding
National Natural Science Foundation of China;
Keywords
Image super-resolution; self-distillation; meta-learning; mutual learning; network;
DOI
10.1145/3474085.3475288
CLC number
TP18 [Artificial Intelligence Theory];
Subject classification codes
081104; 0812; 0835; 1405;
Abstract
The rise of deep learning has facilitated the development of single image super-resolution (SISR). However, the growing model complexity and memory occupation severely hinder practical deployment on resource-limited devices. In this paper, we propose a novel joint-distillation (JDSR) framework to boost the representation of various off-the-shelf lightweight SR models. The framework consists of two stages: superior LR generation and joint-distillation learning. The superior LR is obtained from the HR image itself. With fewer than 300K parameters, a peer network taking the superior LR as input achieves SR performance comparable to that of large models, e.g., RCAN with 15M parameters, which justifies feeding the superior LR to the peer network to save training expense. Joint-distillation learning consists of internal self-distillation and external mutual learning. Internal self-distillation aims at model self-boosting by transferring knowledge from deeper SR outputs to shallower ones: each intermediate SR output is supervised by both the HR image and a soft label derived from subsequent deeper outputs. To shrink the capacity gap between shallow and deep layers, a soft label generator is designed in a progressive backward fusion manner, with meta-learning for adaptive weight fine-tuning. External mutual learning obtains interactive information from the peer network during training. Moreover, a curriculum learning strategy and a performance gap threshold are introduced to balance the convergence rates of the original SR model and its peer network. Comprehensive experiments on benchmark datasets demonstrate that our proposal improves the performance of recent lightweight SR models by a large margin, with the same model architecture and inference expense.
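The internal self-distillation described above can be illustrated with a toy NumPy sketch. This is my own minimal reconstruction under stated assumptions, not the authors' code: I assume an L1 pixel loss, a plain weighted sum of hard (HR) and soft (deeper-output) terms with a hypothetical weight `alpha`, and I omit the meta-learned soft label generator, which in the paper replaces the simple "next deeper output" teacher used here.

```python
import numpy as np

def l1(a, b):
    """Mean absolute error between two images (toy L1 pixel loss)."""
    return float(np.mean(np.abs(a - b)))

def self_distillation_loss(intermediate_srs, hr, alpha=0.5):
    """Toy sketch of internal self-distillation: each intermediate SR output
    is supervised by the ground-truth HR image (hard label) and by the next
    deeper output (soft label). `alpha` is a hypothetical balancing weight;
    in a real implementation, gradients through the deeper (teacher) output
    would be stopped, and the paper's soft label generator would fuse several
    deeper outputs with meta-learned weights instead of taking just one.
    """
    total = 0.0
    n = len(intermediate_srs)
    for i, sr in enumerate(intermediate_srs):
        hard = l1(sr, hr)                         # supervision from HR
        if i + 1 < n:
            soft = l1(sr, intermediate_srs[i + 1])  # soft label from deeper output
        else:
            soft = 0.0                            # deepest output: HR only
        total += hard + alpha * soft
    return total
```

For example, with three intermediate outputs that progressively approach the HR target, the shallower outputs incur both hard and soft penalties, while the deepest output is penalized only against HR.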
Pages: 1535-1543
Page count: 9