Structured Sparsity Learning for Efficient Video Super-Resolution

被引：18

作者：

Xia, Bin ^{[1
]}

He, Jingwen ^{[2
]}

Zhang, Yulun ^{[3
]}

Wang, Yitong ^{[4
]}

Tian, Yapeng ^{[5
]}

Yang, Wenming ^{[1
]}

Van Gool, Luc ^{[3
]}

机构：

[1] Tsinghua Univ, Beijing, Peoples R China

[2] Shanghai AI Lab, Shanghai, Peoples R China

[3] Swiss Fed Inst Technol, Zurich, Switzerland

[4] ByteDance Inc, Beijing, Peoples R China

[5] Univ Texas Dallas, Dallas, TX USA

来源：

2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2023年

关键词：

D O I：

10.1109/CVPR52729.2023.02168

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The high computational costs of video super-resolution (VSR) models hinder their deployment on resource-limited devices, e.g., smartphones and drones. Existing VSR models contain considerable redundant filters, which drag down the inference efficiency. To prune these unimportant filters, we develop a structured pruning scheme called Structured Sparsity Learning (SSL) according to the properties of VSR. In SSL, we design pruning schemes for several key components in VSR models, including residual blocks, recurrent networks, and upsampling networks. Specifically, we develop a Residual Sparsity Connection (RSC) scheme for residual blocks of recurrent networks to liberate pruning restrictions and preserve the restoration information. For upsampling networks, we design a pixel-shuffle pruning scheme to guarantee the accuracy of feature channel-space conversion. In addition, we observe that pruning error would be amplified as the hidden states propagate along with recurrent networks. To alleviate the issue, we design Temporal Finetuning (TF). Extensive experiments show that SSL can significantly outperform recent methods quantitatively and qualitatively. The code is available at https://github.com/Zj-BinXia/SSL.

引用

页码：22638 / 22647

页数：10

共 56 条

[1]

[Anonymous], ICCVW

[2]

[Anonymous], ICCV

[3]

[Anonymous], 2019, CVPR, DOI DOI 10.1109/CVPR.2019.01077

[4] Real-Time Video Super-Resolution with Spatio-Temporal Networks and Motion Compensation [J].

Caballero, Jose ;

Ledig, Christian ;

Aitken, Andrew ;

Acosta, Alejandro ;

Totz, Johannes ;

Wang, Zehan ;

Shi, Wenzhe .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :2848-2857

[5]

Chan Kelvin CK, 2021, CVPR

[6]

Chan Kelvin CK, 2021, ARXIV210413371

[7] Guest Editorial: Special Issue on Computational Intelligence for Cloud Computing [J].

Cheng, H. ;

Yang, S. ;

Yao, X. ;

Zhang, M. .

IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2018, 2 (01) :1-2

[8]

Cheng Jian, 2018, FRONTIERS INFORM TEC, V1, P2

[9] Deformable Convolutional Networks [J].

Dai, Jifeng ;

Qi, Haozhi ;

Xiong, Yuwen ;

Li, Yi ;

Zhang, Guodong ;

Hu, Han ;

Wei, Yichen .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :764-773

[10]

Ding X., 2019, ICML

← 1 2 3 4 5 6 →