Fast Spatio-Temporal Residual Network for Video Super-Resolution

被引：158

作者：

Li, Sheng ^{[1
]}

He, Fengxiang ^{[2
]}

Du, Bo ^{[1
]}

Zhang, Lefei ^{[1
]}

Xu, Yonghao ^{[3
]}

Tao, Dacheng ^{[2
]}

机构：

[1] Wuhan Univ, Sch Comp Sci, Wuhan, Hubei, Peoples R China

[2] Univ Sydney, UBTECH Sydney AI Ctr, FEIT, SCS, Sydney, Australia

[3] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & R, Wuhan, Hubei, Peoples R China

来源：

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) | 2019年

基金：

澳大利亚研究理事会; 中国国家自然科学基金;

关键词：

CONVOLUTIONAL NETWORK;

D O I：

10.1109/CVPR.2019.01077

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Recently, deep learning based video super-resolution (SR) methods have achieved promising performance. To simultaneously exploit the spatial and temporal information of videos, employing 3-dimensional (3D) convolutions is a natural approach. However, straight utilizing 3D convolutions may lead to an excessively high computational complexity which restricts the depth of video SR models and thus undermine the performance. In this paper, we present a novel fast spatio-temporal residual network (FSTRN) to adopt 3D convolutions for the video SR task in order to enhance the performance while maintaining a low computational load. Specifically, we propose a fast spatio-temporal residual block (FRB) that divide each 3D filter to the product of two 3D filters, which have considerably lower dimensions. Furthermore, we design a cross-space residual learning that directly links the low-resolution space and the high-resolution space, which can greatly relieve the computational burden on the feature fusion and up-scaling parts. Extensive evaluations and comparisons on benchmark datasets validate the strengths of the proposed approach and demonstrate that the proposed network significantly outperforms the current state-of-the-art methods.

引用

页码：10514 / 10523

页数：10

共 46 条

[21] Video Super-Resolution via Bidirectional Recurrent Convolutional Networks [J].

Huang, Yan ;

Wang, Wei ;

Wang, Liang .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :1015-1028

[22]

Huang Yan, 2015, ADV NEURAL INFORM PR

[23]

Jiao J., 2018, AAAI

[24] Deep Video Super-Resolution Network Using Dynamic Upsampling Filters Without Explicit Motion Compensation [J].

Jo, Younghyun ;

Oh, Seoung Wug ;

Kang, Jaeyeon ;

Kim, Seon Joo .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :3224-3232

[25] Video Super-Resolution With Convolutional Neural Networks [J].

Kappeler, Armin ;

Yoo, Seunghwan ;

Dai, Qiqin ;

Katsaggelos, Aggelos K. .

IEEE TRANSACTIONS ON COMPUTATIONAL IMAGING, 2016, 2 (02) :109-122

[26]

Kim J, 2016, PROC CVPR IEEE, P1637, DOI [10.1109/CVPR.2016.181, 10.1109/CVPR.2016.182]

[27]

Kingma DP, 2014, ARXIV

[28] Gradient-based learning applied to document recognition [J].

Lecun, Y ;

Bottou, L ;

Bengio, Y ;

Haffner, P .

PROCEEDINGS OF THE IEEE, 1998, 86 (11) :2278-2324

[29] Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network [J].

Ledig, Christian ;

Theis, Lucas ;

Huszar, Ferenc ;

Caballero, Jose ;

Cunningham, Andrew ;

Acosta, Alejandro ;

Aitken, Andrew ;

Tejani, Alykhan ;

Totz, Johannes ;

Wang, Zehan ;

Shi, Wenzhe .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :105-114

[30] Video Super-Resolution via Deep Draft-Ensemble Learning [J].

Liao, Renjie ;

Tao, Xin ;

Li, Ruiyu ;

Ma, Ziyang ;

Jia, Jiaya .

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :531-539

← 1 2 3 4 5 →