Fast Spatio-Temporal Residual Network for Video Super-Resolution

被引:158
作者
Li, Sheng [1 ]
He, Fengxiang [2 ]
Du, Bo [1 ]
Zhang, Lefei [1 ]
Xu, Yonghao [3 ]
Tao, Dacheng [2 ]
机构
[1] Wuhan Univ, Sch Comp Sci, Wuhan, Hubei, Peoples R China
[2] Univ Sydney, UBTECH Sydney AI Ctr, FEIT, SCS, Sydney, Australia
[3] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & R, Wuhan, Hubei, Peoples R China
来源
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) | 2019年
基金
澳大利亚研究理事会; 中国国家自然科学基金;
关键词
CONVOLUTIONAL NETWORK;
D O I
10.1109/CVPR.2019.01077
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, deep learning based video super-resolution (SR) methods have achieved promising performance. To simultaneously exploit the spatial and temporal information of videos, employing 3-dimensional (3D) convolutions is a natural approach. However, straight utilizing 3D convolutions may lead to an excessively high computational complexity which restricts the depth of video SR models and thus undermine the performance. In this paper, we present a novel fast spatio-temporal residual network (FSTRN) to adopt 3D convolutions for the video SR task in order to enhance the performance while maintaining a low computational load. Specifically, we propose a fast spatio-temporal residual block (FRB) that divide each 3D filter to the product of two 3D filters, which have considerably lower dimensions. Furthermore, we design a cross-space residual learning that directly links the low-resolution space and the high-resolution space, which can greatly relieve the computational burden on the feature fusion and up-scaling parts. Extensive evaluations and comparisons on benchmark datasets validate the strengths of the proposed approach and demonstrate that the proposed network significantly outperforms the current state-of-the-art methods.
引用
收藏
页码:10514 / 10523
页数:10
相关论文
共 46 条
[1]  
[Anonymous], P 3 INT C LEARNING R
[2]  
[Anonymous], 2013, IEEE T PATTERN ANAL, DOI DOI 10.1109/TPAMI.2012.59
[3]  
[Anonymous], PROC CVPR IEEE
[4]  
[Anonymous], CORR
[5]  
[Anonymous], 2017, COMMUN ACM, DOI DOI 10.1145/3065386
[6]  
[Anonymous], CVPR
[7]  
[Anonymous], 2017, CORR
[8]  
[Anonymous], 2017, CORR
[9]  
Bartlett P. L., 2003, Journal of Machine Learning Research, V3, P463, DOI 10.1162/153244303321897690
[10]  
Bartlett Peter L., 2017, Advances in Neural Information Processing Systems, V30