Learning for Unconstrained Space-Time Video Super-Resolution

被引:27
作者
Shi, Zhihao [1 ]
Liu, Xiaohong [2 ]
Li, Chengqi [3 ]
Dai, Linhui [1 ]
Chen, Jun [1 ]
Davidson, Timothy N. [1 ]
Zhao, Jiying [4 ]
机构
[1] McMaster Univ, Dept Elect & Comp Engn, Hamilton, ON L8S 4K1, Canada
[2] Shanghai Jiao Tong Univ, John Hoperoft Ctr, Shanghai 200240, Peoples R China
[3] McMaster Univ, Dept Comp & Software, Hamilton, ON L8S 4K1, Canada
[4] Univ Ottawa, Sch Elect Engn & Comp Sci, Ottawa, ON K1N 6N5, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Superresolution; Optical imaging; Spatial resolution; Kernel; Convolution; Interpolation; Deconvolution; Space-time video super-resolution; arbitrary temporal; spatial factors; optical flow; generalized pixelshuffle layer; IMAGE SUPERRESOLUTION; NETWORK; RESOLUTION;
D O I
10.1109/TBC.2021.3131875
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Recent years have seen considerable research activities devoted to video enhancement that simultaneously increases temporal frame rate and spatial resolution. However, the existing methods either fail to explore the intrinsic relationship between temporal and spatial information or lack flexibility in the choice of final temporal/spatial resolution. In this work, we propose an unconstrained space-time video super-resolution network, which can effectively exploit space-time correlation to boost performance. Moreover, it has complete freedom in adjusting the temporal frame rate and spatial resolution through the use of the optical flow technique and a generalized pixelshuffle operation. Our extensive experiments demonstrate that the proposed method not only outperforms the state-of-the-art, but also requires far fewer parameters and less running time.
引用
收藏
页码:345 / 358
页数:14
相关论文
共 70 条
[21]   Perceptual Losses for Real-Time Style Transfer and Super-Resolution [J].
Johnson, Justin ;
Alahi, Alexandre ;
Li Fei-Fei .
COMPUTER VISION - ECCV 2016, PT II, 2016, 9906 :694-711
[22]   Content-Aware Summarization of Broadcast Sports Videos: An Audio-Visual Feature Extraction Approach [J].
Khan, Abdullah Aman ;
Shao, Jie ;
Ali, Waqar ;
Tumrani, Saifullah .
NEURAL PROCESSING LETTERS, 2020, 52 (03) :1945-1968
[23]  
Kim J, 2016, PROC CVPR IEEE, P1637, DOI [10.1109/CVPR.2016.181, 10.1109/CVPR.2016.182]
[24]  
King DB, 2015, ACS SYM SER, V1214, P1, DOI 10.1021/bk-2015-1214.ch001
[25]   AdaCoF: Adaptive Collaboration of Flows for Video Frame Interpolation [J].
Lee, Hyeongmin ;
Kim, Taeoh ;
Chung, Tae-young ;
Pak, Daehyun ;
Ban, Yuseok ;
Lee, Sangyoun .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :5315-5324
[26]   Deep Multi-Domain Prediction for 3D Video Coding [J].
Lei, Jianjun ;
Shi, Yanan ;
Pan, Zhaoqing ;
Liu, Dong ;
Jin, Dengchao ;
Chen, Ying ;
Ling, Nam .
IEEE TRANSACTIONS ON BROADCASTING, 2021, 67 (04) :813-823
[27]   Traffic sign detection and recognition based on pyramidal convolutional networks [J].
Liang, Zhenwen ;
Shao, Jie ;
Zhang, Dongyang ;
Gao, Lianli .
NEURAL COMPUTING & APPLICATIONS, 2020, 32 (11) :6533-6543
[28]  
Liu X., 2020, P IEEE CVF WINT C AP, P2416
[29]   Exploit Camera Raw Data for Video Super- Resolution via Hidden Markov Model Inference [J].
Liu, Xiaohong ;
Shi, Kangdi ;
Wang, Zhe ;
Chen, Jun .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 :2127-2140
[30]   Robust Multi-Frame Super-Resolution Based on Spatially Weighted Half-Quadratic Estimation and Adaptive BTV Regularization [J].
Liu, Xiaohong ;
Chen, Lei ;
Wang, Wenyi ;
Zhao, Jiying .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (10) :4971-4986