Video Super-resolution with Temporal Group Attention

被引:151
作者
Isobe, Takashi [1 ,2 ]
Li, Songjiang [2 ]
Jia, Xu [2 ]
Yuan, Shanxin [2 ]
Slabaugh, Gregory [2 ]
Xu, Chunjing [2 ]
Li, Ya-Li [1 ]
Wang, Shengjin [1 ]
Tian, Qi [2 ]
机构
[1] Tsinghua Univ, Dept Elect Engn, Beijing, Peoples R China
[2] Huawei Technol, Noahs Ark Lab, Shenzhen, Peoples R China
来源
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020) | 2020年
关键词
D O I
10.1109/CVPR42600.2020.00803
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video super-resolution, which aims at producing a high-resolution video from its corresponding low-resolution version, has recently drawn increasing attention. In this work, we propose a novel method that can effectively incorporate temporal information in a hierarchical way. The input sequence is divided into several groups, with each one corresponding to a kind of frame rate. These groups provide complementary information to recover missing details in the reference frame, which is further integrated with an attention module and a deep intra-group fusion module. In addition, a fast spatial alignment is proposed to handle videos with large motion. Extensive results demonstrate the capability of the proposed model in handling videos with various motion. It achieves favorable performance against state-of-the-art methods on several benchmark datasets. Code is available at https://github.com/junpan19/VSR_TGA.
引用
收藏
页码:8005 / 8014
页数:10
相关论文
共 35 条
[21]   Real-Time Single Image and Video Super-Resolution Using an Efficient Sub-Pixel Convolutional Neural Network [J].
Shi, Wenzhe ;
Caballero, Jose ;
Huszar, Ferenc ;
Totz, Johannes ;
Aitken, Andrew P. ;
Bishop, Rob ;
Rueckert, Daniel ;
Wang, Zehan .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :1874-1883
[22]  
Song SJ, 2017, AAAI CONF ARTIF INTE, P4263
[23]   Detail-revealing Deep Video Super-resolution [J].
Tao, Xin ;
Gao, Hongyun ;
Liao, Renjie ;
Wang, Jue ;
Jia, Jiaya .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :4482-4490
[24]   TDAN: Temporally-Deformable Alignment Network for Video Super-Resolution [J].
Tian, Yapeng ;
Zhang, Yulun ;
Fu, Yun ;
Xu, Chenliang .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :3357-3366
[25]   NTIRE 2017 Challenge on Single Image Super-Resolution: Methods and Results [J].
Timofte, Radu ;
Agustsson, Eirikur ;
Van Gool, Luc ;
Yang, Ming-Hsuan ;
Zhang, Lei ;
Lim, Bee ;
Son, Sanghyun ;
Kim, Heewon ;
Nah, Seungjun ;
Lee, Kyoung Mu ;
Wang, Xintao ;
Tian, Yapeng ;
Yu, Ke ;
Zhang, Yulun ;
Wu, Shixiang ;
Dong, Chao ;
Lin, Liang ;
Qiao, Yu ;
Loy, Chen Change ;
Bae, Woong ;
Yoo, Jaejun ;
Han, Yoseob ;
Ye, Jong Chul ;
Choi, Jae-Seok ;
Kim, Munchurl ;
Fan, Yuchen ;
Yu, Jiahui ;
Han, Wei ;
Liu, Ding ;
Yu, Haichao ;
Wang, Zhangyang ;
Shi, Honghui ;
Wang, Xinchao ;
Huang, Thomas S. ;
Chen, Yunjin ;
Zhang, Kai ;
Zuo, Wangmeng ;
Tang, Zhimin ;
Luo, Linkai ;
Li, Shaohui ;
Fu, Min ;
Cao, Lei ;
Heng, Wen ;
Bui, Giang ;
Truc Le ;
Duan, Ye ;
Tao, Dacheng ;
Wang, Ruxin ;
Lin, Xu ;
Pang, Jianxin .
2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, :1110-1121
[26]   EDVR: Video Restoration with Enhanced Deformable Convolutional Networks [J].
Wang, Xintao ;
Chan, Kelvin C. K. ;
Yu, Ke ;
Dong, Chao ;
Loy, Chen Change .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, :1954-1963
[27]   Video Enhancement with Task-Oriented Flow [J].
Xue, Tianfan ;
Chen, Baian ;
Wu, Jiajun ;
Wei, Donglai ;
Freeman, William T. .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2019, 127 (08) :1106-1125
[28]   STAT: Spatial-Temporal Attention Mechanism for Video Captioning [J].
Yan, Chenggang ;
Tu, Yunbin ;
Wang, Xingzheng ;
Zhang, Yongbing ;
Hao, Xinhong ;
Zhang, Yongdong ;
Dai, Qionghai .
IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (01) :229-241
[29]   Deep Learning for Single Image Super-Resolution: A Brief Review [J].
Yang, Wenming ;
Zhang, Xuechen ;
Tian, Yapeng ;
Wang, Wei ;
Xue, Jing-Hao ;
Liao, Qingmin .
IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (12) :3106-3121
[30]  
Kim SY, 2019, Arxiv, DOI arXiv:1812.09079