A Deformable Convolutional Neural Network for Video Super-Resolution

Cited by: 0
Authors
Chen, Xi [1, 2, 3, 4, 5]
Zhang, Qi [6, 7]
Liu, Kai [4]
Zhang, Yong [2, 3]
Affiliations
[1] Guizhou Univ, State Key Lab Publ Big Data, Guiyang, Peoples R China
[2] Shenzhen Univ, Guangdong Key Lab Intelligent Informat Proc, Shenzhen, Peoples R China
[3] Shenzhen Univ, Coll Elect & Informat Engn, Shenzhen, Peoples R China
[4] Northwestern Polytech Univ, Sch Software, Xian, Peoples R China
[5] Northwestern Polytech Univ, Yangtze River Delta Res Inst, Taicang, Peoples R China
[6] Harbin Inst Technol Weihai, Sch Econ & Management, Weihai, Peoples R China
[7] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing, Peoples R China
Keywords
deformable convolution; neural network; video super-resolution
DOI
10.1111/coin.70052
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Convolutional neural networks (CNNs) rely on deep architectures to extract deep features for video super-resolution. However, they struggle with rapid motion and complex scenes. In this paper, we present a deformable convolutional neural network for video super-resolution (DVSRNet). DVSRNet mainly consists of forward and backward feature propagation blocks (FPBs), feature enhancement blocks (FEBs), a feature fusion block (FFB), and a reconstruction block (RB). The FPBs leverage temporal sequence information to capture rich temporal information. To restore detail, an optical flow technique guides a CNN to align the structural information obtained from different frames, reducing motion-induced blur and artifacts. To handle deformations caused by moving objects, two FEBs use deformable convolutions to adaptively correct misaligned objects and improve the spatial continuity of the video. To improve the reliability of the output, the FFB integrates the relations among video frames from the forward and backward propagations. Finally, the RB uses upsampling operations and residual learning to reconstruct high-quality video. Experimental results demonstrate that DVSRNet achieves superior performance on multiple public datasets for video super-resolution. The code is available at .
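The deformable convolution used in the FEBs can be illustrated with a minimal, dependency-free sketch. It is not the authors' implementation: it handles a single channel with stride 1 and zero padding, and the `offsets` argument is a hypothetical stand-in for the per-pixel offsets that a network would normally predict. Each kernel tap samples the input at its regular grid position plus a fractional offset, using bilinear interpolation.

```python
import math


def bilinear_sample(img, y, x):
    """Sample img (a list of rows) at fractional (y, x); points outside the image count as 0."""
    h, w = len(img), len(img[0])
    y0, x0 = math.floor(y), math.floor(x)
    value = 0.0
    for yy in (y0, y0 + 1):
        for xx in (x0, x0 + 1):
            if 0 <= yy < h and 0 <= xx < w:
                # Bilinear weight falls off linearly with distance to the neighbor.
                weight = (1.0 - abs(y - yy)) * (1.0 - abs(x - xx))
                value += weight * img[yy][xx]
    return value


def deformable_conv2d(img, kernel, offsets):
    """Single-channel deformable convolution (stride 1, zero padding).

    offsets[i][j][k] is an assumed (dy, dx) pair for kernel tap k at output
    pixel (i, j); in a real network these would be learned, not passed in.
    """
    h, w = len(img), len(img[0])
    kh, kw = len(kernel), len(kernel[0])
    out = [[0.0] * w for _ in range(h)]
    for i in range(h):
        for j in range(w):
            acc = 0.0
            for a in range(kh):
                for b in range(kw):
                    dy, dx = offsets[i][j][a * kw + b]
                    # Regular grid position of this tap, shifted by its offset.
                    y = i + a - kh // 2 + dy
                    x = j + b - kw // 2 + dx
                    acc += kernel[a][b] * bilinear_sample(img, y, x)
            out[i][j] = acc
    return out
```

With every offset set to (0, 0), the function reduces to an ordinary zero-padded convolution (in the cross-correlation sense used by deep learning frameworks); non-zero offsets let each tap follow content that has moved or deformed between frames.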
Pages: 10