Self-Supervised Video Super-Resolution by Spatial Constraint and Temporal Fusion

被引:6
|
作者
Yang, Cuixin [1 ,2 ,3 ,4 ,5 ]
Luo, Hongming [1 ,2 ,3 ,4 ,5 ]
Liao, Guangsen [1 ,2 ,3 ,4 ,5 ]
Lu, Zitao [1 ,2 ,3 ,4 ,5 ]
Zhou, Fei [1 ,2 ,3 ,4 ,5 ]
Qiu, Guoping [1 ,3 ,4 ,5 ]
机构
[1] Shenzhen Univ, Coll Elect & Informat Engn, Shenzhen, Peoples R China
[2] Peng Cheng Lab, Shenzhen, Peoples R China
[3] Guangdong Key Lab Intelligent Informat Proc, Shenzhen, Peoples R China
[4] Shenzhen Inst Artificial Intelligence & Robot Soc, Shenzhen, Peoples R China
[5] Key Lab Digital Creat Technol, Shenzhen, Peoples R China
来源
PATTERN RECOGNITION AND COMPUTER VISION,, PT III | 2021年 / 13021卷
关键词
Video super-resolution; Self-supervision; Deep learning;
D O I
10.1007/978-3-030-88010-1_21
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
To avoid any fallacious assumption on the degeneration procedure in preparing training data, some self-similarity based super-resolution (SR) algorithms have been proposed to exploit the internal recurrence of patches without relying on external datasets. However, the network architectures of those "zero-shot" SR methods are often shallow. Otherwise they would suffer from the over-fitting problem due to the limited samples within a single image. This restricts the strong power of deep neural networks (DNNs). To relieve this problem, we propose a middle-layer feature loss to allow the network architecture to be deeper for handling the video super-resolution (VSR) task in a self-supervised way. Specifically, we constrain the middle-layer feature of VSR network to be as similar as that of the corresponding single image super-resolution (SISR) in a Spatial Module, then fuse the inter-frame information in a Temporal Fusion Module. Experimental results demonstrate that the proposed algorithm achieves significantly superior results on real-world data in comparison with some state-of-the-art methods.
引用
收藏
页码:249 / 260
页数:12
相关论文
共 50 条
  • [31] TMP: Temporal Motion Propagation for Online Video Super-Resolution
    Zhang, Zhengqiang
    Li, Ruihuang
    Guo, Shi
    Cao, Yang
    Zhang, Lei
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 5014 - 5028
  • [32] Deeply feature fused video super-resolution network using temporal grouping
    Chen, Zhensen
    Yang, Wenyuan
    Yang, Jingmin
    JOURNAL OF SUPERCOMPUTING, 2022, 78 (07) : 8999 - 9016
  • [33] Noise-robust video super-resolution using an adaptive spatial-temporal filter
    Hu, Jing
    Luo, Yupin
    MULTIMEDIA TOOLS AND APPLICATIONS, 2015, 74 (21) : 9259 - 9278
  • [34] Physics-informed and self-supervised multi-image super-resolution reconstruction for digital holography microscopy
    Zhang, Xiangchao
    Ma, Xinyang
    Rong, Shuangquan
    SURFACE TOPOGRAPHY-METROLOGY AND PROPERTIES, 2025, 13 (01):
  • [35] Video Super-Resolution Reconstruction Based on Deep Learning and Spatio-Temporal Feature Self-Similarity
    Liang, Meiyu
    Du, Junping
    Li, Linghui
    Xue, Zhe
    Wang, Xiaoxiao
    Kou, Feifei
    Wang, Xu
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (09) : 4538 - 4553
  • [36] Nonlocal-guided enhanced interaction spatial-temporal network for compressed video super-resolution
    Junxiong Cheng
    Shuhua Xiong
    Xiaohai He
    Chao Ren
    Tingrong Zhang
    Honggang Chen
    Applied Intelligence, 2023, 53 : 24407 - 24421
  • [37] Attention-guided video super-resolution with recurrent multi-scale spatial–temporal transformer
    Wei Sun
    Xianguang Kong
    Yanning Zhang
    Complex & Intelligent Systems, 2023, 9 : 3989 - 4002
  • [38] Nonlocal-guided enhanced interaction spatial-temporal network for compressed video super-resolution
    Cheng, Junxiong
    Xiong, Shuhua
    He, Xiaohai
    Ren, Chao
    Zhang, Tingrong
    Chen, Honggang
    APPLIED INTELLIGENCE, 2023, 53 (20) : 24407 - 24421
  • [39] A Survey of Deep Learning Video Super-Resolution
    Baniya, Arbind Agrahari
    Lee, Tsz-Kwan
    Eklund, Peter W.
    Aryal, Sunil
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, 8 (04): : 2655 - 2676
  • [40] Video super-resolution with inverse recurrent net and hybrid local fusion
    Li, Dingyi
    Wang, Zengfu
    Yang, Jian
    NEUROCOMPUTING, 2022, 489 : 40 - 51