Self-Supervised Video Super-Resolution by Spatial Constraint and Temporal Fusion

Cited by: 6
Authors
Yang, Cuixin [1, 2, 3, 4, 5]
Luo, Hongming [1, 2, 3, 4, 5]
Liao, Guangsen [1, 2, 3, 4, 5]
Lu, Zitao [1, 2, 3, 4, 5]
Zhou, Fei [1, 2, 3, 4, 5]
Qiu, Guoping [1, 3, 4, 5]
Affiliations
[1] Shenzhen Univ, Coll Elect & Informat Engn, Shenzhen, Peoples R China
[2] Peng Cheng Lab, Shenzhen, Peoples R China
[3] Guangdong Key Lab Intelligent Informat Proc, Shenzhen, Peoples R China
[4] Shenzhen Inst Artificial Intelligence & Robot Soc, Shenzhen, Peoples R China
[5] Key Lab Digital Creat Technol, Shenzhen, Peoples R China
Source
PATTERN RECOGNITION AND COMPUTER VISION, PT III | 2021 / Vol. 13021
Keywords
Video super-resolution; Self-supervision; Deep learning
DOI
10.1007/978-3-030-88010-1_21
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
To avoid fallacious assumptions about the degradation process when preparing training data, some self-similarity-based super-resolution (SR) algorithms exploit the internal recurrence of patches without relying on external datasets. However, the network architectures of these "zero-shot" SR methods are often shallow, because deeper networks would over-fit the limited samples available within a single image. This restricts the power of deep neural networks (DNNs). To relieve this problem, we propose a middle-layer feature loss that allows the network architecture to be deeper for handling the video super-resolution (VSR) task in a self-supervised way. Specifically, we constrain the middle-layer feature of the VSR network to be as similar as possible to that of the corresponding single image super-resolution (SISR) network in a Spatial Module, then fuse the inter-frame information in a Temporal Fusion Module. Experimental results demonstrate that the proposed algorithm achieves significantly superior results on real-world data in comparison with some state-of-the-art methods.
Pages: 249 - 260
Number of pages: 12