Resampling video super-resolution based on multi-scale guided optical flow

被引:0
|
作者
Li, Puying [1 ]
Zhu, Fuzhen [1 ]
Liu, Yong [1 ]
Zhang, Qi [1 ]
机构
[1] Heilongjiang Univ, Sch Elect Engn, Harbin 150080, Peoples R China
关键词
Video super-resolution; Transformer; Multi-scale adaptive flow estimation; Resampling; NETWORKS;
D O I
10.1016/j.compeleceng.2025.110176
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Existing video super-resolution (VSR) methods are inadequate for dealing with inter-frame motion and spatial distortion problems, especially in high-motion scenes, which tend to lead to loss of details and degradation of reconstruction quality. To address these challenges, this paper puts forward a resampling video super-resolution algorithm based on multiscale guided optical flow. The method combines multi-scale guided optical flow estimation to address the issue of interframe motion and a resampling deformable convolution module to address the issue of spatial distortion. Specifically, features are first extracted from low-quality video frames using a convolutional layer, followed by feature extraction with Residual Swin Transformer Blocks (RSTBs). In the feature alignment module, a multiscale-guided optical flow estimation approach is employed, which addresses the inter-frame motion problem across different video segments and performs video frame interpolation and super-resolution reconstruction simultaneously. Furthermore, spatial alignment is achieved by integrating resampling into the deformable convolution module, mitigating spatial distortion. Finally, multiple Residual Swin Transformer Blocks (RSTBs) are used to extract and fuse features, and pixel rearrangement layers are employed to reconstruct high-quality video frames. The experimental results on the REDS, Vid4, and UDM10 datasets show that our method significantly outperforms current state-of-the-art (SOTA) techniques, with improvements of 0.61 dB in Peak Signal-to-Noise Ratio (PSNR) and 0.0121 in Structural Similarity (SSIM), validating the effectiveness and superiority of the method.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Multi-scale Residual Dense Block for Video Super-Resolution
    Cui, Hetao
    Sun, Quansen
    INTELLIGENCE SCIENCE AND BIG DATA ENGINEERING: VISUAL DATA ENGINEERING, PT I, 2019, 11935 : 424 - 434
  • [2] Bidirectional Multi-scale Deformable Attention for Video Super-Resolution
    Zhenghua Zhou
    Boxiang Xue
    Hai Wang
    Jianwei Zhao
    Multimedia Tools and Applications, 2024, 83 : 27809 - 27830
  • [3] Bidirectional Multi-scale Deformable Attention for Video Super-Resolution
    Zhou, Zhenghua
    Xue, Boxiang
    Wang, Hai
    Zhao, Jianwei
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (09) : 27809 - 27830
  • [4] Video super-resolution based on multi-scale 3D convolution
    Zhan K.
    Sun Y.
    Li Y.
    Xi'an Dianzi Keji Daxue Xuebao/Journal of Xidian University, 2021, 48 (05): : 8 - 14
  • [5] Attention-guided video super-resolution with recurrent multi-scale spatial–temporal transformer
    Wei Sun
    Xianguang Kong
    Yanning Zhang
    Complex & Intelligent Systems, 2023, 9 : 3989 - 4002
  • [6] Optical flow for video super-resolution: a survey
    Tu, Zhigang
    Li, Hongyan
    Xie, Wei
    Liu, Yuanzhong
    Zhang, Shifu
    Li, Baoxin
    Yuan, Junsong
    ARTIFICIAL INTELLIGENCE REVIEW, 2022, 55 (08) : 6505 - 6546
  • [7] Attention-guided video super-resolution with recurrent multi-scale spatial-temporal transformer
    Sun, Wei
    Kong, Xianguang
    Zhang, Yanning
    COMPLEX & INTELLIGENT SYSTEMS, 2023, 9 (04) : 3989 - 4002
  • [8] Optical flow for video super-resolution: a survey
    Zhigang Tu
    Hongyan Li
    Wei Xie
    Yuanzhong Liu
    Shifu Zhang
    Baoxin Li
    Junsong Yuan
    Artificial Intelligence Review, 2022, 55 : 6505 - 6546
  • [9] LightVSR: A Lightweight Video Super-Resolution Model with Multi-Scale Feature Aggregation
    Huang, Guanglun
    Li, Nachuan
    Liu, Jianming
    Zhang, Minghe
    Zhang, Li
    Li, Jun
    APPLIED SCIENCES-BASEL, 2025, 15 (03):
  • [10] Video Super-Resolution Using Multi-Scale and Non-Local Feature Fusion
    Li, Yanghui
    Zhu, Hong
    Hou, Qian
    Wang, Jing
    Wu, Wenhuan
    ELECTRONICS, 2022, 11 (09)