Bidirectional Multi-scale Deformable Attention for Video Super-Resolution

Authors
Zhenghua Zhou
Boxiang Xue
Hai Wang
Jianwei Zhao
Affiliations
[1] Zhejiang University of Finance and Economics, School of Data Sciences
[2] China Jiliang University, Department of Data Science, College of Sciences
[3] Murdoch University, Discipline of Engineering and Energy
[4] China Jiliang University, College of Information Engineering
Source
Multimedia Tools and Applications | 2024 / Vol. 83, No. 9
Keywords
Video super-resolution; Multi-scale deformable convolution; Multi-scale attention; Bidirectional propagation
Abstract
Video super-resolution aims to generate high-resolution video frames from low-resolution video sequences. It remains a challenging problem because spatial-temporal modeling requires both temporal frame alignment and spatial feature fusion. Existing deep-learning-based methods are limited in how accurately they align frames and how effectively they fuse frames carrying multi-scale feature information. In this paper, we propose Bidirectional Multi-scale Deformable Attention (BMDA) for video super-resolution, addressing propagation, alignment, and fusion. More specifically, the Deformable Alignment Module (DAM) in BMDA contains two kinds of modules: the Multi-scale Deformable Convolution Module (MDCM) and the Multi-scale Attention Module (MAM). MDCM handles offset information at different scales and aligns adjacent frames at the feature level, improving the robustness of alignment between adjacent frames. MAM extracts local and global features from the aligned features for aggregation, so that feature information is compensated between pixels. Additionally, to make full use of shallow features, a dense connection structure between layers is adopted within the bidirectional propagation framework to achieve better visual performance. In particular, the proposed BMDA outperforms BasicVSR by up to 1.28 dB in PSNR when the batch size is set to 2. Experimental results on public video benchmark datasets demonstrate that the proposed method achieves superior performance on large-motion videos compared with state-of-the-art methods.
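The abstract reports its headline result as a PSNR gain in dB. As a reading aid, the sketch below computes PSNR under its standard definition, 10 * log10(MAX^2 / MSE), and converts a dB gain into the equivalent mean-squared-error ratio. It is an illustrative sketch, not the authors' evaluation code: the function names `psnr` and `mse_ratio_for_gain`, the nested-list frame representation, and the 8-bit peak value of 255 are assumptions made here, and the paper's actual evaluation protocol (color channel, border cropping) is not stated in this record.

```python
import math


def psnr(reference, estimate, max_value=255.0):
    """Return PSNR in dB between two equally sized frames.

    Frames are nested lists of pixel values (rows of numbers).
    max_value is the peak pixel value; 255 assumes 8-bit images.
    """
    ref = [p for row in reference for p in row]
    est = [p for row in estimate for p in row]
    if not ref or len(ref) != len(est):
        raise ValueError("frames must be non-empty and the same size")
    # Mean squared error over all pixels.
    mse = sum((r - e) ** 2 for r, e in zip(ref, est)) / len(ref)
    if mse == 0:
        return math.inf  # identical frames
    return 10.0 * math.log10(max_value ** 2 / mse)


def mse_ratio_for_gain(gain_db):
    """Return how many times smaller the MSE is for a given PSNR gain in dB."""
    return 10.0 ** (gain_db / 10.0)


if __name__ == "__main__":
    ground_truth = [[100, 100], [100, 100]]
    restored = [[110, 90], [100, 100]]
    print(f"PSNR: {psnr(ground_truth, restored):.2f} dB")
    print(f"MSE ratio for a 1.28 dB gain: {mse_ratio_for_gain(1.28):.3f}")
```

Under this definition, and assuming both methods are scored against the same peak value, a 1.28 dB gain corresponds to a mean squared error about 1.34 times smaller, that is, roughly 25% lower.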
Pages: 27809-27830
Page count: 21