Bidirectional Multi-scale Deformable Attention for Video Super-Resolution

被引:0
作者
Zhenghua Zhou
Boxiang Xue
Hai Wang
Jianwei Zhao
机构
[1] Zhejiang University of Finance and Economics,School of Data Sciences
[2] China Jiliang University,Department of Data Science, College of Sciences
[3] Murdoch University,Discipline of Engineering and Energy
[4] China Jiliang University,College of Information Engineering
来源
Multimedia Tools and Applications | 2024年 / 83卷
关键词
Video super-resolution; Multi-scale deformable convolution; Multi-scale attention; Bidirectional propagation;
D O I
暂无
中图分类号
学科分类号
摘要
Video super-resolution aims to generate a high-resolution video frame from its low-resolution video sequences. Video super-resolution is still a challenging problem due to performing the temporal frame alignment and spatial feature fusion during the process of spatial-temporal modeling. Existing deep learning based methods have limitations in handling accurate alignment and effective fusion of frames with multi-scale feature information. In this paper, we propose Bidirectional Multi-scale Deformable Attention (BMDA) for video Super-Resolution in terms of propagation, alignment and fusion. More specifically, the developed Deformable Alignment Module (DAM) in BMDA contains two kinds of modules: Multi-scale Deformable Convolution Module (MDCM) and Multi-scale Attention Module (MAM). MDCM is leveraged to deal with the offset information in different scales and align adjacent frames at the feature level, improving the robustness of the alignment among adjacent frames. MAM is designed to extract the local and global features of the aligned features for aggregation, such that the feature information compensation between pixels is achieved. Additionally, in order to make full use of shallow features, dense connection structure between each layer is adopted in the framework of bidirectional propagation to achieve better visual performance on video super-resolution. In particular, our proposed BDAM outperforms BasicVSR by up to 1.28dB in PSNR when batch size is set to 2. Experimental results on public video benchmark datasets demonstrate that the proposed method can achieve superior performance on large motion videos as compared with the state-of-the art methods.
引用
收藏
页码:27809 / 27830
页数:21
相关论文
共 50 条
  • [21] Video Super-Resolution via Bidirectional Recurrent Convolutional Networks
    Huang, Yan
    Wang, Wei
    Wang, Liang
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) : 1015 - 1028
  • [22] BFRVSR: A Bidirectional Frame Recurrent Method for Video Super-Resolution
    Xue, Xiongxiong
    Han, Zhenqi
    Tong, Weiqin
    Li, Mingqi
    Liu, Lizhuang
    APPLIED SCIENCES-BASEL, 2020, 10 (23): : 1 - 11
  • [23] S2A: Scale-Attention-Aware Networks for Video Super-Resolution
    Guo, Taian
    Dai, Tao
    Liu, Ling
    Zhu, Zexuan
    Xia, Shu-Tao
    ENTROPY, 2021, 23 (11)
  • [24] Video super-resolution with phase-aided deformable alignment network
    Cai, Zhuojun
    Chen, Yaowu
    Tian, Xiang
    Jiang, Rongxin
    JOURNAL OF ELECTRONIC IMAGING, 2020, 29 (03)
  • [25] Learning a Deep Dual Attention Network for Video Super-Resolution
    Li, Feng
    Bai, Huihui
    Zhao, Yao
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 (29) : 4474 - 4488
  • [26] DEFORMABLE ALIGNMENT AND SCALE-ADAPTIVE FEATURE EXTRACTION NETWORK FOR CONTINUOUS-SCALE SATELLITE VIDEO SUPER-RESOLUTION
    Ni, Ning
    Wu, Hanlin
    Zhang, Libao
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 2746 - 2750
  • [27] DDAN: A DEEP DUAL ATTENTION NETWORK FOR VIDEO SUPER-RESOLUTION
    Sun, Xiyue
    Li, Feng
    Bai, Huihui
    Zhao, Yao
    2021 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2021,
  • [28] A Lightweight Recurrent Grouping Attention Network for Video Super-Resolution
    Zhu, Yonggui
    Li, Guofang
    SENSORS, 2023, 23 (20)
  • [29] Bidirectional Temporal-Recurrent Propagation Networks for Video Super-Resolution
    Han, Lei
    Fan, Cien
    Yang, Ye
    Zou, Lian
    ELECTRONICS, 2020, 9 (12) : 1 - 15
  • [30] Dual Attention with the Self-Attention Alignment for Efficient Video Super-resolution
    Chu, Yuezhong
    Qiao, Yunan
    Liu, Heng
    Han, Jungong
    COGNITIVE COMPUTATION, 2022, 14 (03) : 1140 - 1151