Bidirectional Multi-scale Deformable Attention for Video Super-Resolution

被引:0
|
作者
Zhenghua Zhou
Boxiang Xue
Hai Wang
Jianwei Zhao
机构
[1] Zhejiang University of Finance and Economics,School of Data Sciences
[2] China Jiliang University,Department of Data Science, College of Sciences
[3] Murdoch University,Discipline of Engineering and Energy
[4] China Jiliang University,College of Information Engineering
来源
Multimedia Tools and Applications | 2024年 / 83卷
关键词
Video super-resolution; Multi-scale deformable convolution; Multi-scale attention; Bidirectional propagation;
D O I
暂无
中图分类号
学科分类号
摘要
Video super-resolution aims to generate a high-resolution video frame from its low-resolution video sequences. Video super-resolution is still a challenging problem due to performing the temporal frame alignment and spatial feature fusion during the process of spatial-temporal modeling. Existing deep learning based methods have limitations in handling accurate alignment and effective fusion of frames with multi-scale feature information. In this paper, we propose Bidirectional Multi-scale Deformable Attention (BMDA) for video Super-Resolution in terms of propagation, alignment and fusion. More specifically, the developed Deformable Alignment Module (DAM) in BMDA contains two kinds of modules: Multi-scale Deformable Convolution Module (MDCM) and Multi-scale Attention Module (MAM). MDCM is leveraged to deal with the offset information in different scales and align adjacent frames at the feature level, improving the robustness of the alignment among adjacent frames. MAM is designed to extract the local and global features of the aligned features for aggregation, such that the feature information compensation between pixels is achieved. Additionally, in order to make full use of shallow features, dense connection structure between each layer is adopted in the framework of bidirectional propagation to achieve better visual performance on video super-resolution. In particular, our proposed BDAM outperforms BasicVSR by up to 1.28dB in PSNR when batch size is set to 2. Experimental results on public video benchmark datasets demonstrate that the proposed method can achieve superior performance on large motion videos as compared with the state-of-the art methods.
引用
收藏
页码:27809 / 27830
页数:21
相关论文
共 50 条
  • [11] Super-Resolution Network with Information Distillation and Multi-Scale Attention for Medical CT Image
    Zhao, Tianliu
    Hu, Lei
    Zhang, Yongmei
    Fang, Jianying
    SENSORS, 2021, 21 (20)
  • [12] Video Super-Resolution Using Multi-Scale and Non-Local Feature Fusion
    Li, Yanghui
    Zhu, Hong
    Hou, Qian
    Wang, Jing
    Wu, Wenhuan
    ELECTRONICS, 2022, 11 (09)
  • [13] Deformable transformer for endoscopic video super-resolution
    Song, Xiaowei
    Tang, Hui
    Yang, Chunfeng
    Zhou, Guangquan
    Wang, Yangang
    Huang, Xinjun
    Hua, Jie
    Coatrieux, Gouenou
    He, Xiaopu
    Chen, Yang
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2022, 77
  • [14] Video Super-Resolution using Multi-scale Pyramid 3D Convolutional Networks
    Luo, Jianping
    Huang, Shaofei
    Yuan, Yuan
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 1882 - 1890
  • [15] FLOW-GUIDED DEFORMABLE ATTENTION NETWORK FOR FAST ONLINE VIDEO SUPER-RESOLUTION
    Yang, Xi
    Zhang, Xindong
    Zhang, Lei
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 390 - 394
  • [16] Bidirectional scale-aware upsampling network for arbitrary-scale video super-resolution
    Luo, Laigan
    Yi, Benshun
    Wang, Zhongyuan
    He, Zheng
    Zhu, Chao
    IMAGE AND VISION COMPUTING, 2024, 148
  • [17] MAPANet: A Multi-Scale Attention-Guided Progressive Aggregation Network for Multi-Contrast MRI Super-Resolution
    Liu, Licheng
    Liu, Tao
    Zhou, Wei
    Wang, Yaonan
    Liu, Min
    IEEE TRANSACTIONS ON COMPUTATIONAL IMAGING, 2024, 10 : 928 - 940
  • [18] Deformable 3D Convolution for Video Super-Resolution
    Ying, Xinyi
    Wang, Longguang
    Wang, Yingqian
    Sheng, Weidong
    An, Wei
    Guo, Yulan
    IEEE SIGNAL PROCESSING LETTERS, 2020, 27 : 1500 - 1504
  • [19] Deformable Non-Local Network for Video Super-Resolution
    Wang, Hua
    Su, Dewei
    Liu, Chuangchuang
    Jin, Longcun
    Sun, Xianfang
    Peng, Xinyi
    IEEE ACCESS, 2019, 7 : 177734 - 177744
  • [20] MLDAN:Multi-scale large kernel decomposition attention network super-resolution of lung computed tomography images
    Li, Yanmei
    Li, Xiaoshuang
    Luo, Jian
    Yu, Tao
    Deng, Jingshi
    Yang, Qibin
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 107