MadFormer: multi-attention-driven image super-resolution method based on Transformer

被引:4
|
作者
Liu, Beibei [1 ]
Sun, Jing [1 ]
Zhu, Bing [2 ]
Li, Ting [1 ]
Sun, Fuming [1 ]
机构
[1] Dalian Minzu Univ, Sch Informat & Commun Engn, Liaohe West Rd, Dalian 116600, Liaoning, Peoples R China
[2] Harbin Inst Technol, Sch Elect & Informat Engn, Xidazhi St, Harbin 150006, Heilongjiang, Peoples R China
基金
中国国家自然科学基金;
关键词
Image super-resolution; Transformer; Multi-attention-driven; Dynamic fusion;
D O I
10.1007/s00530-024-01276-1
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
While the Transformer-based method has demonstrated exceptional performance in low-level visual processing tasks, it has a strong modeling ability only locally, thereby neglecting the importance of spatial feature information and high-frequency details within the channel for super-resolution. To enhance feature information and improve the visual experience, we propose a multi-attention-driven image super-resolution method based on a Transformer network, called MadFormer. Initially, the low-resolution image undergoes an initial convolution operation to extract shallow features while being fed into a residual multi-attention block incorporating channel attention, spatial attention, and self-attention mechanisms. By employing multi-head self-attention, the proposed method aims to capture global-local feature information; channel attention and spatial attention are utilized to effectively capture high-frequency features in both the channel and spatial domains. Subsequently, deep feature information is inputted into a dynamic fusion block that dynamically fuses multi-attention extracted features, facilitating the aggregation of cross-window information. Ultimately, the shallow and deep feature information is fused via convolution operations, yielding high-resolution images through high-quality reconstruction. Comprehensive quantitative and qualitative comparisons with other advanced algorithms demonstrate the substantial advantages of the proposed approach in terms of peak signal-to-noise ratio (PSNR) and structural similarity (SSIM) for image super-resolution.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] Single image super-resolution based on multi-scale dense attention network
    Farong Gao
    Yong Wang
    Zhangyi Yang
    Yuliang Ma
    Qizhong Zhang
    Soft Computing, 2023, 27 : 2981 - 2992
  • [42] Dual Aggregation Transformer for Image Super-Resolution
    Chen, Zheng
    Zhang, Yulun
    Gu, Jinjin
    Kong, Linghe
    Yang, Xiaokang
    Yu, Fisher
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 12278 - 12287
  • [43] Image Super-Resolution Based on Residual Attention and Multi-Scale Feature Fusion
    Kou, Qiqi
    Zhao, Jiamin
    Cheng, Deqiang
    Su, Zhen
    Zhu, Xingguang
    IEEE ACCESS, 2023, 11 : 59530 - 59541
  • [44] Image super-resolution reconstruction based on multi-scale dual-attention
    Li, Hong-an
    Wang, Diao
    Zhang, Jing
    Li, Zhanli
    Ma, Tian
    CONNECTION SCIENCE, 2023, 35 (01)
  • [45] Lightweight Image Super-Resolution Reconstruction Method Based on Multi-scale Spatial Adaptive Attention Network
    Huang, Feng
    Liu, Hongwei
    Shen, Ying
    Qiu, Zhaobing
    Chen, Liqiong
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2025, 38 (01): : 36 - 50
  • [46] Lightweight Image Super-Resolution Network Based on Regional Complementary Attention and Multi-dimensional Attention
    Zhou D.
    Wang W.
    Ma Y.
    Gao D.
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2022, 35 (07): : 625 - 636
  • [47] Multi-attention augmented network for single image super-resolution
    Chen, Rui
    Zhang, Heng
    Liu, Jixin
    PATTERN RECOGNITION, 2022, 122
  • [48] Multi-Grained Attention Networks for Single Image Super-Resolution
    Wu, Huapeng
    Zou, Zhengxia
    Gui, Jie
    Zeng, Wen-Jun
    Ye, Jieping
    Zhang, Jun
    Liu, Hongyi
    Wei, Zhihui
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (02) : 512 - 522
  • [49] A multi-scale enhanced large-kernel attention transformer network for lightweight image super-resolution
    Chang, Kairong
    Jun, Sun
    Biao, Yang
    Hu, Mingzhi
    Yang, Junlong
    SIGNAL IMAGE AND VIDEO PROCESSING, 2025, 19 (03)
  • [50] TBNet: Stereo Image Super-Resolution with Multi-Scale Attention
    Zhu, Jiyang
    Han, Xue
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2023, 32 (18)