A Lightweight CNN-Transformer Implemented via Structural Re-Parameterization and Hybrid Attention for Remote Sensing Image Super-Resolution

Citations: 0
Authors
Wang, Jie [1 ]
Li, Hongwei [1 ]
Li, Yifan [2 ]
Qin, Zilong [3 ]
Affiliations
[1] Zhengzhou Univ, Sch Geosci & Technol, Zhengzhou 450052, Peoples R China
[2] Univ Cologne, Inst Geophys & Meteorol, D-50923 Cologne, Germany
[3] Wuhan Univ, Sch Remote Sensing & Informat Engn, Wuhan 430079, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
super-resolution; remote sensing; CNN-Transformer; lightweight; hybrid attention
DOI
10.3390/ijgi14010008
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Remote sensing imagery contains rich information about geographical targets, and performing super-resolution (SR) reconstruction on such images requires greater feature representation capabilities. Convolutional neural network (CNN)-based methods excel at extracting intricate local features but fall short in terms of capturing global representations. While transformer methods are capable of learning long-distance dependencies, they often overlook local feature details, which can diminish the discriminability between the background and the foreground. Moreover, the distinctive architectures of transformers, their extensive parameter counts, and their reliance on large-scale training datasets impose constraints on transformer applications in remote sensing image feature extraction tasks. To address these challenges, this study introduces a novel hybrid CNN-Transformer network model named RepCHAT for remote sensing single image reconstruction, which incorporates a structural re-parameterization technique and a hybrid attention mechanism. This method leverages the strengths of transformers in terms of learning long-distance dependencies (global features) and CNNs with respect to extracting local features. The proposed approach achieves SR reconstruction for remote sensing images with fewer parameters and less computational overhead than those of traditional transformers and high-performance CNN models. We develop a multiscale feature extraction module that integrates both spatial- and frequency-domain features and employs structural re-parameterization theory to increase the inference efficiency of the model. Furthermore, we incorporate depthwise-separable convolution into the transformer block to bolster the local feature learning capabilities of the transformer. 
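The structural re-parameterization idea mentioned above can be illustrated with a minimal sketch. It is not the RepCHAT module itself: the abstract does not specify the branch layout, so the example assumes a generic single-channel block with a 3x3 branch, a 1x1 branch, and an identity branch, and it omits batch normalization. The function names (`conv3x3`, `multi_branch`, `fuse`) and all numbers are hypothetical. The point is only that parallel linear branches used during training can be folded into one 3x3 kernel for inference without changing the output.

```python
def conv3x3(image, kernel):
    """Single-channel 3x3 cross-correlation with zero padding (same-size output)."""
    h, w = len(image), len(image[0])
    out = [[0.0] * w for _ in range(h)]
    for i in range(h):
        for j in range(w):
            acc = 0.0
            for di in range(3):
                for dj in range(3):
                    y, x = i + di - 1, j + dj - 1
                    if 0 <= y < h and 0 <= x < w:
                        acc += kernel[di][dj] * image[y][x]
            out[i][j] = acc
    return out


def multi_branch(image, k3, k1):
    """Assumed training-time form: 3x3 branch + 1x1 branch + identity branch."""
    y3 = conv3x3(image, k3)
    return [[y3[i][j] + k1 * image[i][j] + image[i][j]
             for j in range(len(image[0]))]
            for i in range(len(image))]


def fuse(k3, k1):
    """Inference-time form: fold the 1x1 and identity branches into the kernel centre."""
    fused = [row[:] for row in k3]
    fused[1][1] += k1 + 1.0  # a 1x1 conv and the identity only touch the centre tap
    return fused


# Hypothetical toy data, chosen only for illustration.
image = [[1.0, 2.0, 3.0], [4.0, 5.0, 6.0], [7.0, 8.0, 9.0]]
k3 = [[0.1, 0.0, -0.1], [0.2, 0.5, -0.2], [0.1, 0.0, -0.1]]
k1 = 0.25

branched = multi_branch(image, k3, k1)
fused_out = conv3x3(image, fuse(k3, k1))
max_diff = max(abs(a - b)
               for row_a, row_b in zip(branched, fused_out)
               for a, b in zip(row_a, row_b))
print(max_diff)  # agrees up to floating-point rounding
```

After fusion, inference runs a single convolution instead of three branches, which is the kind of efficiency gain that re-parameterization is generally used for.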
The proposed method achieves the best performance among the compared methods for remote sensing single-image super-resolution reconstruction, outperforming them by 0.28-1.05 dB (×4 scale) in terms of peak signal-to-noise ratio (PSNR). Experimental results indicate that RepCHAT maintains high performance with significantly reduced complexity, making it suitable for deployment on edge devices.
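For readers unfamiliar with the metric, the following is a minimal sketch of how PSNR is commonly computed, assuming 8-bit images (peak value 255) stored as nested lists. The abstract does not state the exact evaluation protocol (for example, which channel is scored or whether borders are cropped), so the function `psnr` and the toy pixel values are illustrative assumptions only.

```python
import math


def psnr(reference, estimate, max_value=255.0):
    """PSNR in dB between two equally sized 2-D images given as nested lists."""
    ref = [p for row in reference for p in row]
    est = [p for row in estimate for p in row]
    if len(ref) != len(est):
        raise ValueError("images must have the same number of pixels")
    mse = sum((a - b) ** 2 for a, b in zip(ref, est)) / len(ref)
    if mse == 0:
        return math.inf  # identical images
    return 10.0 * math.log10(max_value ** 2 / mse)


# Hypothetical 2x2 images, chosen only for illustration.
reference = [[52, 55], [61, 59]]
estimate = [[54, 55], [61, 57]]
score = psnr(reference, estimate)
print(f"{score:.2f} dB")
```

Because PSNR is logarithmic in the mean squared error, a gain of 0.28 dB corresponds to roughly 6% lower MSE and a gain of 1.05 dB to roughly 21% lower MSE at a fixed peak value.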
Pages: 21
Related papers
50 in total
  • [11] DTCNet: Transformer-CNN Distillation for Super-Resolution of Remote Sensing Image
    Lin, Cong
    Mao, Xin
    Qiu, Chenghao
    Zou, Lilan
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 11117 - 11133
  • [12] Efficiently Amalgamated CNN-Transformer Network for Image Super-Resolution Reconstruction
    Zheng, Mengyuan
    Zang, Huaijuan
    Liu, Xinzhi
    Cheng, Guoan
    Zhan, Shu
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT XI, 2024, 14435 : 3 - 13
  • [13] CTCNet: A CNN-Transformer Cooperation Network for Face Image Super-Resolution
    Gao, Guangwei
    Xu, Zixiang
    Li, Juncheng
    Yang, Jian
    Zeng, Tieyong
    Qi, Guo-Jun
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 1978 - 1991
  • [14] A CNN-Transformer Embedded Unfolding Network for Hyperspectral Image Super-Resolution
    Tang, Yao
    Li, Jie
    Yue, Linwei
    Liu, Xinxin
    Li, Yajie
    Xiao, Yi
    Yuan, Qiangqiang
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [15] TRANSFORMER AND CNN HYBRID NETWORK FOR SUPER-RESOLUTION SEMANTIC SEGMENTATION OF REMOTE SENSING IMAGERY
    Liu, Yutong
    Gao, Kun
    Wang, Hong
    Wang, Junwei
    Zhang, Xiaodian
    Wang, Pengyu
    Li, Shuzhong
    IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 6940 - 6943
  • [16] Remote Sensing Image Super-Resolution via Residual-Dense Hybrid Attention Network
    Yu, Bo
    Lei, Bin
    Guo, Jiayi
    Sun, Jiande
    Li, Shengtao
    Xie, Guangshuai
    REMOTE SENSING, 2022, 14 (22)
  • [17] Hybrid-Scale Hierarchical Transformer for Remote Sensing Image Super-Resolution
    Shang, Jianrun
    Gao, Mingliang
    Li, Qilei
    Pan, Jinfeng
    Zou, Guofeng
    Jeon, Gwanggil
    REMOTE SENSING, 2023, 15 (13)
  • [18] Efficient Remote Sensing Image Super-Resolution via Lightweight Diffusion Models
    An, Tai
    Xue, Bin
    Huo, Chunlei
    Xiang, Shiming
    Pan, Chunhong
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21 : 1 - 5
  • [19] PEFormer: a pixel-level enhanced CNN-transformer hybrid network for face image super-resolution
    Lu, Xinbiao
    Gao, Xing
    Chen, Yisen
    Chen, Guiyun
    Yang, Tieliu
    Chen, Yudan
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (10) : 7303 - 7317
  • [20] TCSR: Lightweight Transformer and CNN Interaction Network for Image Super-Resolution
    Cai, Danlin
    Tan, Wenwen
    Chen, Feiyang
    Lou, Xinchi
    Xiahou, Jianbin
    Zhu, Daxin
    Huang, Detian
    IEEE ACCESS, 2024, 12 : 174782 - 174795