Remote sensing image super-resolution via cross-scale hierarchical transformer

被引:8
|
作者
Xiao, Yi [1 ]
Yuan, Qiangqiang [1 ]
He, Jiang [1 ]
Zhang, Liangpei [2 ]
机构
[1] Wuhan Univ, Sch Geodesy & Geomat, Wuhan, Peoples R China
[2] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & R, Wuhan, Peoples R China
基金
中国国家自然科学基金;
关键词
Super-resolution; transformer; cross-scale; hierarchical attention; remote sensing; OBJECT DETECTION; NETWORK; RESOLUTION;
D O I
10.1080/10095020.2023.2288179
中图分类号
TP7 [遥感技术];
学科分类号
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
摘要
Global and local modeling is essential for image super-resolution tasks. However, current efforts often lack explicit consideration of the cross-scale knowledge in large-scale earth observation scenarios, resulting in suboptimal single-scale representations in global and local modeling. The key motivation of this work is inspired by two observations: 1) There exists hierarchical features at the local and global regions in remote sensing images, and 2) they exhibit scale variation of similar ground objects (e.g. cross-scale similarity). In light of these, this paper presents an effective method to grasp the global and local image hierarchies by systematically exploring the cross-scale correlation. Specifically, we developed a Cross-scale Self-Attention (CSA) to model the global features, which introduces an auxiliary token space to calculate cross-scale self-attention matrices, thus exploring global dependency from diverse token scales. To extract the cross-scale localities, a Cross-scale Channel Attention (CCA) is devised, where multi-scale features are explored and progressively incorporated into an enriched feature. Moreover, by hierarchically deploying CSA and CCA into transformer groups, the proposed Cross-scale Hierarchical Transformer (CHT) can effectively explore cross-scale representations in remote sensing images, leading to a favorable reconstruction performance. Comprehensive experiments and analysis on four remote sensing datasets have demonstrated the superiority of CHT in both simulated and real-world remote sensing scenes. In particular, our CHT outperforms the state-of-the-art approach (TransENet) in terms of PSNR by 0.11dB on average, but only accounts for 54.8% of its parameters.
引用
收藏
页码:1914 / 1930
页数:17
相关论文
共 50 条
  • [21] Cross-MPI: Cross-scale Stereo for Image Super-Resolution using Multiplane Images
    Zhou, Yuemei
    Wu, Gaochang
    Fu, Ying
    Li, Kun
    Liu, Yebin
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 14837 - 14846
  • [22] Remote sensing image Super-resolution reconstruction by fusing multi-scale receptive fields and hybrid transformer
    Liu, Denghui
    Zhong, Lin
    Wu, Haiyang
    Li, Songyang
    Li, Yida
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [23] Feature Super-Resolution Fusion With Cross-Scale Distillation for Small-Object Detection in Optical Remote Sensing Images
    Gao, Yunxiao
    Wang, Yongcheng
    Zhang, Yuxi
    Li, Zheng
    Chen, Chi
    Feng, Hao
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21 : 1 - 5
  • [24] Activated Sparsely Sub-Pixel Transformer for Remote Sensing Image Super-Resolution
    Guo, Yongde
    Gong, Chengying
    Yan, Jun
    REMOTE SENSING, 2024, 16 (11)
  • [25] A lightweight distillation CNN-transformer architecture for remote sensing image super-resolution
    Wang, Yu
    Shao, Zhenfeng
    Lu, Tao
    Liu, Lifeng
    Huang, Xiao
    Wang, Jiaming
    Jiang, Kui
    Zeng, Kangli
    INTERNATIONAL JOURNAL OF DIGITAL EARTH, 2023, 16 (01) : 3560 - 3579
  • [26] Cross-Scale Residual Network: A General Framework for Image Super-Resolution, Denoising, and Deblocking
    Zhou, Yuan
    Du, Xiaoting
    Wang, Mingfei
    Huo, Shuwei
    Zhang, Yeda
    Kung, Sun-Yuan
    IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (07) : 5855 - 5867
  • [27] CFDN: cross-scale feature distillation network for lightweight single image super-resolution
    Mu, Zihan
    Zhu, Ge
    Tang, Jinping
    MULTIMEDIA SYSTEMS, 2024, 30 (06)
  • [28] High-order cross-scale attention network for single image super-resolution
    Li, Tao
    Dong, Xiucheng
    Luo, Songning
    Fan, Zhiwei
    DIGITAL SIGNAL PROCESSING, 2022, 129
  • [29] GCPAN: an adaptive global cross-scale prior attention network for image super-resolution
    Mingzhu Shi
    Siqi Kong
    Bin Zao
    Muxian Tan
    Neural Computing and Applications, 2023, 35 : 17671 - 17688
  • [30] GCPAN: an adaptive global cross-scale prior attention network for image super-resolution
    Shi, Mingzhu
    Kong, Siqi
    Zao, Bin
    Tan, Muxian
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (24): : 17671 - 17688