Remote sensing image super-resolution via cross-scale hierarchical transformer

被引:8
|
作者
Xiao, Yi [1 ]
Yuan, Qiangqiang [1 ]
He, Jiang [1 ]
Zhang, Liangpei [2 ]
机构
[1] Wuhan Univ, Sch Geodesy & Geomat, Wuhan, Peoples R China
[2] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & R, Wuhan, Peoples R China
基金
中国国家自然科学基金;
关键词
Super-resolution; transformer; cross-scale; hierarchical attention; remote sensing; OBJECT DETECTION; NETWORK; RESOLUTION;
D O I
10.1080/10095020.2023.2288179
中图分类号
TP7 [遥感技术];
学科分类号
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
摘要
Global and local modeling is essential for image super-resolution tasks. However, current efforts often lack explicit consideration of the cross-scale knowledge in large-scale earth observation scenarios, resulting in suboptimal single-scale representations in global and local modeling. The key motivation of this work is inspired by two observations: 1) There exists hierarchical features at the local and global regions in remote sensing images, and 2) they exhibit scale variation of similar ground objects (e.g. cross-scale similarity). In light of these, this paper presents an effective method to grasp the global and local image hierarchies by systematically exploring the cross-scale correlation. Specifically, we developed a Cross-scale Self-Attention (CSA) to model the global features, which introduces an auxiliary token space to calculate cross-scale self-attention matrices, thus exploring global dependency from diverse token scales. To extract the cross-scale localities, a Cross-scale Channel Attention (CCA) is devised, where multi-scale features are explored and progressively incorporated into an enriched feature. Moreover, by hierarchically deploying CSA and CCA into transformer groups, the proposed Cross-scale Hierarchical Transformer (CHT) can effectively explore cross-scale representations in remote sensing images, leading to a favorable reconstruction performance. Comprehensive experiments and analysis on four remote sensing datasets have demonstrated the superiority of CHT in both simulated and real-world remote sensing scenes. In particular, our CHT outperforms the state-of-the-art approach (TransENet) in terms of PSNR by 0.11dB on average, but only accounts for 54.8% of its parameters.
引用
收藏
页码:1914 / 1930
页数:17
相关论文
共 50 条
  • [1] Remote sensing image super-resolution via cross-scale hierarchical transformer
    Xiao, Yi
    Yuan, Qiangqiang
    He, Jiang
    Zhang, Liangpei
    GEO-SPATIAL INFORMATION SCIENCE, 2024, 27 (06) : 1914 - 1930
  • [2] Hybrid-Scale Hierarchical Transformer for Remote Sensing Image Super-Resolution
    Shang, Jianrun
    Gao, Mingliang
    Li, Qilei
    Pan, Jinfeng
    Zou, Guofeng
    Jeon, Gwanggil
    REMOTE SENSING, 2023, 15 (13)
  • [3] Efficient Swin Transformer for Remote Sensing Image Super-Resolution
    Kang, Xudong
    Duan, Puhong
    Li, Jier
    Li, Shutao
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 6367 - 6379
  • [4] Scale-Aware Backprojection Transformer for Single Remote Sensing Image Super-Resolution
    Hao, Jinglei
    Li, Wukai
    Lu, Yuting
    Jin, Yang
    Zhao, Yongqiang
    Wang, Shunzhou
    Wang, Binglu
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [5] Multiple Hierarchical Cross-Scale Transformer for Remote Sensing Scene Classification
    Zhang, Dan
    Ma, Wenping
    Jiao, Licheng
    Liu, Xu
    Yang, Yuting
    Liu, Fang
    REMOTE SENSING, 2025, 17 (01)
  • [6] A spectral and spatial transformer for hyperspectral remote sensing image super-resolution
    Wang, Bingqian
    Chen, Jianhua
    Wang, Huajun
    Tang, Yipeng
    Chen, Jiongling
    Jiang, Ye
    INTERNATIONAL JOURNAL OF DIGITAL EARTH, 2024, 17 (01)
  • [7] 3D CROSS-SCALE FEATURE TRANSFORMER NETWORK FOR BRAIN MR IMAGE SUPER-RESOLUTION
    Zhang, Wanqi
    Wang, Lulu
    Chen, Wei
    Jia, Yuanyuan
    He, Zhongshi
    Du, Jinglong
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 1356 - 1360
  • [8] CSINet: A Cross-Scale Interaction Network for Lightweight Image Super-Resolution
    Ke, Gang
    Lo, Sio-Long
    Zou, Hua
    Liu, Yi-Feng
    Chen, Zhen-Qiang
    Wang, Jing-Kai
    SENSORS, 2024, 24 (04)
  • [9] Learning Multi-Modal Cross-Scale Deformable Transformer Network for Unregistered Hyperspectral Image Super-resolution
    Dong, Wenqian
    Xu, Yang
    Qu, Jiahui
    Hou, Shaoxiong
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 2, 2024, : 1573 - 1581
  • [10] Super-resolution Reconstruction of Remote Sensing Image Based on Transformer of Multi-scale Feature Fusion
    Wang, Zhi
    Wang, Kun
    Wang, Meng-Qing
    Dongbei Daxue Xuebao/Journal of Northeastern University, 2024, 45 (08): : 1178 - 1184