Remote sensing image super-resolution via cross-scale hierarchical transformer

被引:12
作者
Xiao, Yi [1 ]
Yuan, Qiangqiang [1 ]
He, Jiang [1 ]
Zhang, Liangpei [2 ]
机构
[1] Wuhan Univ, Sch Geodesy & Geomat, Wuhan, Peoples R China
[2] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & R, Wuhan, Peoples R China
基金
中国国家自然科学基金;
关键词
Super-resolution; transformer; cross-scale; hierarchical attention; remote sensing; OBJECT DETECTION; NETWORK; RESOLUTION;
D O I
10.1080/10095020.2023.2288179
中图分类号
TP7 [遥感技术];
学科分类号
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
摘要
Global and local modeling is essential for image super-resolution tasks. However, current efforts often lack explicit consideration of the cross-scale knowledge in large-scale earth observation scenarios, resulting in suboptimal single-scale representations in global and local modeling. The key motivation of this work is inspired by two observations: 1) There exists hierarchical features at the local and global regions in remote sensing images, and 2) they exhibit scale variation of similar ground objects (e.g. cross-scale similarity). In light of these, this paper presents an effective method to grasp the global and local image hierarchies by systematically exploring the cross-scale correlation. Specifically, we developed a Cross-scale Self-Attention (CSA) to model the global features, which introduces an auxiliary token space to calculate cross-scale self-attention matrices, thus exploring global dependency from diverse token scales. To extract the cross-scale localities, a Cross-scale Channel Attention (CCA) is devised, where multi-scale features are explored and progressively incorporated into an enriched feature. Moreover, by hierarchically deploying CSA and CCA into transformer groups, the proposed Cross-scale Hierarchical Transformer (CHT) can effectively explore cross-scale representations in remote sensing images, leading to a favorable reconstruction performance. Comprehensive experiments and analysis on four remote sensing datasets have demonstrated the superiority of CHT in both simulated and real-world remote sensing scenes. In particular, our CHT outperforms the state-of-the-art approach (TransENet) in terms of PSNR by 0.11dB on average, but only accounts for 54.8% of its parameters.
引用
收藏
页码:1914 / 1930
页数:17
相关论文
共 81 条
[51]   SWCGAN: Generative Adversarial Network Combining Swin Transformer and CNN for Remote Sensing Image Super-Resolution [J].
Tu, Jingzhi ;
Mei, Gang ;
Ma, Zhengjing ;
Piccialli, Francesco .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2022, 15 :5662-5673
[52]   STCD: efficient Siamese transformers-based change detection method for remote sensing images [J].
Wang, Decheng ;
Chen, Xiangning ;
Guo, Ningbo ;
Yi, Hui ;
Li, Yinan .
GEO-SPATIAL INFORMATION SCIENCE, 2024, 27 (04) :1192-1211
[53]   UNetFormer: A UNet-like transformer for efficient semantic segmentation of remote sensing urban scene imagery [J].
Wang, Libo ;
Li, Rui ;
Zhang, Ce ;
Fang, Shenghui ;
Duan, Chenxi ;
Meng, Xiaoliang ;
Atkinson, Peter M. .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2022, 190 :196-214
[54]  
Wang X.Q., 2018, P 8 INT C COMPUTATIO, P10
[55]   Global spatiotemporal completion of daily high-resolution TCCO from TROPOMI over land using a swath-based local ensemble learning method [J].
Wang, Yuan ;
Yuan, Qiangqiang ;
Zhou, Siqin ;
Zhang, Liangpei .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2022, 194 :167-180
[56]   FeNet: Feature Enhancement Network for Lightweight Remote-Sensing Image Super-Resolution [J].
Wang, Zheyuan ;
Li, Liangliang ;
Xue, Yuan ;
Jiang, Chenchen ;
Wang, Jiawen ;
Sun, Kaipeng ;
Ma, Hongbing .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[57]   UIU-Net: U-Net in U-Net for Infrared Small Object Detection [J].
Wu, Xin ;
Hong, Danfeng ;
Chanussot, Jocelyn .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 :364-376
[58]   Convolutional Neural Networks for Multimodal Remote Sensing Data Classification [J].
Wu, Xin ;
Hong, Danfeng ;
Chanussot, Jocelyn .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[59]   DOTA: A Large-scale Dataset for Object Detection in Aerial Images [J].
Xia, Gui-Song ;
Bai, Xiang ;
Ding, Jian ;
Zhu, Zhen ;
Belongie, Serge ;
Luo, Jiebo ;
Datcu, Mihai ;
Pelillo, Marcello ;
Zhang, Liangpei .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :3974-3983
[60]   AID: A Benchmark Data Set for Performance Evaluation of Aerial Scene Classification [J].
Xia, Gui-Song ;
Hu, Jingwen ;
Hu, Fan ;
Shi, Baoguang ;
Bai, Xiang ;
Zhong, Yanfei ;
Zhang, Liangpei ;
Lu, Xiaoqiang .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2017, 55 (07) :3965-3981