Mutiscale Hybrid Attention Transformer for Remote Sensing Image Pansharpening

被引:22
作者
Zhu, Wengang [1 ]
Li, Jinjiang [1 ]
An, Zhiyong [1 ]
Hua, Zhen [2 ]
机构
[1] Shandong Technol & Business Univ, Sch Comp Sci & Technol, Yantai 264005, Peoples R China
[2] Shandong Technol & Business Univ, Sch Informat & Elect Engn, Yantai 264005, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2023年 / 61卷
基金
中国国家自然科学基金;
关键词
Convolutional block attention module (CBAM); convolutional neural network (CNN); deep semantic statistics matching (D2SM) loss; hybrid attention mechanism; pansharpening; Transformer; PAN-SHARPENING METHOD; FUSION; QUALITY; MODEL; MS;
D O I
10.1109/TGRS.2023.3239013
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Pansharpening methods play a crucial role for remote sensing image processing. The existing pansharpening methods, in general, have the problems of spectral distortion and lack of spatial detail information. To mitigate these problems, we propose a multiscale hybrid attention Transformer pansharpening network (MHATP-Net). In the proposed network, the shallow feature (SF) is first acquired through an SF extraction module (SFEM), which contains the convolutional block attention module (CBAM) and dynamic convolution blocks. The CBAM in this module can filter initial information roughly, and the dynamic convolution blocks can enrich the SF information. Then, the multiscale Transformer module is used to obtain multiencoding feature images. We propose a hybrid attention module (HAM) in the multiscale feature recovery module to effectively address the balance between the spectral feature retention and the spatial feature recovery. In the training process, we use deep semantic statistics matching (D2SM) loss to optimize the output model. We have conducted extensive experiments on several known datasets, and the results show that this article has good performance compared with other state of the art (SOTA) methods.
引用
收藏
页数:16
相关论文
共 80 条
[1]   MTF-tailored multiscale fusion of high-resolution MS and pan imagery [J].
Aiazzi, B. ;
Alparone, L. ;
Baronti, S. ;
Garzelli, A. ;
Selva, M. .
PHOTOGRAMMETRIC ENGINEERING AND REMOTE SENSING, 2006, 72 (05) :591-596
[2]   Improving component substitution pansharpening through multivariate regression of MS plus Pan data [J].
Aiazzi, Bruno ;
Baronti, Stefano ;
Selva, Massimo .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2007, 45 (10) :3230-3239
[3]   Albedo Retrieval From Multispectral Landsat 8 Observation in Urban Environment: Algorithm Validation by in situ Measurements [J].
Baldinelli, Giorgio ;
Bonafoni, Stefania ;
Rotili, Antonella .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2017, 10 (10) :4504-4511
[4]   A variational model for P+XS image fusion [J].
Ballester, Coloma ;
Caselles, Vicent ;
Igual, Laura ;
Verdera, Joan ;
Rougé, Bernard .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2006, 69 (01) :43-58
[5]   Super-Resolution-Guided Progressive Pansharpening Based on a Deep Convolutional Neural Network [J].
Cai, Jiajun ;
Huang, Bo .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (06) :5206-5220
[6]   End-to-End Object Detection with Transformers [J].
Carion, Nicolas ;
Massa, Francisco ;
Synnaeve, Gabriel ;
Usunier, Nicolas ;
Kirillov, Alexander ;
Zagoruyko, Sergey .
COMPUTER VISION - ECCV 2020, PT I, 2020, 12346 :213-229
[7]  
CARPER WJ, 1990, PHOTOGRAMM ENG REM S, V56, P459
[8]  
CHAVEZ PS, 1989, PHOTOGRAMM ENG REM S, V55, P339
[9]  
CHAVEZ PS, 1991, PHOTOGRAMM ENG REM S, V57, P295
[10]   Pre-Trained Image Processing Transformer [J].
Chen, Hanting ;
Wang, Yunhe ;
Guo, Tianyu ;
Xu, Chang ;
Deng, Yiping ;
Liu, Zhenhua ;
Ma, Siwei ;
Xu, Chunjing ;
Xu, Chao ;
Gao, Wen .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :12294-12305