Transformer-based dual path cross fusion for pansharpening remote sensing images

被引:0
作者
Li, Zixu [1 ,2 ]
Li, Jinjiang [2 ,3 ]
Ren, Lu [1 ,2 ]
Chen, Zheng [2 ,3 ,4 ]
机构
[1] Shandong Technol & Business Univ, Sch Informat & Elect Engn, Yantai, Peoples R China
[2] Coinnovat Ctr Shandong Coll & Univ Future Intellig, Yantai, Peoples R China
[3] Shandong Technol & Business Univ, Sch Comp Sci & Technol, Yantai, Peoples R China
[4] Shandong Technol & Business Univ, Sch Comp Sci & Technol, 191 Binhai Middle Rd, Yantai 264005, Shandong, Peoples R China
基金
中国国家自然科学基金;
关键词
Pansharpening; convolutional neural network; transformer; multispectral image; panchromatic image; PAN-SHARPENING METHOD; ALGORITHM;
D O I
10.1080/01431161.2024.2306153
中图分类号
TP7 [遥感技术];
学科分类号
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
摘要
The purpose of pan-sharpening is to generate high-resolution multispectral (HRMS) images by combining multispectral images (MS) and panchromatic images (PAN), which have become an important part of remote sensing image processing. Therefore, how to better extract complete feature information from MS images and PAN images has become the focus of our attention. In this paper, we propose a new pansharpening method, called Transformer-based Dual-path cross fusion network (TDF) for Pan-sharpening remote sensing images, which aims to extract the spatial details of PAN images while maintaining the spectral fidelity of MS images. The whole network structure can be divided into two parts: in the encoder part, we adopt the Swin-Transformer module for the downsampling operation, which expands the sensory field of the network to the feature map, and then extracts the global information. However, since the Swin-Transformer module is not good at extracting pixel-level details, we introduce the Base Feature Extraction (BFE) and the Invertible Neural Network Block (INNB) modules for the interaction between the local and the global feature information. we also introduce the Edge-Enhancement Block (EEB) to further enhance the feature extraction at multi-scales during the image fusion process. In the decoder section, we once again employ the Swin Transformer module for downsampling. After the convolution operation and activation function, we utilize the Sub-Pixel Convolutional Neural Network for upsampling to generate the ultimate high-resolution multispectral images. Simulation experiments and real experiments are conducted on QuickBird (QB) and WorldView2 (WV2) datasets, which demonstrated our method are superior to the current methods.
引用
收藏
页码:1170 / 1200
页数:31
相关论文
共 59 条
[1]   An MTF-based spectral distortion minimizing model for pan-sharpening of very high resolution multispectral images of urban areas [J].
Aiazzi, B ;
Alparone, L ;
Baronti, S ;
Garzelli, A ;
Selva, M .
2ND GRSS/ISPRS JOINT WORKSHOP ON REMOTE SENSING AND DATA FUSION OVER URBAN AREAS, 2003, :90-94
[2]   Improving component substitution pansharpening through multivariate regression of MS plus Pan data [J].
Aiazzi, Bruno ;
Baronti, Stefano ;
Selva, Massimo .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2007, 45 (10) :3230-3239
[3]  
Alparone L, 2003, INT GEOSCI REMOTE SE, P458
[4]   A variational model for P+XS image fusion [J].
Ballester, Coloma ;
Caselles, Vicent ;
Igual, Laura ;
Verdera, Joan ;
Rougé, Bernard .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2006, 69 (01) :43-58
[5]  
CARPER WJ, 1990, PHOTOGRAMM ENG REM S, V56, P459
[6]  
CHAVEZ PS, 1991, PHOTOGRAMM ENG REM S, V57, P295
[7]   Image Fusion with Local Spectral Consistency and Dynamic Gradient Sparsity [J].
Chen, Chen ;
Li, Yeqing ;
Liu, Wei ;
Huang, Junzhou .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :2760-2765
[8]   A variational method for multisource remote-sensing image fusion [J].
Fang, Faming ;
Li, Fang ;
Zhang, Guixu ;
Shen, Chaomin .
INTERNATIONAL JOURNAL OF REMOTE SENSING, 2013, 34 (07) :2470-2486
[9]   Convolutional Sparse Representation of Injected Details for Pansharpening [J].
Fei, Rongrong ;
Zhang, Jiangshe ;
Liu, Junmin ;
Du, Fang ;
Chang, Peiju ;
Hu, Junying .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2019, 16 (10) :1595-1599
[10]   Optimal MMSE pan sharpening of very high resolution multispectral images [J].
Garzelli, Andrea ;
Nencini, Filippo ;
Capobianco, Luca .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2008, 46 (01) :228-236