MSAN: Multiscale self-attention network for pansharpening

被引:3
作者
Lu, Hangyuan [1 ,2 ]
Yang, Yong [3 ]
Huang, Shuying [4 ]
Liu, Rixian [1 ,2 ]
Guo, Huimin [5 ]
机构
[1] Jinhua Univ Vocat Technol, Coll Informat Engn, Jinhua 321007, Peoples R China
[2] Jinhua Univ Vocat Technol, Key Lab Crop Harvesting Equipment Technol Zhejiang, Jinhua 321007, Peoples R China
[3] Tiangong Univ, Sch Comp Sci & Technol, Tianjin 300387, Peoples R China
[4] Tiangong Univ, Sch Software, Tianjin 300387, Peoples R China
[5] Xiamen Univ, Sch Informat, Xiamen 361005, Peoples R China
基金
中国国家自然科学基金;
关键词
Pansharpening; Multiscale; Self-attention; Swin Transformer; FUSION; IMAGES;
D O I
10.1016/j.patcog.2025.111441
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Effective extraction of spectral-spatial features from multispectral (MS) and panchromatic (PAN) images is critical for high-quality pansharpening. However, existing deep learning methods often overlook local misalignment and struggle to integrate local and long-range features effectively, resulting in spectral and spatial distortions. To address these challenges, this paper proposes a refined detail injection model that adaptively learns injection coefficients using long-range features. Building upon this model, a multiscale self-attention network (MSAN) is proposed, consisting of a feature extraction branch and a self-attention mechanism branch. In the former branch, a two-stage multiscale convolution network is designed to fully extract detail features with multiple receptive fields. In the latter branch, a streamlined Swin Transformer (SST) is proposed to efficiently generate multiscale self-attention maps by learning the correlation between local and long-range features. To better preserve spectral-spatial information, a revised Swin Transformer block is proposed by incorporating spectral and spatial attention within the block. The obtained self-attention maps from SST serve as the injection coefficients to refine the extracted details, which are then injected into the upsampled MS image to produce the final fused image. Experimental validation demonstrates the superiority of MSAN over traditional and state-of-the-art methods, with competitive efficiency. The code of this work will be released on GitHub once the paper is accepted.
引用
收藏
页数:17
相关论文
共 39 条
[1]   Context-driven fusion of high spatial and spectral resolution images based on oversampled multiresolution analysis [J].
Aiazzi, B ;
Alparone, L ;
Baronti, S ;
Garzelli, A .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2002, 40 (10) :2300-2312
[2]   Improving component substitution pansharpening through multivariate regression of MS plus Pan data [J].
Aiazzi, Bruno ;
Baronti, Stefano ;
Selva, Massimo .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2007, 45 (10) :3230-3239
[3]   HyperTransformer: A Textural and Spectral Feature Fusion Transformer for Pansharpening [J].
Bandara, Wele Gedara Chaminda ;
Patel, Vishal M. .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, :1757-1767
[4]   SIRF: Simultaneous Satellite Image Registration and Fusion in a Unified Framework [J].
Chen, Chen ;
Li, Yeqing ;
Liu, Wei ;
Huang, Junzhou .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2015, 24 (11) :4213-4224
[5]   Machine Learning in Pansharpening: A Benchmark, From Shallow to Deep Networks [J].
Deng, Liang-Jian ;
Vivone, Gemine ;
Paoletti, Mercedes ;
Scarpa, Giuseppe ;
He, Jiang ;
Zhang, Yongjun ;
Chanussot, Jocelyn ;
Plaza, Antonio J. .
IEEE GEOSCIENCE AND REMOTE SENSING MAGAZINE, 2022, 10 (03) :279-315
[6]   Detail Injection-Based Deep Convolutional Neural Networks for Pansharpening [J].
Deng, Liang-Jian ;
Vivone, Gemine ;
Jin, Cheng ;
Chanussot, Jocelyn .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (08) :6995-7010
[7]   Cross-modality attentive feature fusion for object detection in multispectral remote sensing imagery [J].
Fang Qingyun ;
Wang Zhaokui .
PATTERN RECOGNITION, 2022, 130
[8]   Optimal MMSE pan sharpening of very high resolution multispectral images [J].
Garzelli, Andrea ;
Nencini, Filippo ;
Capobianco, Luca .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2008, 46 (01) :228-236
[9]   Self-Supervised Interactive Dual-Stream Network for Pansharpening [J].
Guo, Qing ;
Jia, He ;
Yang, Shengsang .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2023, 16 :9928-9943
[10]  
He XH, 2024, Arxiv, DOI [arXiv:2402.12192, DOI 10.1016/J.INFFUS.2024.102779]