MSAN: Multiscale self-attention network for pansharpening

被引:3
作者
Lu, Hangyuan [1 ,2 ]
Yang, Yong [3 ]
Huang, Shuying [4 ]
Liu, Rixian [1 ,2 ]
Guo, Huimin [5 ]
机构
[1] Jinhua Univ Vocat Technol, Coll Informat Engn, Jinhua 321007, Peoples R China
[2] Jinhua Univ Vocat Technol, Key Lab Crop Harvesting Equipment Technol Zhejiang, Jinhua 321007, Peoples R China
[3] Tiangong Univ, Sch Comp Sci & Technol, Tianjin 300387, Peoples R China
[4] Tiangong Univ, Sch Software, Tianjin 300387, Peoples R China
[5] Xiamen Univ, Sch Informat, Xiamen 361005, Peoples R China
基金
中国国家自然科学基金;
关键词
Pansharpening; Multiscale; Self-attention; Swin Transformer; FUSION; IMAGES;
D O I
10.1016/j.patcog.2025.111441
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Effective extraction of spectral-spatial features from multispectral (MS) and panchromatic (PAN) images is critical for high-quality pansharpening. However, existing deep learning methods often overlook local misalignment and struggle to integrate local and long-range features effectively, resulting in spectral and spatial distortions. To address these challenges, this paper proposes a refined detail injection model that adaptively learns injection coefficients using long-range features. Building upon this model, a multiscale self-attention network (MSAN) is proposed, consisting of a feature extraction branch and a self-attention mechanism branch. In the former branch, a two-stage multiscale convolution network is designed to fully extract detail features with multiple receptive fields. In the latter branch, a streamlined Swin Transformer (SST) is proposed to efficiently generate multiscale self-attention maps by learning the correlation between local and long-range features. To better preserve spectral-spatial information, a revised Swin Transformer block is proposed by incorporating spectral and spatial attention within the block. The obtained self-attention maps from SST serve as the injection coefficients to refine the extracted details, which are then injected into the upsampled MS image to produce the final fused image. Experimental validation demonstrates the superiority of MSAN over traditional and state-of-the-art methods, with competitive efficiency. The code of this work will be released on GitHub once the paper is accepted.
引用
收藏
页数:17
相关论文
共 39 条
[31]   VONet: An Adaptive Approach Using Variational Optimization and Deep Learning for Panchromatic Sharpening [J].
Wu, Zhong-Cheng ;
Huang, Ting-Zhu ;
Deng, Liang-Jian ;
Hu, Jin-Fan ;
Vivone, Gemine .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[32]   A New Context-Aware Details Injection Fidelity With Adaptive Coefficients Estimation for Variational Pansharpening [J].
Xiao, Jin-Liang ;
Huang, Ting-Zhu ;
Deng, Liang-Jian ;
Wu, Zhong-Cheng ;
Vivone, Gemine .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[33]   PanNet: A deep network architecture for pan-sharpening [J].
Yang, Junfeng ;
Fu, Xueyang ;
Hu, Yuwen ;
Huang, Yue ;
Ding, Xinghao ;
Paisley, John .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :1753-1761
[34]   Pansharpening Based on Variational Fractional-Order Geometry Model and Optimized Injection Gains [J].
Yang, Yong ;
Lu, Hangyuan ;
Huang, Shuying ;
Wan, Weiguo ;
Li, Luyi .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2022, 15 :2128-2141
[35]   Dual-Stream Convolutional Neural Network With Residual Information Enhancement for Pansharpening [J].
Yang, Yong ;
Tu, Wei ;
Huang, Shuying ;
Lu, Hangyuan ;
Wan, Weiguo ;
Gan, Lixin .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[36]   Pansharpening Based on Joint-Guided Detail Extraction [J].
Yang, Yong ;
Lu, Hangyuan ;
Huang, Shuying ;
Tu, Wei .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2021, 14 :389-401
[37]   An efficient and high-quality pansharpening model based on conditional random fields [J].
Yang, Yong ;
Lu, Hangyuan ;
Huang, Shuying ;
Fang, Yuming ;
Tu, Wei .
INFORMATION SCIENCES, 2021, 553 :1-18
[38]   A Triple-Double Convolutional Neural Network for Panchromatic Sharpening [J].
Zhang, Tian-Jiang ;
Deng, Liang-Jian ;
Huang, Ting-Zhu ;
Chanussot, Jocelyn ;
Vivone, Gemine .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (11) :9088-9101
[39]   A GIHS-based spectral preservation fusion method for remote sensing images using edge restored spectral modulation [J].
Zhou, Xiran ;
Liu, Jun ;
Liu, Shuguang ;
Cao, Lei ;
Zhou, Qiming ;
Huang, Huawen .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2014, 88 :16-27