MIMO-SST: Multi-Input Multi-Output Spatial-Spectral Transformer for Hyperspectral and Multispectral Image Fusion

被引:23
作者
Fang, Jian [1 ]
Yang, Jingxiang [1 ]
Khader, Abdolraheem [1 ]
Xiao, Liang [1 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2024年 / 62卷
基金
中国国家自然科学基金;
关键词
Hyperspectral image (HSI) fusion; multi-input multi-output; Transformer; QUALITY; NETWORK;
D O I
10.1109/TGRS.2024.3361553
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
The current advanced hyperspectral super-resolution methods utilize convolutional neural networks (CNNs) that are either deeper or wider. These networks are designed to acquire end-to-end mapping capability, facilitating the transformation from low-resolution hyperspectral images (LR-HSIs) and high-resolution multispectral images (HR-MSIs) to high-resolution HSIs (HR-HSIs). The existing methods lack the capability to capture details and structures in the image effectively, while multi-input multi-output methods can address this issue efficiently. Therefore, this article proposes a novel network architecture named multi-input multi-output spatial-spectral transformer (MIMO-SST). To apply the multi-input multi-output methods in HSI fusion, specifically integrating the spatial-spectral information of LR-HSI and HR-MSI, we introduce multihead feature map attention, multihead feature channel attention, and a multiscale convolutional gated feedforward network, constructing the proposed mixture spatial-spectral Transformer. Moreover, to enhance the expressive power of image edges and recover the sharpened structure details, this study incorporates a novel wavelet-based high-frequency loss into the ultimate comprehensive loss, with the objective of refining the reconstruction of high-frequency details. Experimental studies on three simulated datasets and one real-world dataset demonstrate that the proposed method in this study outperforms contemporary state-of-the-art methods in terms of performance. It is noteworthy that our method exhibits a 0.85-dB improvement in terms of the peak signal-to-noise ratio (PSNR) metric on the Columbia computer vision laboratory (CAVE) dataset compared to state-of-the-art methods. Our code is publicly available at https://github.com/Freelancefangjian/MIMO-SST.
引用
收藏
页码:1 / 20
页数:20
相关论文
共 71 条
[11]   PSRT: Pyramid Shuffle-and-Reshuffle Transformer for Multispectral and Hyperspectral Image Fusion [J].
Deng, Shang-Qi ;
Deng, Liang-Jian ;
Wu, Xiao ;
Ran, Ran ;
Hong, Danfeng ;
Vivone, Gemine .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
[12]   Recent advances and new guidelines on hyperspectral and multispectral image fusion [J].
Dian, Renwei ;
Li, Shutao ;
Sun, Bin ;
Guo, Anjing .
INFORMATION FUSION, 2021, 69 :40-51
[13]   Regularizing Hyperspectral and Multispectral Image Fusion by CNN Denoiser [J].
Dian, Renwei ;
Li, Shutao ;
Kang, Xudong .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (03) :1124-1135
[14]   Learning a Low Tensor-Train Rank Representation for Hyperspectral Image Super-Resolution [J].
Dian, Renwei ;
Li, Shutao ;
Fang, Leyuan .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2019, 30 (09) :2672-2683
[15]   Deep Hyperspectral Image Sharpening [J].
Dian, Renwei ;
Li, Shutao ;
Guo, Anjing ;
Fang, Leyuan .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (11) :5345-5355
[16]   Hyperspectral image super-resolution via non-local sparse tensor factorization [J].
Dian, Renwei ;
Fang, Leyuan ;
Li, Shutao .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :3862-3871
[17]   Model-Guided Deep Hyperspectral Image Super-Resolution [J].
Dong, Weisheng ;
Zhou, Chen ;
Wu, Fangfang ;
Wu, Jinjian ;
Shi, Guangming ;
Li, Xin .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 (30) :5754-5768
[18]   Hyperspectral Image Super-Resolution via Non-Negative Structured Sparse Representation [J].
Dong, Weisheng ;
Fu, Fazuo ;
Shi, Guangming ;
Cao, Xun ;
Wu, Jinjian ;
Li, Guangyu ;
Li, Xin .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (05) :2337-2352
[19]  
Dosovitskiy A, 2021, Arxiv, DOI [arXiv:2010.11929, DOI 10.48550/ARXIV.2010.11929]
[20]   Advances in Hyperspectral Image and Signal Processing A comprehensive overview of the state of the art [J].
Ghamisi, Pedram ;
Yokoya, Naoto ;
Li, Jun ;
Liao, Wenzhi ;
Liu, Sicong ;
Plaza, Javier ;
Rasti, Behnood ;
Plaza, Antonio .
IEEE GEOSCIENCE AND REMOTE SENSING MAGAZINE, 2017, 5 (04) :37-78