MIMO-SST: Multi-Input Multi-Output Spatial-Spectral Transformer for Hyperspectral and Multispectral Image Fusion

被引:23
作者
Fang, Jian [1 ]
Yang, Jingxiang [1 ]
Khader, Abdolraheem [1 ]
Xiao, Liang [1 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2024年 / 62卷
基金
中国国家自然科学基金;
关键词
Hyperspectral image (HSI) fusion; multi-input multi-output; Transformer; QUALITY; NETWORK;
D O I
10.1109/TGRS.2024.3361553
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
The current advanced hyperspectral super-resolution methods utilize convolutional neural networks (CNNs) that are either deeper or wider. These networks are designed to acquire end-to-end mapping capability, facilitating the transformation from low-resolution hyperspectral images (LR-HSIs) and high-resolution multispectral images (HR-MSIs) to high-resolution HSIs (HR-HSIs). The existing methods lack the capability to capture details and structures in the image effectively, while multi-input multi-output methods can address this issue efficiently. Therefore, this article proposes a novel network architecture named multi-input multi-output spatial-spectral transformer (MIMO-SST). To apply the multi-input multi-output methods in HSI fusion, specifically integrating the spatial-spectral information of LR-HSI and HR-MSI, we introduce multihead feature map attention, multihead feature channel attention, and a multiscale convolutional gated feedforward network, constructing the proposed mixture spatial-spectral Transformer. Moreover, to enhance the expressive power of image edges and recover the sharpened structure details, this study incorporates a novel wavelet-based high-frequency loss into the ultimate comprehensive loss, with the objective of refining the reconstruction of high-frequency details. Experimental studies on three simulated datasets and one real-world dataset demonstrate that the proposed method in this study outperforms contemporary state-of-the-art methods in terms of performance. It is noteworthy that our method exhibits a 0.85-dB improvement in terms of the peak signal-to-noise ratio (PSNR) metric on the Columbia computer vision laboratory (CAVE) dataset compared to state-of-the-art methods. Our code is publicly available at https://github.com/Freelancefangjian/MIMO-SST.
引用
收藏
页码:1 / 20
页数:20
相关论文
共 71 条
[1]  
Akhtar N, 2015, PROC CVPR IEEE, P3631, DOI 10.1109/CVPR.2015.7298986
[2]   Attention Augmented Convolutional Networks [J].
Bello, Irwan ;
Zoph, Barret ;
Vaswani, Ashish ;
Shlens, Jonathon ;
Le, Quoc V. .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :3285-3294
[3]   Hyperspectral Remote Sensing Data Analysis and Future Challenges [J].
Bioucas-Dias, Jose M. ;
Plaza, Antonio ;
Camps-Valls, Gustavo ;
Scheunders, Paul ;
Nasrabadi, Nasser M. ;
Chanussot, Jocelyn .
IEEE GEOSCIENCE AND REMOTE SENSING MAGAZINE, 2013, 1 (02) :6-36
[4]   Super-Resolution for Hyperspectral and Multispectral Image Fusion Accounting for Seasonal Spectral Variability [J].
Borsoi, Ricardo Augusto ;
Imbiriba, Tales ;
Moreira Bermudez, Jose Carlos .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 :116-127
[5]   HSI-DeNet: Hyperspectral Image Restoration via Convolutional Neural Network [J].
Chang, Yi ;
Yan, Luxin ;
Fang, Houzhang ;
Zhong, Sheng ;
Liao, Wenshan .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2019, 57 (02) :667-682
[6]   Pre-Trained Image Processing Transformer [J].
Chen, Hanting ;
Wang, Yunhe ;
Guo, Tianyu ;
Xu, Chang ;
Deng, Yiping ;
Liu, Zhenhua ;
Ma, Siwei ;
Xu, Chunjing ;
Xu, Chao ;
Gao, Wen .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :12294-12305
[7]   Combining Low-Rank and Deep Plug-and-Play Priors for Snapshot Compressive Imaging [J].
Chen, Yong ;
Gui, Xinfeng ;
Zeng, Jinshan ;
Zhao, Xi-Le ;
He, Wei .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (11) :16396-16408
[8]   Hyperspectral and Multispectral Image Fusion Using Factor Smoothed Tensor Ring Decomposition [J].
Chen, Yong ;
Zeng, Jinshan ;
He, Wei ;
Zhao, Xi-Le ;
Huang, Ting-Zhu .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[9]   Rethinking Coarse-to-Fine Approach in Single Image Deblurring [J].
Cho, Sung-Jin ;
Ji, Seo-Won ;
Hong, Jun-Pyo ;
Jung, Seung-Won ;
Ko, Sung-Jea .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :4621-4630
[10]   Deep Metric Learning-Based Feature Embedding for Hyperspectral Image Classification [J].
Deng, Bin ;
Jia, Sen ;
Shi, Daming .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2020, 58 (02) :1422-1435