A Unified Generative Adversarial Network With Convolution and Transformer for Remote Sensing Image Fusion

被引:1
作者
Wu, Yuanyuan [1 ,2 ]
Huang, Mengxing [1 ,3 ]
机构
[1] Hainan Univ, Sch Informat & Commun Engn, Haikou 570228, Peoples R China
[2] Guangdong Ocean Univ, Sch Elect & Informat Engn, Zhanjiang 524088, Peoples R China
[3] Hainan Univ, State Key Lab Marine Resource Utilizat South China, Haikou 570228, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2024年 / 62卷
基金
中国国家自然科学基金;
关键词
Spatial resolution; Image resolution; Transformers; Generative adversarial networks; Biological system modeling; Pansharpening; Data models; Bidirectional local-global feature encoder; convolution and Transformer; multihead cross-attention fusion; multiresolution convolutional Transformer discriminators; remote sensing image (RSI) unified fusion model; SATELLITE IMAGES; LANDSAT; QUALITY; REFLECTANCE; FRAMEWORK; MODEL; MS;
D O I
10.1109/TGRS.2024.3441719
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Images derived from an individual sensor fail to simultaneously satisfy the demands of high spatial, spectral, and temporal resolutions. Multisource remote sensing image (RSI) fusion provides efficient access to high-spatial-resolution multispectral (HRMS) images [spatial-spectral fusion (SSF)] and high temporal- and spatial-resolution images [spatiotemporal fusion (STF)]. While existing deep learning (DL)-based models can mainly implement either SSF or STF, there is an urgent need for models that can simultaneously implement both SSF and STF. A unified generative adversarial network with convolution and Transformer (CTUGAN) for SSF and STF is proposed. CTUGAN contains a adaptive convolutional Transformer generator (ACTG) and multiresolution convolutional Transformer discriminator (MCTD), both with the convolution and Transformer. First, a bidirectional local-global feature encoder is devised in the ACTG to extract local-global features via a high-to-low resolution and a low-to-high resolution. Then, a multihead cross-attention fusion decoder (MCAFD) is devised to aggregate and fuse complementary local-global features of various levels and resolutions hierarchically to restore valuable information. Moreover, MCTDs adversely learn multiresolution local-global features to identify the relative reality of products, and a generalized loss function is built to accomplish full supervision. Finally, numerous experiments on the SSF data (Gaofen-2 (GF-2) and QuikBird) and STF data [Coleambally Irrigation Area (CIA) and lower Gwydir catchment (LGC)] demonstrate that the proposed CTUGAN model outperforms both subjective and objective evaluations.
引用
收藏
页数:22
相关论文
共 50 条
[41]   Convolution Transformer Fusion Splicing Network for Hyperspectral Image Classification [J].
Zhao, Feng ;
Li, Shijie ;
Zhang, Junjie ;
Liu, Hanqiang .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
[42]   Convolution Transformer Fusion Splicing Network for Hyperspectral Image Classification [J].
Zhao, Feng ;
Li, Shijie ;
Zhang, Junjie ;
Liu, Hanqiang .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
[43]   Single-Image Super-Resolution for Remote Sensing Images Using a Deep Generative Adversarial Network With Local and Global Attention Mechanisms [J].
Li, Yadong ;
Mavromatis, Sebastien ;
Zhang, Feng ;
Du, Zhenhong ;
Sequeira, Jean ;
Wang, Zhongyi ;
Zhao, Xianwei ;
Liu, Renyi .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[44]   An Unsupervised Multi-scale Generative Adversarial Network for Remote Sensing Image Pan-Sharpening [J].
Wang, Yajie ;
Xie, Yanyan ;
Wu, Yanyan ;
Liang, Kai ;
Qiao, Jilin .
MULTIMEDIA MODELING, MMM 2022, PT II, 2022, 13142 :356-368
[45]   Visual Haze Removal by a Unified Generative Adversarial Network [J].
Pang, Yanwei ;
Xie, Jin ;
Li, Xuelong .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (11) :3211-3221
[46]   Symmetrical lattice generative adversarial network for remote sensing images compression [J].
Zhao, Shihui ;
Yang, Shuyuan ;
Gu, Jing ;
Liu, Zhi ;
Feng, Zhixi .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2021, 176 :169-181
[47]   Remote sensing image fusion via compressive sensing [J].
Ghahremani, Morteza ;
Liu, Yonghuai ;
Yuen, Peter ;
Behera, Ardhendu .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2019, 152 :34-48
[48]   Attention-Based Multistage Fusion Network for Remote Sensing Image Pansharpening [J].
Zhang, Wanwan ;
Li, Jinjiang ;
Hua, Zhen .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[49]   Remote Sensing Image Fusion Based on Two-Stream Fusion Network [J].
Liu, Xiangyu ;
Wang, Yunhong ;
Liu, Qingjie .
MULTIMEDIA MODELING, MMM 2018, PT I, 2018, 10704 :428-439
[50]   PSTAF-GAN: Progressive Spatio-Temporal Attention Fusion Method Based on Generative Adversarial Network [J].
Liu, Qiang ;
Meng, Xiangchao ;
Shao, Feng ;
Li, Shutao .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60