A Unified Generative Adversarial Network With Convolution and Transformer for Remote Sensing Image Fusion

被引：1

作者：

Wu, Yuanyuan ^{[1
,2
]}

Huang, Mengxing ^{[1
,3
]}

机构：

[1] Hainan Univ, Sch Informat & Commun Engn, Haikou 570228, Peoples R China

[2] Guangdong Ocean Univ, Sch Elect & Informat Engn, Zhanjiang 524088, Peoples R China

[3] Hainan Univ, State Key Lab Marine Resource Utilizat South China, Haikou 570228, Peoples R China

来源：

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2024年 / 62卷

基金：

中国国家自然科学基金;

关键词：

Spatial resolution; Image resolution; Transformers; Generative adversarial networks; Biological system modeling; Pansharpening; Data models; Bidirectional local-global feature encoder; convolution and Transformer; multihead cross-attention fusion; multiresolution convolutional Transformer discriminators; remote sensing image (RSI) unified fusion model; SATELLITE IMAGES; LANDSAT; QUALITY; REFLECTANCE; FRAMEWORK; MODEL; MS;

D O I：

10.1109/TGRS.2024.3441719

中图分类号：

P3 [地球物理学]; P59 [地球化学];

学科分类号：

0708 ; 070902 ;

摘要：

Images derived from an individual sensor fail to simultaneously satisfy the demands of high spatial, spectral, and temporal resolutions. Multisource remote sensing image (RSI) fusion provides efficient access to high-spatial-resolution multispectral (HRMS) images [spatial-spectral fusion (SSF)] and high temporal- and spatial-resolution images [spatiotemporal fusion (STF)]. While existing deep learning (DL)-based models can mainly implement either SSF or STF, there is an urgent need for models that can simultaneously implement both SSF and STF. A unified generative adversarial network with convolution and Transformer (CTUGAN) for SSF and STF is proposed. CTUGAN contains a adaptive convolutional Transformer generator (ACTG) and multiresolution convolutional Transformer discriminator (MCTD), both with the convolution and Transformer. First, a bidirectional local-global feature encoder is devised in the ACTG to extract local-global features via a high-to-low resolution and a low-to-high resolution. Then, a multihead cross-attention fusion decoder (MCAFD) is devised to aggregate and fuse complementary local-global features of various levels and resolutions hierarchically to restore valuable information. Moreover, MCTDs adversely learn multiresolution local-global features to identify the relative reality of products, and a generalized loss function is built to accomplish full supervision. Finally, numerous experiments on the SSF data (Gaofen-2 (GF-2) and QuikBird) and STF data [Coleambally Irrigation Area (CIA) and lower Gwydir catchment (LGC)] demonstrate that the proposed CTUGAN model outperforms both subjective and objective evaluations.

引用

页数：22

共 50 条

[41] Convolution Transformer Fusion Splicing Network for Hyperspectral Image Classification [J].

Zhao, Feng ;

Li, Shijie ;

Zhang, Junjie ;

Liu, Hanqiang .

IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20

[42] Convolution Transformer Fusion Splicing Network for Hyperspectral Image Classification [J].

Zhao, Feng ;

Li, Shijie ;

Zhang, Junjie ;

Liu, Hanqiang .

IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20

[43] Single-Image Super-Resolution for Remote Sensing Images Using a Deep Generative Adversarial Network With Local and Global Attention Mechanisms [J].

Li, Yadong ;

Mavromatis, Sebastien ;

Zhang, Feng ;

Du, Zhenhong ;

Sequeira, Jean ;

Wang, Zhongyi ;

Zhao, Xianwei ;

Liu, Renyi .

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60

[44] An Unsupervised Multi-scale Generative Adversarial Network for Remote Sensing Image Pan-Sharpening [J].

Wang, Yajie ;

Xie, Yanyan ;

Wu, Yanyan ;

Liang, Kai ;

Qiao, Jilin .

MULTIMEDIA MODELING, MMM 2022, PT II, 2022, 13142 :356-368

[45] Visual Haze Removal by a Unified Generative Adversarial Network [J].

Pang, Yanwei ;

Xie, Jin ;

Li, Xuelong .

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (11) :3211-3221

[46] Symmetrical lattice generative adversarial network for remote sensing images compression [J].

Zhao, Shihui ;

Yang, Shuyuan ;

Gu, Jing ;

Liu, Zhi ;

Feng, Zhixi .

ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2021, 176 :169-181

[47] Remote sensing image fusion via compressive sensing [J].

Ghahremani, Morteza ;

Liu, Yonghuai ;

Yuen, Peter ;

Behera, Ardhendu .

ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2019, 152 :34-48

[48] Attention-Based Multistage Fusion Network for Remote Sensing Image Pansharpening [J].

Zhang, Wanwan ;

Li, Jinjiang ;

Hua, Zhen .

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60

[49] Remote Sensing Image Fusion Based on Two-Stream Fusion Network [J].

Liu, Xiangyu ;

Wang, Yunhong ;

Liu, Qingjie .

MULTIMEDIA MODELING, MMM 2018, PT I, 2018, 10704 :428-439

[50] PSTAF-GAN: Progressive Spatio-Temporal Attention Fusion Method Based on Generative Adversarial Network [J].

Liu, Qiang ;

Meng, Xiangchao ;

Shao, Feng ;

Li, Shutao .

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60

← 1 2 3 4 5 →