Remote Sensing Image Synthesis via Semantic Embedding Generative Adversarial Networks

被引:9
作者
Wang, Chendan [1 ,2 ]
Chen, Bowen [1 ,2 ]
Zou, Zhengxia [3 ,4 ]
Shi, Zhenwei [1 ,2 ]
机构
[1] Beihang Univ, Beijing Key Lab Digital Media, Beijing 100191, Peoples R China
[2] Beihang Univ, State Key Lab Virtual Real Technol & Syst, Beijing 100191, Peoples R China
[3] Beihang Univ, Sch Astronaut, Dept Guidance Nav & Control, Beijing 100191, Peoples R China
[4] Shanghai Artificial Intelligence Lab, Shanghai 200232, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2023年 / 61卷
基金
中国国家自然科学基金;
关键词
Semantics; Remote sensing; Image synthesis; Generators; Training; Image segmentation; Visualization; Generative adversarial networks; image segmentation; remote sensing images; semantic image synthesis; GAN; AUGMENTATION;
D O I
10.1109/TGRS.2023.3279663
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Generating photo-realistic remote sensing images conditioned on semantic masks has many practical applications like image editing, detecting deep fake geography, and data augmentation. Although previous methods achieved high-quality synthesis results for natural images like faces and everyday objects, they still underperform in remote sensing scenarios in terms of both visual fidelity and diversity. The high data imbalance and high semantic similarity of remote-sensing object categories make the semantic synthesis of remote sensing images more challenging than natural images. To tackle these challenges, we propose a novel method named conducted semantic embedding GAN (CSEBGAN) for semantic-controllable remote sensing image synthesis. The proposed method decouples different semantic classes into independent semantic embeddings, which explores the regularities between classes to improve visual fidelity and naturally supports semantic-level. We further introduce a novel tripartite cooperation adversarial training scheme that involves a conductor network to provide fine-grained semantic feedback for the generator. We also show that the proposed semantic image synthesis method can be utilized as an effective data augmentation approach on improving the performance of the downstream remote sensing image segmentation tasks. Extensive experiments show the superiority of our method compared with the state-of-the-art image synthesis methods.
引用
收藏
页数:11
相关论文
共 46 条
[1]   MFC-GAN: Class-imbalanced dataset classification using Multiple Fake Class Generative Adversarial Network [J].
Ali-Gombe, Adamu ;
Elyan, Eyad .
NEUROCOMPUTING, 2019, 361 :212-221
[2]  
Antoniou A, 2018, Arxiv, DOI arXiv:1711.04340
[3]  
Bochkovskiy A, 2020, Arxiv, DOI arXiv:2004.10934
[4]   GAN-based synthetic medical image augmentation for increased CNN performance in liver lesion classification [J].
Frid-Adar, Maayan ;
Diamant, Idit ;
Klang, Eyal ;
Amitai, Michal ;
Goldberger, Jacob ;
Greenspan, Hayit .
NEUROCOMPUTING, 2018, 321 :321-331
[5]   Generative Adversarial Networks [J].
Goodfellow, Ian ;
Pouget-Abadie, Jean ;
Mirza, Mehdi ;
Xu, Bing ;
Warde-Farley, David ;
Ozair, Sherjil ;
Courville, Aaron ;
Bengio, Yoshua .
COMMUNICATIONS OF THE ACM, 2020, 63 (11) :139-144
[6]  
Tanaka FHKD, 2019, Arxiv, DOI arXiv:1904.09135
[7]  
Hensel M, 2017, ADV NEUR IN, V30
[8]  
Ioffe Sergey, 2015, International conference on machine learning, V37, P448, DOI DOI 10.48550/ARXIV.1502.03167
[9]   Image-to-Image Translation with Conditional Adversarial Networks [J].
Isola, Phillip ;
Zhu, Jun-Yan ;
Zhou, Tinghui ;
Efros, Alexei A. .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :5967-5976
[10]  
Jizhong Zhu, 2017, 2017 IEEE Conference on Energy Internet and Energy System Integration (EI2). Proceedings, DOI 10.1109/EI2.2017.8245683