ResiDualGAN: Resize-Residual DualGAN for Cross-Domain Remote Sensing Images Semantic Segmentation

被引：34

作者：

Zhao, Yang ^{[1
,2
]}

Guo, Peng ^{[1
,2
]}

Sun, Zihao ^{[1
,2
]}

Chen, Xiuwan ^{[1
,2
]}

Gao, Han ^{[1
,2
]}

机构：

[1] Peking Univ, Inst Remote Sensing & Geog Informat Syst, Beijing 100871, Peoples R China

[2] Deyang Inst Smart Agr DISA, TaiShan North Rd 290, Deyang 618099, Peoples R China

来源：

REMOTE SENSING | 2023年 / 15卷 / 05期

关键词：

ResiDualGAN; UDA; remote sensing; semantic segmentation; ADAPTATION; NETWORK;

D O I：

10.3390/rs15051428

中图分类号：

X [环境科学、安全科学];

学科分类号：

08 ; 0830 ;

摘要：

The performance of a semantic segmentation model for remote sensing (RS) images pre-trained on an annotated dataset greatly decreases when testing on another unannotated dataset because of the domain gap. Adversarial generative methods, e.g., DualGAN, are utilized for unpaired image-to-image translation to minimize the pixel-level domain gap, which is one of the common approaches for unsupervised domain adaptation (UDA). However, the existing image translation methods face two problems when performing RS image translation: (1) ignoring the scale discrepancy between two RS datasets, which greatly affects the accuracy performance of scale-invariant objects; (2) ignoring the characteristic of real-to-real translation of RS images, which brings an unstable factor for the training of the models. In this paper, ResiDualGAN is proposed for RS image translation, where an in-network resizer module is used for addressing the scale discrepancy of RS datasets and a residual connection is used for strengthening the stability of real-to-real images translation and improving the performance in cross-domain semantic segmentation tasks. Combined with an output space adaptation method, the proposed method greatly improves the accuracy performance on common benchmarks, which demonstrates the superiority and reliability of ResiDualGAN. At the end of the paper, a thorough discussion is conducted to provide a reasonable explanation for the improvement of ResiDualGAN. Our source code is also available.

引用

页数：20

共 53 条

[1]

[Anonymous], 2013, PROC 30 INT C MACH L

[2]

Arjovsky M, 2017, PR MACH LEARN RES, V70

[3] Domain Adaptation for Remote Sensing Image Semantic Segmentation: An Integrated Approach of Contrastive Learning and Adversarial Learning [J].

Bai, Lubin ;

Du, Shihong ;

Zhang, Xiuyuan ;

Wang, Haoyu ;

Liu, Bo ;

Ouyang, Song .

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60

[4] Unsupervised Domain Adaptation Using Generative Adversarial Networks for Semantic Segmentation of Aerial Images [J].

Benjdira, Bilel ;

Bazi, Yakoub ;

Koubaa, Anis ;

Ouni, Kais .

REMOTE SENSING, 2019, 11 (11)

[5] Unsupervised Pixel-Level Domain Adaptation with Generative Adversarial Networks [J].

Bousmalis, Konstantinos ;

Silberman, Nathan ;

Dohan, David ;

Erhan, Dumitru ;

Krishnan, Dilip .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :95-104

[6]

Chaurasia A, 2017, 2017 IEEE VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP)

[7] Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation [J].

Chen, Liang-Chieh ;

Zhu, Yukun ;

Papandreou, George ;

Schroff, Florian ;

Adam, Hartwig .

COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :833-851

[8]

Chen LB, 2017, IEEE INT SYMP NANO, P1, DOI 10.1109/NANOARCH.2017.8053709

[9] The Cityscapes Dataset for Semantic Urban Scene Understanding [J].

Cordts, Marius ;

Omran, Mohamed ;

Ramos, Sebastian ;

Rehfeld, Timo ;

Enzweiler, Markus ;

Benenson, Rodrigo ;

Franke, Uwe ;

Roth, Stefan ;

Schiele, Bernt .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3213-3223

[10]

Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848

← 1 2 3 4 5 6 →