Learning deep semantic segmentation network under multiple weakly-supervised constraints for cross-domain remote sensing image semantic segmentation

被引：183

作者：

Li, Yansheng ^{[1
]}

Shi, Te ^{[1
]}

Zhang, Yongjun ^{[1
]}

Chen, Wei ^{[1
]}

Wang, Zhibin ^{[2
]}

Li, Hao ^{[2
]}

机构：

[1] Wuhan Univ, Sch Remote Sensing & Informat Engn, Wuhan, Hubei, Peoples R China

[2] Alibaba Grp, Hangzhou, Peoples R China

来源：

ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING | 2021年 / 175卷

基金：

中国国家自然科学基金; 中国博士后科学基金;

关键词：

Cross-domain remote sensing (RS) image semantic segmentation; Weakly-supervised transfer invariant constraint (WTIC); Weakly-supervised pseudo-label constraint (WPLC); Weakly-supervised rotation consistency constraint (WRCC); DualGAN; Dynamic optimization strategy; LAND-COVER; CLASSIFICATION;

D O I：

10.1016/j.isprsjprs.2021.02.009

中图分类号：

P9 [自然地理学];

学科分类号：

0705 ; 070501 ;

摘要：

Due to its wide applications, remote sensing (RS) image semantic segmentation has attracted increasing research interest in recent years. Benefiting from its hierarchical abstract ability, the deep semantic segmentation network (DSSN) has achieved tremendous success on RS image semantic segmentation and has gradually become the mainstream technology. However, the superior performance of DSSN highly depends on two conditions: (I) massive quantities of labeled training data exist; (II) the testing data seriously resemble the training data. In actual RS applications, it is difficult to fully meet these conditions due to the RS sensor variation and the distinct landscape variation in different geographic locations. To make DSSN fit the actual RS scenario, this paper exploits the cross-domain RS image semantic segmentation task, which means that DSSN is trained on one labeled dataset (i.e., the source domain) but is tested on another varied dataset (i.e., the target domain). In this setting, the performance of DSSN is inevitably very limited due to the data shift between the source and target domains. To reduce the disadvantageous influence of data shift, this paper proposes a novel objective function with multiple weakly-supervised constraints to learn DSSN for cross-domain RS image semantic segmentation. Through carefully examining the characteristics of cross-domain RS image semantic segmentation, multiple weakly-supervised constraints include the weakly-supervised transfer invariant constraint (WTIC), weakly-supervised pseudo-label constraint (WPLC) and weakly-supervised rotation consistency constraint (WRCC). Specifically, DualGAN is recommended to conduct unsupervised style transfer between the source and target domains to carry out WTIC. To make full use of the merits of multiple constraints, this paper presents a dynamic optimization strategy that dynamically adjusts the constraint weights of the objective function during the training process. With full consideration of the characteristics of the cross-domain RS image semantic segmentation task, this paper gives two cross-domain RS image semantic segmentation settings: (I) variation in geographic location and (II) variation in both geographic location and imaging mode. Extensive experiments demonstrate that our proposed method remarkably outperforms the state-of-the-art methods under both of these settings. The collected datasets and evaluation benchmarks have been made publicly available online (htt ps://github.com/te-shi/MUCSS).

引用

页码：20 / 33

页数：14

共 54 条

[1] Unsupervised Domain Adaptation Using Generative Adversarial Networks for Semantic Segmentation of Aerial Images [J].

Benjdira, Bilel ;

Bazi, Yakoub ;

Koubaa, Anis ;

Ouni, Kais .

REMOTE SENSING, 2019, 11 (11)

[2] A multilevel context-based system for classification of very high spatial resolution images [J].

Bruzzone, Lorenzo ;

Carlin, Lorenzo .

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2006, 44 (09) :2587-2600

[3] Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation [J].

Chen, Liang-Chieh ;

Zhu, Yukun ;

Papandreou, George ;

Schroff, Florian ;

Adam, Hartwig .

COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :833-851

[4] No More Discrimination: Cross City Adaptation of Road Scene Segmenters [J].

Chen, Yi-Hsin ;

Chen, Wei-Yu ;

Chen, Yu-Ting ;

Tsai, Bo-Cheng ;

Wang, Yu-Chiang Frank ;

Sun, Min .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :2011-2020

[5] Big Data for Remote Sensing: Challenges and Opportunities [J].

Chi, Mingmin ;

Plaza, Antonio ;

Benediktsson, Jon Atli ;

Sun, Zhongyi ;

Shen, Jinsheng ;

Zhu, Yangyong .

PROCEEDINGS OF THE IEEE, 2016, 104 (11) :2207-2219

[6] The Cityscapes Dataset for Semantic Urban Scene Understanding [J].

Cordts, Marius ;

Omran, Mohamed ;

Ramos, Sebastian ;

Rehfeld, Timo ;

Enzweiler, Markus ;

Benenson, Rodrigo ;

Franke, Uwe ;

Roth, Stefan ;

Schiele, Bernt .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3213-3223

[7]

Gerke M., 2014, ResearcheGate, DOI [DOI 10.13140/2.1.5015.9683, 10.13140/2.1.5015.9683]

[8]

Goodfellow IJ, 2014, ADV NEUR IN, V27, P2672

[9] Image analogies [J].

Hertzmann, A ;

Jacobs, CE ;

Oliver, N ;

Curless, B ;

Salesin, DH .

SIGGRAPH 2001 CONFERENCE PROCEEDINGS, 2001, :327-340

[10]

Hoffman J., 2018, P 35 INT C MACH LEAR, P1994, DOI DOI 10.48550/ARXIV.1711.03213

← 1 2 3 4 5 6 →