Semisupervised Semantic Segmentation of Remote Sensing Images With Consistency Self-Training

被引：33

作者：

Li, Jiahao ^{[1
,2
]}

Sun, Bin ^{[1
,2
]}

Li, Shutao ^{[1
,2
]}

Kang, Xudong ^{[1
,2
]}

机构：

[1] Hunan Univ, Key Lab Visual Percept & Artificial Intelligence, Changsha 410082, Peoples R China

[2] Hunan Univ, Coll Elect & Informat Engn, Changsha 410082, Peoples R China

来源：

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2022年 / 60卷

关键词：

Semantics; Predictive models; Image segmentation; Generative adversarial networks; Perturbation methods; Training; Semisupervised learning; Consistency self-training; generative adversarial network (GAN); remote sensing (RS) image; semantic segmentation; semisupervised learning; FEATURE-EXTRACTION; DEEP; NETWORK;

D O I：

10.1109/TGRS.2021.3134277

中图分类号：

P3 [地球物理学]; P59 [地球化学];

学科分类号：

0708 ; 070902 ;

摘要：

Semisupervised semantic segmentation is an effective way to reduce the expensive manual annotation cost and take advantage of the unlabeled data for remote sensing (RS) image interpretation. Recent related research has mainly adopted two strategies: self-training and consistency regularization. Self-training tries to acquire accurate pseudo-labels to explicitly expand the train set. However, the existing methods cannot accurately identify false pseudo-labels, suffering from their negative impact on model optimization. The consistency regularization constrains the model by producing consistent predictions robust to the perturbations introduced in the sample or feature domain but requires a sufficient number of training data. Therefore, we propose a strategy for the semisupervised semantic segmentation of the RS images. The proposed model in the generative adversarial network (GAN) framework is optimized by consistency self-training, learning the distributions of both labeled and unlabeled data. The discriminator is optimized by accurate pixel-level training labels instead of the image-level ones, thereby assessing the confidence for the prediction of each pixel, which is then used to reweight the loss of the unlabeled data in self-training. The generator is optimized with the consistency constraint with respect to all random perturbations on the unlabeled data, which increases the sample diversity and prompts the model to learn the underlying distribution of the unlabeled data. Experimental results on the the large-scale and densely annotated Instance Segmentation in Aerial Images Dataset (iSAID) datasets and the International Society for Photogrammetry and Remote Sensing (ISPRS) datasets show that our framework outperforms several state-of-the-art semisupervised semantic segmentation methods.

引用

页数：11

共 53 条

[1] [Anonymous], 2018, REALISTIC EVALUATION
[2] [Anonymous], 2015, PROC CVPR IEEE, DOI DOI 10.1109/CVPR.2015.7298965
[3] Chapelle O., 2005, P 10 INT WORKSH ART, P57
[4] Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation
Chen, Liang-Chieh
Zhu, Yukun
Papandreou, George
Schroff, Florian
Adam, Hartwig
[J]. COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 : 833 - 851
[5] Chen LB, 2017, IEEE INT SYMP NANO, P1, DOI 10.1109/NANOARCH.2017.8053709
[6] Deep, Big, Simple Neural Nets for Handwritten Digit Recognition
Ciresan, Dan Claudiu
Meier, Ueli
Gambardella, Luca Maria
Schmidhuber, Juergen
[J]. NEURAL COMPUTATION, 2010, 22 (12) : 3207 - 3220
[7] The Cityscapes Dataset for Semantic Urban Scene Understanding
Cordts, Marius
Omran, Mohamed
Ramos, Sebastian
Rehfeld, Timo
Enzweiler, Markus
Benenson, Rodrigo
Franke, Uwe
Roth, Stefan
Schiele, Bernt
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3213 - 3223
[8] AutoAugment: Learning Augmentation Strategies from Data
Cubuk, Ekin D.
Zoph, Barret
Mane, Dandelion
Vasudevan, Vijay
Le, Quoc V.
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 113 - 123
[9] Feng Zhengyang, 2020, ARXIV200408514
[10] French G., 2019, Semi-supervised semantic segmentation needs strong, high-dimensional perturbations

← 1 2 3 4 5 6 →