Semisupervised Semantic Segmentation of Remote Sensing Images With Consistency Self-Training

被引:33
作者
Li, Jiahao [1 ,2 ]
Sun, Bin [1 ,2 ]
Li, Shutao [1 ,2 ]
Kang, Xudong [1 ,2 ]
机构
[1] Hunan Univ, Key Lab Visual Percept & Artificial Intelligence, Changsha 410082, Peoples R China
[2] Hunan Univ, Coll Elect & Informat Engn, Changsha 410082, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2022年 / 60卷
关键词
Semantics; Predictive models; Image segmentation; Generative adversarial networks; Perturbation methods; Training; Semisupervised learning; Consistency self-training; generative adversarial network (GAN); remote sensing (RS) image; semantic segmentation; semisupervised learning; FEATURE-EXTRACTION; DEEP; NETWORK;
D O I
10.1109/TGRS.2021.3134277
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Semisupervised semantic segmentation is an effective way to reduce the expensive manual annotation cost and take advantage of the unlabeled data for remote sensing (RS) image interpretation. Recent related research has mainly adopted two strategies: self-training and consistency regularization. Self-training tries to acquire accurate pseudo-labels to explicitly expand the train set. However, the existing methods cannot accurately identify false pseudo-labels, suffering from their negative impact on model optimization. The consistency regularization constrains the model by producing consistent predictions robust to the perturbations introduced in the sample or feature domain but requires a sufficient number of training data. Therefore, we propose a strategy for the semisupervised semantic segmentation of the RS images. The proposed model in the generative adversarial network (GAN) framework is optimized by consistency self-training, learning the distributions of both labeled and unlabeled data. The discriminator is optimized by accurate pixel-level training labels instead of the image-level ones, thereby assessing the confidence for the prediction of each pixel, which is then used to reweight the loss of the unlabeled data in self-training. The generator is optimized with the consistency constraint with respect to all random perturbations on the unlabeled data, which increases the sample diversity and prompts the model to learn the underlying distribution of the unlabeled data. Experimental results on the the large-scale and densely annotated Instance Segmentation in Aerial Images Dataset (iSAID) datasets and the International Society for Photogrammetry and Remote Sensing (ISPRS) datasets show that our framework outperforms several state-of-the-art semisupervised semantic segmentation methods.
引用
收藏
页数:11
相关论文
共 53 条
  • [1] [Anonymous], 2018, REALISTIC EVALUATION
  • [2] [Anonymous], 2015, PROC CVPR IEEE, DOI DOI 10.1109/CVPR.2015.7298965
  • [3] Chapelle O., 2005, P 10 INT WORKSH ART, P57
  • [4] Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation
    Chen, Liang-Chieh
    Zhu, Yukun
    Papandreou, George
    Schroff, Florian
    Adam, Hartwig
    [J]. COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 : 833 - 851
  • [5] Chen LB, 2017, IEEE INT SYMP NANO, P1, DOI 10.1109/NANOARCH.2017.8053709
  • [6] Deep, Big, Simple Neural Nets for Handwritten Digit Recognition
    Ciresan, Dan Claudiu
    Meier, Ueli
    Gambardella, Luca Maria
    Schmidhuber, Juergen
    [J]. NEURAL COMPUTATION, 2010, 22 (12) : 3207 - 3220
  • [7] The Cityscapes Dataset for Semantic Urban Scene Understanding
    Cordts, Marius
    Omran, Mohamed
    Ramos, Sebastian
    Rehfeld, Timo
    Enzweiler, Markus
    Benenson, Rodrigo
    Franke, Uwe
    Roth, Stefan
    Schiele, Bernt
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3213 - 3223
  • [8] AutoAugment: Learning Augmentation Strategies from Data
    Cubuk, Ekin D.
    Zoph, Barret
    Mane, Dandelion
    Vasudevan, Vijay
    Le, Quoc V.
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 113 - 123
  • [9] Feng Zhengyang, 2020, ARXIV200408514
  • [10] French G., 2019, Semi-supervised semantic segmentation needs strong, high-dimensional perturbations