Consistency-Regularized Region-Growing Network for Semantic Segmentation of Urban Scenes With Point-Level Annotations

被引:33
作者
Xu, Yonghao [1 ]
Ghamisi, Pedram [1 ,2 ]
机构
[1] Inst Adv Res Artificial Intelligence IARAI, A-1030 Vienna, Austria
[2] Helmholtz Inst Freiberg Resource Technol, Machine Learning Grp, Helmholtz Zentrum Dresden Rossendorf, D-09599 Freiberg, Germany
关键词
Annotations; Image segmentation; Semantics; Training; Remote sensing; Knowledge transfer; Predictive models; Semantic segmentation; very high-resolution (VHR) images; weakly supervised learning; sparse annotation; convolutional neural network (CNN); remote sensing; HYPERSPECTRAL IMAGE CLASSIFICATION; FULLY CONVOLUTIONAL NETWORKS; DATA FUSION; FRAMEWORK; SPARSE;
D O I
10.1109/TIP.2022.3189825
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning algorithms have obtained great success in semantic segmentation of very high-resolution (VHR) remote sensing images. Nevertheless, training these models generally requires a large amount of accurate pixel-wise annotations, which is very laborious and time-consuming to collect. To reduce the annotation burden, this paper proposes a consistency-regularized region-growing network (CRGNet) to achieve semantic segmentation of VHR remote sensing images with point-level annotations. The key idea of CRGNet is to iteratively select unlabeled pixels with high confidence to expand the annotated area from the original sparse points. However, since there may exist some errors and noises in the expanded annotations, directly learning from them may mislead the training of the network. To this end, we further propose the consistency regularization strategy, where a base classifier and an expanded classifier are employed. Specifically, the base classifier is supervised by the original sparse annotations, while the expanded classifier aims to learn from the expanded annotations generated by the base classifier with the region-growing mechanism. The consistency regularization is thereby achieved by minimizing the discrepancy between the predictions from both the base and the expanded classifiers. We find such a simple regularization strategy is yet very useful to control the quality of the region-growing mechanism. Extensive experiments on two benchmark datasets demonstrate that the proposed CRGNet significantly outperforms the existing state-of-the-art methods. Codes and pre-trained models are available online (https://github.com/YonghaoXu/CRGNet).
引用
收藏
页码:5038 / 5051
页数:14
相关论文
共 54 条
  • [1] SEEDED REGION GROWING
    ADAMS, R
    BISCHOF, L
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1994, 16 (06) : 641 - 647
  • [2] The Lovasz-Softmax loss: A tractable surrogate for the optimization of the intersection-over-union measure in neural networks
    Berman, Maxim
    Triki, Amal Rannen
    Blaschko, Matthew B.
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 4413 - 4421
  • [3] Symmetrical Dense-Shortcut Deep Fully Convolutional Networks for Semantic Segmentation of Very-High-Resolution Remote Sensing Images
    Chen, Guanzhou
    Zhang, Xiaodong
    Wang, Qing
    Dai, Fan
    Gong, Yuanfu
    Zhu, Kun
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2018, 11 (05) : 1633 - 1644
  • [4] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs
    Chen, Liang-Chieh
    Papandreou, George
    Kokkinos, Iasonas
    Murphy, Kevin
    Yuille, Alan L.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) : 834 - 848
  • [5] Semi-Supervised Semantic Segmentation with Cross Pseudo Supervision
    Chen, Xiaokang
    Yuan, Yuhui
    Zeng, Gang
    Wang, Jingdong
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 2613 - 2622
  • [6] The DGPF-Test on Digital Airborne Camera Evaluation - Overview and Test Design
    Cramer, Michael
    [J]. PHOTOGRAMMETRIE FERNERKUNDUNG GEOINFORMATION, 2010, (02): : 73 - 82
  • [7] Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
  • [8] ResUNet-a: A deep learning framework for semantic segmentation of remotely sensed data
    Diakogiannis, Foivos, I
    Waldner, Francois
    Caccetta, Peter
    Wu, Chen
    [J]. ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2020, 162 (162) : 94 - 114
  • [9] SSF-DAN: Separated Semantic Feature based Domain Adaptation Network for Semantic Segmentation
    Du, Liang
    Tan, Jingang
    Yang, Hongye
    Feng, Jianfeng
    Xue, Xiangyang
    Zheng, Qibao
    Ye, Xiaoqing
    Zhang, Xiaolin
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 982 - 991
  • [10] Multisource and Multitemporal Data Fusion in Remote Sensing A comprehensive review of the state of the art
    Ghamisi, Pedram
    Rasti, Behnood
    Yokoya, Naoto
    Wang, Qunming
    Hoefle, Bernhard
    Bruzzone, Lorenzo
    Bovolo, Francesca
    Chi, Mingmin
    Anders, Katharina
    Gloaguen, Richard
    Atkinson, Peter M.
    Benediktsson, Jon Atli
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING MAGAZINE, 2019, 7 (01) : 6 - 39