Consistency-Regularized Region-Growing Network for Semantic Segmentation of Urban Scenes With Point-Level Annotations

被引：33

作者：

Xu, Yonghao ^{[1
]}

Ghamisi, Pedram ^{[1
,2
]}

机构：

[1] Inst Adv Res Artificial Intelligence IARAI, A-1030 Vienna, Austria

[2] Helmholtz Inst Freiberg Resource Technol, Machine Learning Grp, Helmholtz Zentrum Dresden Rossendorf, D-09599 Freiberg, Germany

来源：

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2022年 / 31卷

关键词：

Annotations; Image segmentation; Semantics; Training; Remote sensing; Knowledge transfer; Predictive models; Semantic segmentation; very high-resolution (VHR) images; weakly supervised learning; sparse annotation; convolutional neural network (CNN); remote sensing; HYPERSPECTRAL IMAGE CLASSIFICATION; FULLY CONVOLUTIONAL NETWORKS; DATA FUSION; FRAMEWORK; SPARSE;

D O I：

10.1109/TIP.2022.3189825

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Deep learning algorithms have obtained great success in semantic segmentation of very high-resolution (VHR) remote sensing images. Nevertheless, training these models generally requires a large amount of accurate pixel-wise annotations, which is very laborious and time-consuming to collect. To reduce the annotation burden, this paper proposes a consistency-regularized region-growing network (CRGNet) to achieve semantic segmentation of VHR remote sensing images with point-level annotations. The key idea of CRGNet is to iteratively select unlabeled pixels with high confidence to expand the annotated area from the original sparse points. However, since there may exist some errors and noises in the expanded annotations, directly learning from them may mislead the training of the network. To this end, we further propose the consistency regularization strategy, where a base classifier and an expanded classifier are employed. Specifically, the base classifier is supervised by the original sparse annotations, while the expanded classifier aims to learn from the expanded annotations generated by the base classifier with the region-growing mechanism. The consistency regularization is thereby achieved by minimizing the discrepancy between the predictions from both the base and the expanded classifiers. We find such a simple regularization strategy is yet very useful to control the quality of the region-growing mechanism. Extensive experiments on two benchmark datasets demonstrate that the proposed CRGNet significantly outperforms the existing state-of-the-art methods. Codes and pre-trained models are available online (https://github.com/YonghaoXu/CRGNet).

引用

页码：5038 / 5051

页数：14

共 54 条

[1] SEEDED REGION GROWING
ADAMS, R
BISCHOF, L
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1994, 16 (06) : 641 - 647
[2] The Lovasz-Softmax loss: A tractable surrogate for the optimization of the intersection-over-union measure in neural networks
Berman, Maxim
Triki, Amal Rannen
Blaschko, Matthew B.
[J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 4413 - 4421
[3] Symmetrical Dense-Shortcut Deep Fully Convolutional Networks for Semantic Segmentation of Very-High-Resolution Remote Sensing Images
Chen, Guanzhou
Zhang, Xiaodong
Wang, Qing
Dai, Fan
Gong, Yuanfu
Zhu, Kun
[J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2018, 11 (05) : 1633 - 1644
[4] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs
Chen, Liang-Chieh
Papandreou, George
Kokkinos, Iasonas
Murphy, Kevin
Yuille, Alan L.
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) : 834 - 848
[5] Semi-Supervised Semantic Segmentation with Cross Pseudo Supervision
Chen, Xiaokang
Yuan, Yuhui
Zeng, Gang
Wang, Jingdong
[J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 2613 - 2622
[6] The DGPF-Test on Digital Airborne Camera Evaluation - Overview and Test Design
Cramer, Michael
[J]. PHOTOGRAMMETRIE FERNERKUNDUNG GEOINFORMATION, 2010, (02): : 73 - 82
[7] Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[8] ResUNet-a: A deep learning framework for semantic segmentation of remotely sensed data
Diakogiannis, Foivos, I
Waldner, Francois
Caccetta, Peter
Wu, Chen
[J]. ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2020, 162 (162) : 94 - 114
[9] SSF-DAN: Separated Semantic Feature based Domain Adaptation Network for Semantic Segmentation
Du, Liang
Tan, Jingang
Yang, Hongye
Feng, Jianfeng
Xue, Xiangyang
Zheng, Qibao
Ye, Xiaoqing
Zhang, Xiaolin
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 982 - 991
[10] Multisource and Multitemporal Data Fusion in Remote Sensing A comprehensive review of the state of the art
Ghamisi, Pedram
Rasti, Behnood
Yokoya, Naoto
Wang, Qunming
Hoefle, Bernhard
Bruzzone, Lorenzo
Bovolo, Francesca
Chi, Mingmin
Anders, Katharina
Gloaguen, Richard
Atkinson, Peter M.
Benediktsson, Jon Atli
[J]. IEEE GEOSCIENCE AND REMOTE SENSING MAGAZINE, 2019, 7 (01) : 6 - 39

← 1 2 3 4 5 6 →