Enhancing the utilization of uncertain pixels in semi-supervised semantic segmentation

被引:0
作者
Chang, Xingfang [1 ]
Chen, Changrui [2 ]
Shan, Caifeng [1 ,3 ,4 ]
机构
[1] Shandong Univ Sci & Technol, Coll Elect Engn & Automat, Qingdao 266590, Peoples R China
[2] Univ Warwick, WMG Visualizat, Coventry CV4 7AL, England
[3] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing 210023, Peoples R China
[4] Nanjing Univ, Sch Intelligence Sci & Technol, Nanjing 210023, Peoples R China
关键词
Semantic segmentation; Semi-supervised learning; Uncertain pixels;
D O I
10.1016/j.neucom.2024.128598
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In semi-supervised semantic segmentation, determining the correct label for uncertain pixels is crucial yet challenging. The recently proposed virtual category (VC) learning achieves excellent performance by creating a potential category (PC) set to exploit the valuable part of the uncertain pixels effectively. However, the potential category set in the existing method is not accurate enough at the beginning of iterations, which could lead to less accurate segmentation results. To better utilize uncertain pixels, we propose to improve the strategy of constructing the potential category set. Specifically, we adjust the threshold based on the model's optimization status and split the pseudo-labels into low/high-confidence areas to create a more appropriate PC set. At the same time, due to uncertainty in boundary pixels, it is challenging to achieve precise object segmentation, and there is currently no method available to optimize these areas in semi-supervised segmentation tasks. Therefore, we propose a entropy-based method to identify boundary areas and design a novel boundary refinement network to process labeled and unlabeled data separately to optimize segmentation. The experimental results demonstrate that our proposed method excels in accurately segmenting the boundary areas of the targets. Furthermore, it achieves state-of-the-art semi-supervised segmentation on the Pascal VOC and Cityscapes datasets.
引用
收藏
页数:12
相关论文
共 53 条
[1]   Pseudo-Labeling and Confirmation Bias in Deep Semi-Supervised Learning [J].
Arazo, Eric ;
Ortego, Diego ;
Albert, Paul ;
O'Connor, Noel E. ;
McGuinness, Kevin .
2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
[2]  
Berthelot D, 2019, ADV NEUR IN, V32
[3]  
Blum A., 1998, Proceedings of the Eleventh Annual Conference on Computational Learning Theory, P92, DOI 10.1145/279943.279962
[4]  
Chen C., 2022, Proceedings of the European Conference on Computer Vision
[5]   Virtual Category Learning: A Semi-Supervised Learning Method for Dense Prediction With Extremely Limited Labels [J].
Chen, Changrui ;
Han, Jungong ;
Debattista, Kurt .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (08) :5595-5611
[6]  
Chen LC, 2017, Arxiv, DOI [arXiv:1706.05587, 10.48550/arXiv.1706.05587]
[7]   Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation [J].
Chen, Liang-Chieh ;
Zhu, Yukun ;
Papandreou, George ;
Schroff, Florian ;
Adam, Hartwig .
COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :833-851
[8]   Semi-Supervised Semantic Segmentation with Cross Pseudo Supervision [J].
Chen, Xiaokang ;
Yuan, Yuhui ;
Zeng, Gang ;
Wang, Jingdong .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :2613-2622
[9]   The Cityscapes Dataset for Semantic Urban Scene Understanding [J].
Cordts, Marius ;
Omran, Mohamed ;
Ramos, Sebastian ;
Rehfeld, Timo ;
Enzweiler, Markus ;
Benenson, Rodrigo ;
Franke, Uwe ;
Roth, Stefan ;
Schiele, Bernt .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3213-3223
[10]  
DeVries T, 2017, Arxiv, DOI arXiv:1708.04552