Semi-supervised Object Detection via VC Learning

Cited by: 2
Authors
Chen, Changrui [1]
Debattista, Kurt [1]
Han, Jungong [1, 2]
Affiliations
[1] Univ Warwick, WMG, Coventry, W Midlands, England
[2] Aberystwyth Univ, Comp Sci, Aberystwyth, Dyfed, Wales
Source
COMPUTER VISION, ECCV 2022, PT XXXI | 2022 / Vol. 13691
Keywords
Semi-supervised learning; Object detection
DOI
10.1007/978-3-031-19821-2_10
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Due to the costliness of labelled data in real-world applications, semi-supervised object detectors, underpinned by pseudo labelling, are appealing. However, handling confusing samples is nontrivial: discarding valuable confusing samples would compromise the model generalisation while using them for training would exacerbate the confirmation bias issue caused by inevitable mislabelling. To solve this problem, this paper proposes to use confusing samples proactively without label correction. Specifically, a virtual category (VC) is assigned to each confusing sample such that they can safely contribute to the model optimisation even without a concrete label. It is attributed to specifying the embedding distance between the training sample and the virtual category as the lower bound of the inter-class distance. Moreover, we also modify the localisation loss to allow high-quality boundaries for location regression. Extensive experiments demonstrate that the proposed VC learning significantly surpasses the state-of-the-art, especially with small amounts of available labels.
Pages: 169-185
Number of pages: 17