Label, Verify, Correct: A Simple Few Shot Object Detection Method

被引:80
作者
Kaul, Prannay [1 ]
Xie, Weidi [1 ,2 ]
Zisserman, Andrew [1 ]
机构
[1] Univ Oxford, Visual Geometry Grp, Oxford, England
[2] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
来源
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2022年
基金
英国工程与自然科学研究理事会;
关键词
D O I
10.1109/CVPR52688.2022.01384
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The objective of this paper is few-shot object detection (FSOD) - the task of expanding an object detector for a new category given only a few instances for training. We introduce a simple pseudo-labelling method to source high-quality pseudo-annotations from the training set, for each new category, vastly increasing the number of training instances and reducing class imbalance; our method finds previously unlabelled instances. Naively training with model predictions yields suboptimal performance; we present two novel methods to improve the precision of the pseudo-labelling process: first, we introduce a verification technique to remove candidate detections with incorrect class labels; second, we train a specialised model to correct poor quality bounding boxes. After these two novel steps, we obtain a large set of high-quality pseudo-annotations that allow our final detector to be trained end-to-end. Additionally, we demonstrate our method maintains base class performance, and the utility of simple augmentations in FSOD. While benchmarking on PASCAL VOC and MS-COCO, our method achieves state-of-the-art or second-best performance compared to existing approaches across all number of shots.
引用
收藏
页码:14217 / 14227
页数:11
相关论文
共 59 条
[1]  
AmingWu Yahong Han, 2021, ICCV, P9567
[2]  
[Anonymous], 2019, ARXIV190407850, DOI DOI 10.1080/08870446.2019.1574348
[3]  
Bochkovskiy A, 2020, ARXIV, DOI 10.48550/ARXIV.2004.10934
[4]  
Caron Mathilde, 2020, ARXIV200609882
[5]  
Caron Mathilde, 2021, ICCV
[6]  
Chen H., 2018, Proceedings of the AAAI Conference on Artificial Intelligence
[7]   Xception: Deep Learning with Depthwise Separable Convolutions [J].
Chollet, Francois .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1800-1807
[8]  
Chum Ondrej, 2007, P 11 INT C COMP VIS
[9]  
Doersch C., 2020, P ADV NEUR INF PROC, P21981
[10]  
Dosovitskiy A, 2020, ARXIV