Few-Shot Object Detection with Model Calibration

被引:11
作者
Fan, Qi [1 ]
Tang, Chi-Keung [1 ]
Tai, Yu-Wing [1 ,2 ]
机构
[1] Hong Kong Univ Sci & Technol, Clear Water Bay, Hong Kong, Peoples R China
[2] Kuaishou Technol, Beijing, Peoples R China
来源
COMPUTER VISION, ECCV 2022, PT XIX | 2022年 / 13679卷
关键词
Few-shot object detection; Model bias; Model calibration; Uncertainty-aware RPN; Detector calibration;
D O I
10.1007/978-3-031-19800-7_42
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Few-shot object detection (FSOD) targets at transferring knowledge from known to unknown classes to detect objects of novel classes. However, previous works ignore the model bias problem inherent in the transfer learning paradigm. Such model bias causes overfitting toward the training classes and destructs the well-learned transferable knowledge. In this paper, we pinpoint and comprehensively investigate the model bias problem in FSOD models and propose a simple yet effective method to address the model bias problem with the facilitation of model calibrations in three levels: 1) Backbone calibration to preserve the well-learned prior knowledge and relieve the model bias toward base classes, 2) RPN calibration to rescue unlabeled objects of novel classes and, 3) Detector calibration to prevent the model bias toward a few training samples for novel classes. Specifically, we leverage the overlooked classification dataset to facilitate our model calibration procedure, which has only been used for pre-training in other related works. We validate the effectiveness of our model calibration method on the popular Pascal VOC and MS COCO datasets, where our method achieves very promising performance. Codes are released at https://github.com/fanq15/FewX.
引用
收藏
页码:720 / 739
页数:20
相关论文
共 106 条
[21]   Generalized Few-Shot Object Detection without Forgetting [J].
Fan, Zhibo ;
Ma, Yuchen ;
Li, Zeming ;
Sun, Jian .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :4525-4534
[22]  
Finn C, 2017, PR MACH LEARN RES, V70
[23]   Dynamic Few-Shot Visual Learning without Forgetting [J].
Gidaris, Spyros ;
Komodakis, Nikos .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :4367-4375
[24]   Fast R-CNN [J].
Girshick, Ross .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448
[25]  
Gordon Jonathan, 2019, ICLR
[26]  
Grant E., 2018, INT C LEARN REPR
[27]   Few-Shot Human Motion Prediction via Meta-learning [J].
Gui, Liang-Yan ;
Wang, Yu-Xiong ;
Ramanan, Deva ;
Moura, Jose M. F. .
COMPUTER VISION - ECCV 2018, PT VIII, 2018, 11212 :441-459
[28]  
Han G., 2021, ICCV
[29]  
He K., 2017, P INT C COMPUTER VIS
[30]   Momentum Contrast for Unsupervised Visual Representation Learning [J].
He, Kaiming ;
Fan, Haoqi ;
Wu, Yuxin ;
Xie, Saining ;
Girshick, Ross .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :9726-9735