Multi-scale Positive Sample Refinement for Few-Shot Object Detection

Cited by: 266
Authors
Wu, Jiaxi [1, 2, 3]
Liu, Songtao [1, 2, 3]
Huang, Di [1, 2, 3]
Wang, Yunhong [1, 3]
Affiliations
[1] Beihang Univ, BAIC BDBC, Beijing 100191, Peoples R China
[2] Beihang Univ, SKLSDE, Beijing 100191, Peoples R China
[3] Beihang Univ, SCSE, Beijing 100191, Peoples R China
Source
COMPUTER VISION - ECCV 2020, PT XVI | 2020 / Vol. 12361
Keywords
Few-shot object detection; Multi-scale refinement
DOI
10.1007/978-3-030-58517-4_27
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Few-shot object detection (FSOD) helps detectors adapt to unseen classes with few training instances, and is useful when manual annotation is time-consuming or data acquisition is limited. Unlike previous attempts that exploit few-shot classification techniques to facilitate FSOD, this work highlights the necessity of handling the problem of scale variations, which is challenging due to the unique sample distribution. To this end, we propose a Multi-scale Positive Sample Refinement (MPSR) approach to enrich object scales in FSOD. It generates multi-scale positive samples as object pyramids and refines the prediction at various scales. We demonstrate its advantage by integrating it as an auxiliary branch into the popular Faster R-CNN with FPN architecture, delivering a strong FSOD solution. Experiments on PASCAL VOC and MS COCO show that the proposed approach achieves state-of-the-art results and significantly outperforms its counterparts. Code is available at https://github.com/jiaxi-wu/MPSR.
Pages: 456-472
Number of pages: 17