Few-Shot Object Detection using Global Attention and Support Attention

被引:1
作者
Yang, Chongzhi [1 ]
Yu, Linfang [1 ]
Xiao, Peng [1 ]
Wang, Bintao [1 ]
机构
[1] UESTC, Sch Informat & Software Engn, Chengdu, Peoples R China
来源
2020 5TH INTERNATIONAL CONFERENCE ON MECHANICAL, CONTROL AND COMPUTER ENGINEERING (ICMCCE 2020) | 2020年
基金
中国国家自然科学基金;
关键词
few shot; object detection; global attention; support attention;
D O I
10.1109/ICMCCE51767.2020.00317
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
General object detection methods in computer vision task typically require a lot of bounding box labeled data. In addition, after training on the training data of the specified category, only the existing categories in the training can be detected. In order to detect the new category, sufficient training data of the new category must be collected and the new category can only be detected after the network is trained again. In this paper, we design a few-shot object detection network, aiming to achieve object detection against unseen categories by using a small number of annotated sample data as support sets. Our network is based on Faster R-CNN, and using a Fixed-size Global Attention module on the features generated by the backbone so that the whole network has different attention weights for different positions of the image. A Support-Set Attention module is proposed which enables the network to focus the detection target on the categories provided by the support set. Finally, the relation detection module uses the similarity between the few-shot support set and the query set to give the matching score and the location regression. After training on the base class, our network can be directly used on N-way K-shot detection without fine-tuning. In our experiments, our network achieved equal or better results than others.
引用
收藏
页码:1446 / 1450
页数:5
相关论文
共 34 条
[1]   Brain tumor segmentation with deep convolutional symmetric neural network [J].
Chen, Hao ;
Qin, Zhiguang ;
Ding, Yi ;
Tian, Lan ;
Qin, Zhen .
NEUROCOMPUTING, 2020, 392 :305-313
[2]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893
[3]   Few-Example Object Detection with Model Communication [J].
Dong, Xuanyi ;
Zheng, Liang ;
Ma, Fan ;
Yang, Yi ;
Meng, Deyu .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019, 41 (07) :1641-1654
[4]   The Pascal Visual Object Classes (VOC) Challenge [J].
Everingham, Mark ;
Van Gool, Luc ;
Williams, Christopher K. I. ;
Winn, John ;
Zisserman, Andrew .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 88 (02) :303-338
[5]   Few-Shot Object Detection with Attention-RPN and Multi-Relation Detector [J].
Fan, Qi ;
Zhuo, Wei ;
Tang, Chi-Keung ;
Tai, Yu-Wing .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :4012-4021
[6]   An End-to-End Neural Network for Road Extraction From Remote Sensing Imagery by Multiple Feature Pyramid Network [J].
Gao, Xun ;
Sun, Xian ;
Zhang, Yi ;
Yan, Menglong ;
Xu, Guangluan ;
Sun, Hao ;
Jiao, Jiao ;
Fu, Kun .
IEEE ACCESS, 2018, 6 :39401-39414
[7]   Fast R-CNN [J].
Girshick, Ross .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448
[8]   Rich feature hierarchies for accurate object detection and semantic segmentation [J].
Girshick, Ross ;
Donahue, Jeff ;
Darrell, Trevor ;
Malik, Jitendra .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :580-587
[9]  
Gregory K., 2015, ICML WORKSH
[10]  
He KM, 2020, IEEE T PATTERN ANAL, V42, P386, DOI [10.1109/TPAMI.2018.2844175, 10.1109/ICCV.2017.322]