Few-shot logo detection

被引:2
作者
Hou, Sujuan [1 ]
Liu, Wenjie [1 ]
Karim, Awudu [2 ]
Jia, Zhixiang [1 ]
Jia, Weikuan [1 ]
Zheng, Yuanjie [1 ]
机构
[1] Shandong Normal Univ, Sch Informat Sci & Engn, Jinan, Peoples R China
[2] Beijing Univ Technol, Sch Engn, Beijing, Peoples R China
关键词
computer vision; object detection; RECOGNITION; NETWORK;
D O I
10.1049/cvi2.12205
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The proliferation of deep learning has driven research into deep learning-based logo detection, which usually needs a large number of annotated data to train the model. However, due to the occasional appearance of new brands or the high cost of annotation, the number of training data is limited. Against this backdrop, the authors adapt the few-shot object detection into logo detection, and thus present a cutting-edge method called Double Classification Head (DCH) for Few-Shot Logo Detection (DCH-FSLogo), which aims at detecting the unseen logo classes using few annotated data. Unlike the traditional few-shot detection, some logo objects are similar to their backgrounds and have diverse shapes as well. For this reason, the authors adopt balanced feature pyramid and deformable Region of Interest pooling in DCH-FSLogo, this enhances the feature extraction capability and adapts to the different logo shapes. In addition, we introduce the DCH for few-shot logo detection to detect logo objects using few annotated data. Specifically, we use an extra classification head for the base classes to ease the influence from the novel classes. The experimental results on four datasets, namely: FlickrLogos-32, FoodLogoDet-1500-100, LogoDet-3K-100 and QMUL-OpenLogo-100, demonstrate that our method achieves better performance.
引用
收藏
页码:586 / 598
页数:13
相关论文
共 60 条
[1]   Meta-Learning with Task-Adaptive Loss Function for Few-Shot Learning [J].
Baik, Sungyong ;
Choi, Janghoon ;
Kim, Heewon ;
Cho, Dohee ;
Min, Jaesik ;
Lee, Kyoung Mu .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :9445-9454
[2]   End-to-End Object Detection with Transformers [J].
Carion, Nicolas ;
Massa, Francisco ;
Synnaeve, Gabriel ;
Usunier, Nicolas ;
Kirillov, Alexander ;
Zagoruyko, Sergey .
COMPUTER VISION - ECCV 2020, PT I, 2020, 12346 :213-229
[3]  
Chen W-Y, 2019, ICLR
[4]   SUPPORT-VECTOR NETWORKS [J].
CORTES, C ;
VAPNIK, V .
MACHINE LEARNING, 1995, 20 (03) :273-297
[5]   Deformable Convolutional Networks [J].
Dai, Jifeng ;
Qi, Haozhi ;
Xiong, Yuwen ;
Li, Yi ;
Zhang, Guodong ;
Hu, Han ;
Wei, Yichen .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :764-773
[6]   Few-Example Object Detection with Model Communication [J].
Dong, Xuanyi ;
Zheng, Liang ;
Ma, Fan ;
Yang, Yi ;
Meng, Deyu .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019, 41 (07) :1641-1654
[7]   The Pascal Visual Object Classes (VOC) Challenge [J].
Everingham, Mark ;
Van Gool, Luc ;
Williams, Christopher K. I. ;
Winn, John ;
Zisserman, Andrew .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 88 (02) :303-338
[8]   Few-Shot Object Detection with Attention-RPN and Multi-Relation Detector [J].
Fan, Qi ;
Zhuo, Wei ;
Tang, Chi-Keung ;
Tai, Yu-Wing .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :4012-4021
[9]   Generalized Few-Shot Object Detection without Forgetting [J].
Fan, Zhibo ;
Ma, Yuchen ;
Li, Zeming ;
Sun, Jian .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :4525-4534
[10]   Rich feature hierarchies for accurate object detection and semantic segmentation [J].
Girshick, Ross ;
Donahue, Jeff ;
Darrell, Trevor ;
Malik, Jitendra .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :580-587