Graphic Logo Detection with Deep Region-based Convolutional Networks

被引：0

作者：

Li, Yuanyuan ^{[1
,2
]}

Shi, Qiuyue ^{[1
,2
]}

Deng, Jiangfan ^{[1
,2
]}

Su, Fei ^{[1
,2
]}

机构：

[1] Beijing Univ Posts & Telecommun, Sch Commun & Informat Engn, Beijing, Peoples R China

[2] Beijing Univ Posts & Telecommun, Beijing Key Lab Network Syst & Network Culture, Beijing, Peoples R China

来源：

2017 IEEE VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP) | 2017年

关键词：

Logo detection; Faster R-CNN; Data augmentation; Network modification;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Logo detection is a challenging task with many practical applications in our daily life and intellectual property protection. The two main obstacles here are lack of public logo datasets and effective design of logo detection structure. In this paper, we first manually collected and annotated 6,400 images and mix them with FlickrLogo-32 dataset, forming a larger dataset. Secondly, we constructed Faster R-CNN frameworks with several widely used classification models for logo detection. Furthermore, the transfer learning method was introduced in the training process. Finally, clustering was used to guarantee suitable hyper-parameters and more precise anchors of RPN. Experimental results show that the proposed framework outperforms the state of-the-art methods with a noticeable margin.

引用

页数：4

共 17 条

[1] [Anonymous], 2015, ARXIV151002131
[2] Region-based CNN for Logo Detection
Bao, Yu
Li, Haojie
Fan, Xin
Liu, Risheng
Jia, Qi
[J]. 8TH INTERNATIONAL CONFERENCE ON INTERNET MULTIMEDIA COMPUTING AND SERVICE (ICIMCS2016), 2016, : 319 - 322
[3] Francesconi E., 1997, Graphics Recognition-Algorithms and Systems, V1389, P104
[4] Girshick R., 2014, IEEE C COMP VIS PATT, DOI [DOI 10.1109/CVPR.2014.81, 10.1109/CVPR.2014.81]
[5] Fast R-CNN
Girshick, Ross
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 1440 - 1448
[6] Deep Residual Learning for Image Recognition
He, Kaiming
Zhang, Xiangyu
Ren, Shaoqing
Sun, Jian
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
[7] Hoi S.C.H., 2015, IEEE T PATT AN MACH, V46, P2403
[8] Caffe: Convolutional Architecture for Fast Feature Embedding
Jia, Yangqing
Shelhamer, Evan
Donahue, Jeff
Karayev, Sergey
Long, Jonathan
Girshick, Ross
Guadarrama, Sergio
Darrell, Trevor
[J]. PROCEEDINGS OF THE 2014 ACM CONFERENCE ON MULTIMEDIA (MM'14), 2014, : 675 - 678
[9] Distinctive image features from scale-invariant keypoints
Lowe, DG
[J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2004, 60 (02) : 91 - 110
[10] Oliveira G, 2016, IEEE IJCNN, P985, DOI 10.1109/IJCNN.2016.7727305

← 1 2 →