PBC: Polygon-Based Classifier for Fine-Grained Categorization

被引：36

作者：

Huang, Chao ^{[1
]}

Li, Hongliang ^{[1
]}

Xie, Yurui ^{[1
]}

Wu, Qingbo ^{[1
]}

Luo, Bing ^{[1
]}

机构：

[1] Univ Elect Sci & Technol China, Sch Elect Engn, Chengdu 611731, Peoples R China

来源：

IEEE TRANSACTIONS ON MULTIMEDIA | 2017年 / 19卷 / 04期

基金：

中国国家自然科学基金;

关键词：

Polygon-based classifier; fine-grained categorization; greedy algorithm; coarse-to-fine method; SEGMENTATION;

D O I：

10.1109/TMM.2016.2631122

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Fine-grained categorization is a challenging task mainly due to two factors: first, objects share similar appearances between different categories; second, objects present significant pose variation within the same category. To address these challenges, we propose a method to automatically detect discriminative and pose-invariant regions, which is referred to as a polygon-based classifier (PBC). In the first stage, we generate a set of polygons that are composed of multiple parts. For each polygon, a classifier is trained based on deep features of a convolutional network. Then, a greedy algorithm is employed to select the discriminative and complementary polygon-based classifiers that deliver highest classification accuracy for fine-grained object categories. In the second stage, the confusing classes of the first stage are selected and employed to train the polygon-based classifiers. Then, a greedy algorithm is employed to select discriminative classifiers. For the test images, we use the classifiers trained in the first stage to obtain a coarse result. Then, the classifiers of the second stage are adopted to distinguish the confusing classes of the coarse result. In our experiments, the proposed approach is evaluated on three well-known fine-grained datasets. The experiments show that our approach outperforms the state-of-the-art methods.

引用

页码：673 / 684

页数：12

共 62 条

[1] Efficient object detection and segmentation for fine-grained recognition [J].

Angelova, Anelia ;

Zhu, Shenghuo .

2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, :811-818

[2]

[Anonymous], 2014, P BRIT MACH VIS C, DOI 10.5244/C.28.87

[3]

[Anonymous], 2006, PATTERN RECOGN, DOI DOI 10.1117/1.2819119

[4]

[Anonymous], CORR

[5]

[Anonymous], 2007, P IEEE CVPR

[6]

[Anonymous], 2011, CALTECH UCSD BIRDS 2

[7]

[Anonymous], 2014, P 31 INT C INT C MAC

[8]

[Anonymous], 2013, Caffe: An Open Source Convolutional Architecture for Fast Feature Embedding

[9]

[Anonymous], 2015, PROC 28 INT C NEURAL

[10]

Azizpour Hossein, 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), P36, DOI 10.1109/CVPRW.2015.7301270

← 1 2 3 4 5 6 7 →