Learning Category-Specific Dictionary and Shared Dictionary for Fine-Grained Image Categorization

被引:138
作者
Gao, Shenghua [1 ]
Tsang, Ivor Wai-Hung [2 ]
Ma, Yi [3 ]
机构
[1] Adv Digital Sci Ctr, Singapore 138632, Singapore
[2] Nanyang Technol Univ, Singapore 639798, Singapore
[3] Microsoft Res, Beijing 100080, Peoples R China
关键词
Class-specific dictionary; shared dictionary; fine-grained classification; SPARSE REPRESENTATION; ALGORITHM;
D O I
10.1109/TIP.2013.2290593
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper targets fine-grained image categorization by learning a category-specific dictionary for each category and a shared dictionary for all the categories. Such category-specific dictionaries encode subtle visual differences among different categories, while the shared dictionary encodes common visual patterns among all the categories. To this end, we impose incoherence constraints among the different dictionaries in the objective of feature coding. In addition, to make the learnt dictionary stable, we also impose the constraint that each dictionary should be self-incoherent. Our proposed dictionary learning formulation not only applies to fine-grained classification, but also improves conventional basic-level object categorization and other tasks such as event recognition. Experimental results on five data sets show that our method can outperform the state-of-the-art fine-grained image categorization frameworks as well as sparse coding based dictionary learning frameworks. All these results demonstrate the effectiveness of our method.
引用
收藏
页码:623 / 634
页数:12
相关论文
共 50 条
[1]   K-SVD: An algorithm for designing overcomplete dictionaries for sparse representation [J].
Aharon, Michal ;
Elad, Michael ;
Bruckstein, Alfred .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2006, 54 (11) :4311-4322
[2]  
[Anonymous], 2005, PROC CVPR IEEE
[3]  
[Anonymous], 2010, PASCAL VISUAL OBJECT
[4]   A Fast Iterative Shrinkage-Thresholding Algorithm for Linear Inverse Problems [J].
Beck, Amir ;
Teboulle, Marc .
SIAM JOURNAL ON IMAGING SCIENCES, 2009, 2 (01) :183-202
[5]  
Boureau Y, 2010, P 27 INT C MACH LEAR, P111, DOI DOI 10.5555/3104322.3104338
[6]  
Boureau YL, 2011, IEEE I CONF COMP VIS, P2651, DOI 10.1109/ICCV.2011.6126555
[7]   Learning Discriminative Sparse Representations for Modeling, Source Separation, and Mapping of Hyperspectral Imagery [J].
Castrodad, Alexey ;
Xing, Zhengming ;
Greer, John B. ;
Bosch, Edward ;
Carin, Lawrence ;
Sapiro, Guillermo .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2011, 49 (11) :4263-4281
[8]   Exploiting Hierarchical Context on a Large Database of Object Categories [J].
Choi, Myung Jin ;
Lim, Joseph J. ;
Torralba, Antonio ;
Willsky, Alan S. .
2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, :129-136
[9]   SUPPORT-VECTOR NETWORKS [J].
CORTES, C ;
VAPNIK, V .
MACHINE LEARNING, 1995, 20 (03) :273-297
[10]   Image denoising via sparse and redundant representations over learned dictionaries [J].
Elad, Michael ;
Aharon, Michal .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2006, 15 (12) :3736-3745