Efficient Two-Step Middle-Level Part Feature Extraction for Fine-Grained Visual Categorization

被引:1
作者
Nakayama, Hideki [1 ]
Tsuda, Tomoya [1 ]
机构
[1] Univ Tokyo, Tokyo 1138656, Japan
关键词
image classification; fine-grained categorization; part-based features; dimensionality reduction;
D O I
10.1587/transinf.2015EDP7358
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Fine-grained visual categorization ( FGVC) has drawn increasing attention as an emerging research field in recent years. In contrast to generic-domain visual recognition, FGVC is characterized by high intraclass and subtle inter-class variations. To distinguish conceptually and visually similar categories, highly discriminative visual features must be extracted. Moreover, FGVC has highly specialized and task-specific nature. It is not always easy to obtain a sufficiently large-scale training dataset. Therefore, the key to success in practical FGVC systems is to efficiently exploit discriminative features from a limited number of training examples. In this paper, we propose an efficient two-step dimensionality compression method to derive compact middle-level part-based features. To do this, we compare both space-first and feature-first convolution schemes and investigate their effectiveness. Our approach is based on simple linear algebra and analytic solutions, and is highly scalable compared with the current one-vs-one or one-vs-all approach, making it possible to quickly train middlelevel features from a number of pairwise part regions. We experimentally show the effectiveness of our method using the standard Caltech-Birds and Stanford-Cars datasets.
引用
收藏
页码:1626 / 1634
页数:9
相关论文
共 37 条
  • [31] ImageNet Classification with Deep Convolutional Neural Networks
    Krizhevsky, Alex
    Sutskever, Ilya
    Hinton, Geoffrey E.
    [J]. COMMUNICATIONS OF THE ACM, 2017, 60 (06) : 84 - 90
  • [32] Two-dimensional canonical correlation analysis
    Lee, Sun Ho
    Choi, Seungjin
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2007, 14 (10) : 735 - 738
  • [33] 2D-LDA: A statistical linear discriminant analysis for image matrix
    Li, M
    Yuan, BZ
    [J]. PATTERN RECOGNITION LETTERS, 2005, 26 (05) : 527 - 532
  • [34] Automated flower classification over a large number of classes
    Nilsback, Maria-Elena
    Zisserman, Andrew
    [J]. SIXTH INDIAN CONFERENCE ON COMPUTER VISION, GRAPHICS & IMAGE PROCESSING ICVGIP 2008, 2008, : 722 - 729
  • [35] Wah Catherine, 2011, The caltech-ucsd birds-200-2011 dataset
  • [36] Locality-constrained Linear Coding for Image Classification
    Wang, Jinjun
    Yang, Jianchao
    Yu, Kai
    Lv, Fengjun
    Huang, Thomas
    Gong, Yihong
    [J]. 2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 3360 - 3367
  • [37] Hierarchical Part Matching for Fine-Grained Visual Categorization
    Xie, Lingxi
    Tian, Qi
    Hong, Richang
    Yan, Shuicheng
    Zhang, Bo
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 1641 - 1648