Efficient Two-Step Middle-Level Part Feature Extraction for Fine-Grained Visual Categorization

被引:1
作者
Nakayama, Hideki [1 ]
Tsuda, Tomoya [1 ]
机构
[1] Univ Tokyo, Tokyo 1138656, Japan
关键词
image classification; fine-grained categorization; part-based features; dimensionality reduction;
D O I
10.1587/transinf.2015EDP7358
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Fine-grained visual categorization ( FGVC) has drawn increasing attention as an emerging research field in recent years. In contrast to generic-domain visual recognition, FGVC is characterized by high intraclass and subtle inter-class variations. To distinguish conceptually and visually similar categories, highly discriminative visual features must be extracted. Moreover, FGVC has highly specialized and task-specific nature. It is not always easy to obtain a sufficiently large-scale training dataset. Therefore, the key to success in practical FGVC systems is to efficiently exploit discriminative features from a limited number of training examples. In this paper, we propose an efficient two-step dimensionality compression method to derive compact middle-level part-based features. To do this, we compare both space-first and feature-first convolution schemes and investigate their effectiveness. Our approach is based on simple linear algebra and analytic solutions, and is highly scalable compared with the current one-vs-one or one-vs-all approach, making it possible to quickly train middlelevel features from a number of pairwise part regions. We experimentally show the effectiveness of our method using the standard Caltech-Birds and Stanford-Cars datasets.
引用
收藏
页码:1626 / 1634
页数:9
相关论文
共 37 条
  • [1] [Anonymous], 2004, ADV NEURAL INF PROCE
  • [2] [Anonymous], 1991, P IEEE CVPR
  • [3] [Anonymous], 2014, P BMVC
  • [4] [Anonymous], 2013, IEEE WORKSH 3D REPR
  • [5] [Anonymous], IEEE FG
  • [6] [Anonymous], 2014, P ICPR
  • [7] [Anonymous], P IEEE ICCV
  • [8] [Anonymous], P ACM MULT
  • [9] [Anonymous], P ECCV
  • [10] [Anonymous], P IEEE CVPR