Can computer vision problems benefit from structured hierarchical classification?

被引:7
作者
Hoyoux, Thomas [1 ]
Rodriguez-Sanchez, Antonio J. [2 ]
Piater, Justus H. [2 ]
机构
[1] Univ Liege, Inst Montefiore, Signal & Image Exploitat INTELSIG, Liege, Belgium
[2] Univ Innsbruck, Inst Comp Sci, Intelligent & Interact Syst, Innsbruck, Austria
关键词
Hierarchical classification; Flat classification; Structured K-nearest neighbors; Structured support vector machines; Maximum margin regression; 3D shape classification; Expression recognition; Simulation framework; Feature representations; OBJECT RECOGNITION; SHAPE;
D O I
10.1007/s00138-016-0763-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Research in the field of supervised classification has mostly focused on the standard, so-called "flat" classification approach, where the problem classes live in a trivial, one-level semantic space. There is however an increasing interest in the hierarchical classification approach, where a performance gain is expected by incorporating prior taxonomic knowledge about the classes into the learning process. Intuitively, the hierarchical approach should be beneficial in general for the classification of visual content, as suggested by the fact that humans seem to organize objects into hierarchies based on visually perceived similarities. In this paper, we provide an analysis that aims to determine the conditions under which the hierarchical approach can consistently give better performances than the flat approach for the classification of visual content. In particular, we (1) show how hierarchical methods can fail to outperform flat methods when applied to real vision-based classification problems, and (2) investigate the underlying reasons for the lack of improvement, by applying the same methods to synthetic datasets in a simulation. Our conclusion is that the use of high-level hierarchical feature representations is crucial for obtaining a performance gain with the hierarchical approach, and that poorly chosen prior taxonomies hinder this gain even though proper high-level features are used.
引用
收藏
页码:1299 / 1312
页数:14
相关论文
共 35 条
  • [1] [Anonymous], P 2008 IEEE C COMP V
  • [2] [Anonymous], 2004, ISMIR
  • [3] [Anonymous], P BIOLINK SIG LLIKB
  • [4] [Anonymous], P IROS 15
  • [5] [Anonymous], 2010, PROCDPVT
  • [6] Astikainen Katja, 2008, BMC Proc, V2 Suppl 4, pS2
  • [7] Shape Similarity, Better than Semantic Membership, Accounts for the Structure of Visual Object Representations in a Population of Monkey Inferotemporal Neurons
    Baldassi, Carlo
    Alemi-Neissi, Alireza
    Pagan, Marino
    DiCarlo, James J.
    Zecchina, Riccardo
    Zoccolan, Davide
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2013, 9 (08)
  • [8] Barutcuoglu Z, 2006, IEEE INTERNATIONAL CONFERENCE ON SHAPE MODELING AND APPLICATIONS 2006, PROCEEDINGS, P289
  • [9] Shape matching and object recognition using shape contexts
    Belongie, S
    Malik, J
    Puzicha, J
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (04) : 509 - 522
  • [10] On the algorithmic implementation of multiclass kernel-based vector machines
    Crammer, K
    Singer, Y
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2002, 2 (02) : 265 - 292