Hierarchical discriminant analysis for image retrieval

被引:57
作者
Swets, DL
Weng, JY
机构
[1] Augustana Coll, Dept Comp Sci, Sioux Falls, SD 57197 USA
[2] Michigan State Univ, Dept Comp Sci, E Lansing, MI 48824 USA
基金
美国国家航空航天局; 美国国家科学基金会;
关键词
principal component analysis; discriminant analysis; hierarchical image database; image retrieval; tessellation; partitioning; object recognition; face recognition; complexity with large image databases;
D O I
10.1109/34.765652
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A self-organizing framework for object recognition is described. We describe a hierarchical database structure for image retrieval. The Self-Organizing Hierarchical Optimal Subspace Learning and Inference Framework (SHOSLIF) system uses the theories of optimal linear projection for automatic optimal feature derivation and a hierarchical structure to achieve a logarithmic retrieval complexity. A Space-Tessellation Tree is automatically generated using the Most Expressive Features (MEFs) and the Most Discriminating Features (MDFs) at each level of the tree. The major characteristics of the proposed hierarchical discriminant analysis include: 1) avoiding the limitation of global linear features (hyperplanes as separators) by deriving a recursively better-fitted set of features for each of the recursively subdivided sets of training samples; 2) generating a smaller tree whose cell boundaries separate the samples along the class boundaries better than the principal component analysis, thereby giving a better generalization capability (i.e., better recognition rate in a disjoint test); 3) accelerating the retrieval using a tree structure for data pruning, utilizing a different set of discriminant features at each level of the tree. We allow for perturbations in the size and position of objects in the images through learning. We demonstrate the technique on a large image database of widely varying real-world objects taken in natural settings, and show the applicability of the approach for variability in position, size, and 3D orientation. This paper concentrates on the hierarchical partitioning of the feature spaces.
引用
收藏
页码:386 / 401
页数:16
相关论文
共 43 条
  • [1] [Anonymous], 1990, THINKING INVITATION
  • [2] BAUM EB, 1990, P EURASIP WORKSHOP N, P2
  • [3] BREGLER C, 1995, FIFTH INTERNATIONAL CONFERENCE ON COMPUTER VISION, PROCEEDINGS, P494, DOI 10.1109/ICCV.1995.466899
  • [4] Breiman L., 1993, CLASSIFICATION REGRE
  • [5] DATTATREYA GR, 1985, PROGR PATTERN RECOGN, P189
  • [6] Duda R.O., 1973, Pattern classification
  • [7] The statistical utilization of multiple measurements
    Fisher, RA
    [J]. ANNALS OF EUGENICS, 1938, 8 : 376 - 386
  • [8] FRIEDMAN JH, 1977, IEEE T COMPUT, V26, P404, DOI 10.1109/TC.1977.1674849
  • [9] Fukunaga K., 1990, INTRO STAT PATTERN R
  • [10] Grimson W.E.L., 1990, OBJECT RECOGNITION C