Discriminative learning and recognition of image set classes using canonical correlations

被引:464
作者
Kim, Tae-Kyun
Kittler, Josef
Cipolla, Roberto
机构
[1] Univ Cambridge, Dept Engn, Cambridge CB2 1PZ, England
[2] Univ Surrey, Ctr Vis Speech & Signal Proc, Guildford GU2 7XH, Surrey, England
关键词
object recognition; face recognition; image sets; canonical correlation; principal angles; canonical correlation analysis; linear discriminant analysis; orthogonal subspace method;
D O I
10.1109/TPAMI.2007.1037
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We address the problem of comparing sets of images for object recognition, where the sets may represent variations in an object's appearance due to changing camera pose and lighting conditions. Canonical Correlations (also known as principal or canonical angles), which can be thought of as the angles between two d-dimensional subspaces, have recently attracted attention for image set matching. Canonical correlations offer many benefits in accuracy, efficiency, and robustness compared to the two main classical methods: parametric distribution-based and nonparametric sample-based matching of sets. Here, this is first demonstrated experimentally for reasonably sized data sets using existing methods exploiting canonical correlations. Motivated by their proven effectiveness, a novel discriminative learning method over sets is proposed for set classification. Specifically, inspired by classical Linear Discriminant Analysis (LDA), we develop a linear discriminant function that maximizes the canonical correlations of within-class sets and minimizes the canonical correlations of between-class sets. Image sets transformed by the discriminant function are then compared by the canonical correlations. Classical orthogonal subspace method (OSM) is also investigated for the similar purpose and compared with the proposed method. The proposed method is evaluated on various object recognition problems using face image sets with arbitrary motion captured under different illuminations and image sets of 500 general objects taken at different views. The method is also applied to object category recognition using ETH-80 database. The proposed method is shown to outperform the state-of-the-art methods in terms of accuracy and efficiency.
引用
收藏
页码:1005 / 1018
页数:14
相关论文
共 40 条
[1]  
[Anonymous], J MACHINE LEARNING R
[2]  
[Anonymous], 2005, BRIT MACH VIS C
[3]  
[Anonymous], 2003, P INT S ROB RES
[4]  
Arandjelovic O, 2005, PROC CVPR IEEE, P581
[5]   Eigenfaces vs. Fisherfaces: Recognition using class specific linear projection [J].
Belhumeur, PN ;
Hespanha, JP ;
Kriegman, DJ .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1997, 19 (07) :711-720
[6]   NUMERICAL METHODS FOR COMPUTING ANGLES BETWEEN LINEAR SUBSPACES [J].
BJORCK, A ;
GOLUB, GH .
MATHEMATICS OF COMPUTATION, 1973, 27 (123) :579-594
[7]  
BLACKBURN DM, 2000, FACIAL RECOGNITION V
[8]   Nonparametric discriminant analysis and nearest neighbor classification [J].
Bressan, M ;
Vitrià, J .
PATTERN RECOGNITION LETTERS, 2003, 24 (15) :2743-2749
[9]  
Cover T. M., 2005, ELEM INF THEORY, DOI 10.1002/047174882X
[10]  
Duda R. O., 1973, Pattern Classification