Discriminative learning and recognition of image set classes using canonical correlations

Cited by: 464
Authors
Kim, Tae-Kyun
Kittler, Josef
Cipolla, Roberto
Affiliations
[1] Univ Cambridge, Dept Engn, Cambridge CB2 1PZ, England
[2] Univ Surrey, Ctr Vis Speech & Signal Proc, Guildford GU2 7XH, Surrey, England
Keywords
object recognition; face recognition; image sets; canonical correlation; principal angles; canonical correlation analysis; linear discriminant analysis; orthogonal subspace method;
DOI
10.1109/TPAMI.2007.1037
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We address the problem of comparing sets of images for object recognition, where the sets may represent variations in an object's appearance due to changing camera pose and lighting conditions. Canonical Correlations (also known as principal or canonical angles), which can be thought of as the angles between two d-dimensional subspaces, have recently attracted attention for image set matching. Canonical correlations offer many benefits in accuracy, efficiency, and robustness compared to the two main classical methods: parametric distribution-based and nonparametric sample-based matching of sets. Here, this is first demonstrated experimentally for reasonably sized data sets using existing methods exploiting canonical correlations. Motivated by their proven effectiveness, a novel discriminative learning method over sets is proposed for set classification. Specifically, inspired by classical Linear Discriminant Analysis (LDA), we develop a linear discriminant function that maximizes the canonical correlations of within-class sets and minimizes the canonical correlations of between-class sets. Image sets transformed by the discriminant function are then compared by the canonical correlations. The classical orthogonal subspace method (OSM) is also investigated for a similar purpose and compared with the proposed method. The proposed method is evaluated on various object recognition problems using face image sets with arbitrary motion captured under different illuminations and image sets of 500 general objects taken at different views. The method is also applied to object category recognition using the ETH-80 database. The proposed method is shown to outperform the state-of-the-art methods in terms of accuracy and efficiency.
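To make the central quantity in the abstract concrete, the following is a minimal sketch of how canonical correlations between two image sets can be computed: each set is reduced to an orthonormal basis of a d-dimensional subspace, and the singular values of the product of the two bases give the cosines of the principal angles. The function name `canonical_correlations`, the column-per-image layout, and the use of an uncentered SVD to obtain the bases are assumptions made for illustration; this is not taken from the paper and does not include its discriminative transformation.

```python
import numpy as np

def canonical_correlations(set_a, set_b, d):
    """Cosines of the principal angles between two d-dimensional subspaces.

    set_a, set_b: arrays of shape (n_features, n_samples), one image per column
    (hypothetical layout chosen for this sketch).
    """
    # Orthonormal basis of each set's dominant d-dimensional subspace:
    # the first d left singular vectors of the data matrix.
    basis_a = np.linalg.svd(set_a, full_matrices=False)[0][:, :d]
    basis_b = np.linalg.svd(set_b, full_matrices=False)[0][:, :d]
    # The singular values of basis_a^T basis_b are the canonical correlations,
    # returned in descending order.
    cosines = np.linalg.svd(basis_a.T @ basis_b, compute_uv=False)
    # Guard against values drifting slightly outside [0, 1] from rounding.
    return np.clip(cosines, 0.0, 1.0)
```

A simple set-to-set similarity under this sketch would be the sum or mean of the returned values: two sets spanning the same subspace give correlations close to 1, and sets spanning mutually orthogonal subspaces give correlations close to 0.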
Pages: 1005-1018
Number of pages: 14
Related papers
40 in total
[11]  
Fukui K, 2006, LECT NOTES COMPUT SC, V3852, P315
[12]   The Amsterdam Library of Object Images [J].
Geusebroek, JM ;
Burghouts, GJ ;
Smeulders, AWM .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2005, 61 (01) :103-112
[13]  
Gittins R, 1985, CANONICAL ANAL REV A
[14]   From still image to video-based face recognition: An experimental analysis [J].
Hadid, A ;
Pietikäinen, M .
SIXTH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION, PROCEEDINGS, 2004, :813-818
[15]   Canonical correlation analysis: An overview with application to learning methods [J].
Hardoon, DR ;
Szedmak, S ;
Shawe-Taylor, J .
NEURAL COMPUTATION, 2004, 16 (12) :2639-2664
[16]   Relations between two sets of variates [J].
Hotelling, H .
BIOMETRIKA, 1936, 28 :321-377
[17]   VIEW OF 3 DECADES OF LINEAR FILTERING THEORY [J].
KAILATH, T .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1974, 20 (02) :146-181
[18]  
Kim TK, 2006, LECT NOTES COMPUT SC, V3953, P251
[19]   Locally linear discriminant analysis for multimodally distributed classes for face recognition with a single model image [J].
Kim, TK ;
Kittler, J .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2005, 27 (03) :318-327
[20]  
Kozakaya T., 2004, Transactions of the Information Processing Society of Japan, V45, P951