Asymptotic Generalization Bound of Fisher's Linear Discriminant Analysis

被引:22
作者
Bian, Wei [1 ,2 ]
Tao, Dacheng [1 ,2 ]
机构
[1] Univ Technol Sydney, Ctr Quantum Computat & Intelligent Syst, Ultimo, NSW 2007, Australia
[2] Univ Technol Sydney, Fac Engn & Informat Technol, Ultimo, NSW 2007, Australia
基金
澳大利亚研究理事会;
关键词
Fisher's linear discriminant analysis; asymptotic generalization analysis; random matrix theory; RECOGNITION; PREDICTION; BAYES;
D O I
10.1109/TPAMI.2014.2327983
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Fisher's linear discriminant analysis (FLDA) is an important dimension reduction method in statistical pattern recognition. It has been shown that FLDA is asymptotically Bayes optimal under the homoscedastic Gaussian assumption. However, this classical result has the following two major limitations: 1) it holds only for a fixed dimensionality D, and thus does not apply when D and the training sample size N are proportionally large; 2) it does not provide a quantitative description on how the generalization ability of FLDA is affected by D and N. In this paper, we present an asymptotic generalization analysis of FLDA based on random matrix theory, in a setting where both D and N increase and D/N --> gamma is an element of vertical bar 0, 1). The obtained lower bound of the generalization discrimination power overcomes both limitations of the classical result, i.e., it is applicable when D and N are proportionally large and provides a quantitative description of the generalization ability of FLDA in terms of the ratio gamma = D/N and the population discrimination power. Besides, the discrimination power bound also leads to an upper bound on the generalization error of binary-classification with FLDA.
引用
收藏
页码:2325 / 2337
页数:13
相关论文
共 32 条
[1]  
Alexandre-Cortizo E, 2005, EUROCON 2005: THE INTERNATIONAL CONFERENCE ON COMPUTER AS A TOOL, VOL 1 AND 2 , PROCEEDINGS, P1666
[2]   FINANCIAL RATIOS, DISCRIMINANT ANALYSIS AND PREDICTION OF CORPORATE BANKRUPTCY [J].
ALTMAN, EI .
JOURNAL OF FINANCE, 1968, 23 (04) :589-609
[3]  
Anderson T. W., 1984, An introduction to multivariate statistical analysis, V2nd
[4]  
[Anonymous], 2011, High-dimensional Data Analysis
[5]  
[Anonymous], 2004, RANDOM MATRIX THEORY
[6]  
[Anonymous], P AISTATS
[7]   Eigenfaces vs. Fisherfaces: Recognition using class specific linear projection [J].
Belhumeur, PN ;
Hespanha, JP ;
Kriegman, DJ .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1997, 19 (07) :711-720
[8]   Max-Min Distance Analysis by Using Sequential SDP Relaxation for Dimension Reduction [J].
Bian, Wei ;
Tao, Dacheng .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (05) :1037-1050
[9]   Some theory for Fisher's linear discriminant function, 'naive Bayes', and some alternatives when there are many more variables than observations [J].
Bickel, PJ ;
Levina, E .
BERNOULLI, 2004, 10 (06) :989-1010
[10]  
Billingsley P., 1999, WILEY SERIES PROBABI, V175