THE SPECTRUM OF KERNEL RANDOM MATRICES

被引:127
作者
El Karoui, Noureddine [1 ]
机构
[1] Univ Calif Berkeley, Dept Stat, Berkeley, CA 94720 USA
关键词
Covariance matrices; kernel matrices; eigenvalues of covariance matrices; multivariate statistical analysis; high-dimensional inference; random matrix theory; machine learning; Hadamard matrix functions; concentration of measure; LARGEST EIGENVALUE; SPACING DISTRIBUTIONS; NO EIGENVALUES; COVARIANCE; LIMIT; ASYMPTOTICS; SUPPORT;
D O I
10.1214/08-AOS648
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We place Ourselves in the setting of high-dimensional statistical inference where the number of variables p in a dataset of interest is of the same order of magnitude as the number of observations n. We consider the spectrum of certain kernel random matrices, in particular n x n matrices whose (i, j)th entry is f(X-i' X-j/p) or f(vertical bar vertical bar X-i - X-j vertical bar vertical bar(2)/p) where p is the dimension of the data, and X-i are independent data vectors. Here f is assumed to be a locally smooth function. The study is motivated by questions arising in statistics and computer science where these matrices are used to perform, among other things, nonlinear versions of principal component analysis. Surprisingly, we show that in high dimensions, and for the models we analyze, the problem becomes essentially linear-which is at odds with heuristics sometimes used to justify the usage of these methods. The analysis also highlights certain Peculiarities of models widely studied in random matrix theory and raises some questions about their relevance as tools to model high-dimensional data encountered in practice.
引用
收藏
页码:1 / 50
页数:50
相关论文
共 45 条
[1]  
Anderson T. W., 2003, An Introduction to Multivariate Statistical Analysis, V3rd
[2]  
[Anonymous], 2000, OPER THEOR
[3]  
[Anonymous], 1998, Ecole d'Ete de Probabilites de Saint-Flour XXVIII-1998
[4]   Kernel independent component analysis [J].
Bach, FR ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (01) :1-48
[5]   On asymptotics of eigenvectors of large sample covariance matrix [J].
Bai, Z. D. ;
Miao, B. Q. ;
Pan, G. M. .
ANNALS OF PROBABILITY, 2007, 35 (04) :1532-1572
[6]  
Bai ZD, 1999, STAT SINICA, V9, P611
[7]  
Bai ZD, 1998, ANN PROBAB, V26, P316
[8]  
Bai ZD, 1999, ANN PROBAB, V27, P1536
[9]   Phase transition of the largest eigenvalue for nonnull complex sample covariance matrices [J].
Baik, J ;
Ben Arous, G ;
Péché, S .
ANNALS OF PROBABILITY, 2005, 33 (05) :1643-1697
[10]   Eigenvalues of large sample covariance matrices of spiked population models [J].
Baik, Jinho ;
Silverstein, Jack W. .
JOURNAL OF MULTIVARIATE ANALYSIS, 2006, 97 (06) :1382-1408