The kernel PCA algorithms for wide data .1. Theory and algorithms

被引:94
作者
Wu, W
Massart, DL
deJong, S
机构
[1] FREE UNIV BRUSSELS,INST PHARMACEUT,CHEMOAC,B-1090 BRUSSELS,BELGIUM
[2] UNILEVER RES LABS VLAARDINGEN,NL-3133 AT VLAARDINGEN,NETHERLANDS
关键词
PCA algorithm; kernel; wide data;
D O I
10.1016/S0169-7439(97)00010-5
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Four classic PCA algorithms: NIPALS, the power method (POWER), singular value decomposition (SVD) and eigenvalue decomposition (EVD) are modified into their kernel version to analyse wide data sets. For such data sets with many variables and fewer objects, the classic algorithms become very inefficient, because the size of the associated matrix X'.X (pXp) is very large. Based on the kemel matrix X.X' (nXn), the kernel algorithms are developed. They yield the same principal components but, when the number of variables is higher than the number of objects, they are faster than the corresponding classic algorithms. Simulation results confirm this property and also show that the kernel EVD is the most efficient algorithm for wide data sets, and that the difference between the kernel SVD and EVD is not very large. Therefore, it is recommended to use the kernel EVD for wide data sets whether all PCs or only the first few are required. A Matlab PCA program which efficiently deals with all kinds of data sets was developed.
引用
收藏
页码:165 / 172
页数:8
相关论文
共 32 条
  • [1] [Anonymous], 1983, INTERPRETATION ANAL
  • [2] [Anonymous], 1989, MULTIVARIATE CALIBRA
  • [3] CROSS-VALIDATORY CHOICE OF THE NUMBER OF COMPONENTS FROM A PRINCIPAL COMPONENT ANALYSIS
    EASTMENT, HT
    KRZANOWSKI, WJ
    [J]. TECHNOMETRICS, 1982, 24 (01) : 73 - 77
  • [4] ESBENSEN K, 1994, MULTIVARIATE ANAL PR
  • [5] EVANS DG, 1993, J NEAR INFRARED SPEC, V1, P209
  • [6] PARTIAL LEAST-SQUARES REGRESSION - A TUTORIAL
    GELADI, P
    KOWALSKI, BR
    [J]. ANALYTICA CHIMICA ACTA, 1986, 185 : 1 - 17
  • [7] Golub GH, 2013, Matrix Computations, V4
  • [8] Jackson JE, 1991, A user's guide to principal components
  • [9] Karjalainen E.J., 1996, DATA ANAL HYPHENATED
  • [10] Kowalski B. R., 1984, CHEMOMETRICS MATH ST