A PLS KERNEL ALGORITHM FOR DATA SETS WITH MANY VARIABLES AND FEWER OBJECTS .1. THEORY AND ALGORITHM

被引:225
作者
RANNAR, S
LINDGREN, F
GELADI, P
WOLD, S
机构
[1] Research Group for Chemometrics, Department of Organic Chemistry, University of Umeå, Umeå
关键词
PLS REGRESSION ALGORITHM; KERNEL; MANY-VARIABLE DATA SETS;
D O I
10.1002/cem.1180080204
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A fast PLS regression algorithm dealing with large data matrices with many variables (K) and fewer objects (N) is presented. For such data matrices the classical algorithm is computer-intensive and memory-demanding. Recently, Lindgren et al. (J. Chemometrics, 7, 45-49 (1993)) developed a quick and efficient kernel algorithm for the case with many objects and few variables. The present paper is focused on the opposite case, i.e. many variables and fewer objects. A kernel algorithm is presented based on eigenvectors to the 'kernel' matrix XX(T)YY(T), which is a square, non-symmetric matrix of size N x N, where N is the number of objects. Using the kernel matrix and the association matrices XX(T) (N x N) and YY(T) (N x N), it is possible to calculate all score and loading vectors and hence conduct a complete PLS regression including diagnostics such as R2. This is done without returning to the original data matrices X and Y. The algorithm is presented in equation form, with proofs of some new properties and as MATLAB code.
引用
收藏
页码:111 / 125
页数:15
相关论文
共 24 条
[1]  
BOOKSTEIN F, 1980, EVALUATION ECONOMETR, P75
[2]   COMPARATIVE MOLECULAR-FIELD ANALYSIS (COMFA) .1. EFFECT OF SHAPE ON BINDING OF STEROIDS TO CARRIER PROTEINS [J].
CRAMER, RD ;
PATTERSON, DE ;
BUNCE, JD .
JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 1988, 110 (18) :5959-5967
[3]   SIMPLS - AN ALTERNATIVE APPROACH TO PARTIAL LEAST-SQUARES REGRESSION [J].
DEJONG, S .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 1993, 18 (03) :251-263
[4]   SOME COMMENTS ON MAXIMUM-LIKELIHOOD AND PARTIAL LEAST-SQUARES METHODS [J].
DIJKSTRA, T .
JOURNAL OF ECONOMETRICS, 1983, 22 (1-2) :67-90
[5]  
Dunn WI, 1990, COMPREHENSIVE MED CH, V4.0., P691
[6]   2 STRUCTURAL EQUATION MODELS - LISREL AND PLS APPLIED TO CONSUMER EXIT-VOICE THEORY [J].
FORNELL, C ;
BOOKSTEIN, FL .
JOURNAL OF MARKETING RESEARCH, 1982, 19 (04) :440-452
[7]   DRUG DESIGN BY THE METHOD OF RECEPTOR FIT [J].
GOODFORD, PJ .
JOURNAL OF MEDICINAL CHEMISTRY, 1984, 27 (05) :557-564
[8]   PEPTIDE QUANTITATIVE STRUCTURE-ACTIVITY-RELATIONSHIPS, A MULTIVARIATE APPROACH [J].
HELLBERG, S ;
SJOSTROM, M ;
SKAGERBERG, B ;
WOLD, S .
JOURNAL OF MEDICINAL CHEMISTRY, 1987, 30 (07) :1126-1135
[9]  
Hoskuldsson A., 1988, J CHEMOMETR, V2, P211, DOI [DOI 10.1002/CEM.1180020306, 10.1002/cem.1180020306]
[10]   A STRATEGY FOR RANKING ENVIRONMENTALLY OCCURRING CHEMICALS [J].
JONSSON, J ;
ERIKSSON, L ;
SJOSTROM, M ;
WOLD, S ;
TOSATO, ML .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 1989, 5 (02) :169-186