Combining SO-PLS and linear discriminant analysis for multi-block classification

Cited by: 74
Authors
Biancolillo, Alessandra [1, 2]
Mage, Ingrid [1]
Naes, Tormod [1, 2]
Affiliations
[1] Nofima AS, N-1431 As, Norway
[2] Univ Copenhagen, Fac Life Sci, Dept Food Sci, DK-1958 Frederiksberg C, Denmark
Keywords
SO-PLS; Multiblock; Linear discriminant analysis; Regression; Classification; PARTIAL LEAST-SQUARES; DATA BLOCKS; REGRESSION; MODELS;
DOI
10.1016/j.chemolab.2014.12.001
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
The aim of the present work is to extend the Sequentially Orthogonalized-Partial Least Squares (SO-PLS) regression method, usually used for continuous output, to situations where classification is the main purpose. For this reason SO-PLS discriminant analysis will be compared with other commonly used techniques such as Partial Least Squares-Discriminant Analysis (PLS-DA) and Multiblock-Partial Least Squares Discriminant Analysis (MB-PLS-DA). In particular we will focus on how multiblock strategies can give better discrimination than by analyzing the individual blocks. We will also show that SO-PLS discriminant analysis yields some valuable interpretation tools that give additional insight into the data. We will introduce some new ways to represent the information, taking into account both interpretation and predictive aspects. © 2014 Elsevier B.V. All rights reserved.
Pages: 58-67
Page count: 10
Related papers
26 in total
[1]   Structure-revealing data fusion [J].
Acar, Evrim ;
Papalexakis, Evangelos E. ;
Gurdeniz, Gozde ;
Rasmussen, Morten A. ;
Lawaetz, Anders J. ;
Nilsson, Mathias ;
Bro, Rasmus .
BMC BIOINFORMATICS, 2014, 15
[2]  
[Anonymous], MATRIX PENCILS, DOI 10.1007/BFB0062108
[3]   Partial least squares for discrimination [J].
Barker, M ;
Rayens, W .
JOURNAL OF CHEMOMETRICS, 2003, 17 (03) :166-173
[4]   Multivariate data analysis as a tool in advanced quality monitoring in the food production chain [J].
Bro, R ;
van den Berg, F ;
Thybo, A ;
Andersen, CM ;
Jorgensen, BM ;
Andersen, H .
TRENDS IN FOOD SCIENCE & TECHNOLOGY, 2002, 13 (6-7) :235-244
[5]   The use of multiple measurements in taxonomic problems [J].
Fisher, RA .
ANNALS OF EUGENICS, 1936, 7 :179-188
[6]   On the interpretation of χ² from contingency tables, and the calculation of P [J].
Fisher, RA .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY, 1922, 85 :87-94
[7]   Analysis of -omics data: Graphical interpretation- and validation tools in multi-block methods [J].
Hassani, Sahar ;
Martens, Harald ;
Qannari, El Mostafa ;
Hanafi, Mohamed ;
Borge, Grethe Iren ;
Kohler, Achim .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2010, 104 (01) :140-153
[8]   Multivariate strategies for classification based on NIR-spectra - with application to mayonnaise [J].
Indahl, UG ;
Sahni, NS ;
Kirkhus, B ;
Næs, T .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 1999, 49 (01) :19-31
[9]   From dummy regression to prior probabilities in PLS-DA [J].
Indahl, Ulf G. ;
Martens, Harald ;
Naes, Tormod .
JOURNAL OF CHEMOMETRICS, 2007, 21 (12) :529-536
[10]   OnPLS-a novel multiblock method for the modelling of predictive and orthogonal variation [J].
Lofstedt, Tommy ;
Trygg, Johan .
JOURNAL OF CHEMOMETRICS, 2011, 25 (08) :441-455