A classification tool for N-way array based on SIMCA methodology

被引:45
作者
Durante, Caterina [1 ]
Bro, Rasmus [2 ]
Cocchi, Marina [1 ]
机构
[1] Univ Modena & Reggio Emilia, Dept Chem, I-41125 Modena, Italy
[2] Univ Copenhagen, Dept Food Sci, Fac Life Sci, DK-1958 Frederiksberg C, Denmark
关键词
SIMCA; Multi-way classification; Discriminant analysis; Class modelling; PARAFAC; Tucker; PRINCIPAL COMPONENT ANALYSIS; PARALLEL FACTOR-ANALYSIS; MULTIVARIATE CLASSIFICATION; OLIVE OILS; FLUORESCENCE; SPECTROSCOPY; MODELS;
D O I
10.1016/j.chemolab.2010.09.004
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the literature there are only few papers concerned with classification methods for multi-way arrays. The most common procedure, by far, is to unfold the multi-way data array into an ordinary matrix and then to apply the traditional multivariate tools for classification. As opposed to unfolding the data several possibilities exist for building classification models more directly based on the multi-way structure of the data. As an example, multi-way partial least squares discriminant analysis has been used as a supervised classification method, another alternative that has been investigated is to perform classification using Fisher's LDA or SIMCA on the score matrix from e.g. a PARAFAC or a Tucker model. Despite a few attempts of applying such multi-way classification approaches, no-one has looked into how such models are best built and implemented. In this work, the SIMCA method is extended to three-way arrays. Included in this work is also actual code that will work on general multi-way arrays rather than just three-way arrays. In analogy with two-way SIMCA. a decomposition model is separately built for the multi-way data for each class, using multi-way decomposition method such as PARAFAC or Tucker3. In the choice of the best class dimensionality, i.e. number of latent factors, both the results of cross-validation but mainly the sensitivity/specificity values are evaluated. In order to estimate the class limits for each class model, orthogonal and score distances are considered, and different statistics are implemented and tested to set confidence limits for these two parameters. Classification performance using different definitions of class boundaries and classification rules, including the use of cross-validated residuals and scores is compared. The proposed N-SIMCA methodology and code, besides simulated data sets of varying dimensionality, has been tested on two case studies, concerning food authentication tasks for typical food products. (C) 2010 Elsevier B.V. All rights reserved.
引用
收藏
页码:73 / 85
页数:13
相关论文
共 39 条
[21]   Estuarine water classification using EEM spectroscopy and PARAFAC-SIMCA [J].
Hall, Gregory J. ;
Kenny, Jonathan E. .
ANALYTICA CHIMICA ACTA, 2007, 581 (01) :118-124
[22]   Classification of sags gathered in distribution substations based on multiway principal component analysis [J].
Khosravi, Abbas ;
Melendez, Joaquim ;
Colomer, Joan .
ELECTRIC POWER SYSTEMS RESEARCH, 2009, 79 (01) :144-151
[23]   SIMCA MULTIVARIATE DATA-ANALYSIS OF BLUE MUSSEL COMPONENTS IN ENVIRONMENTAL-POLLUTION STUDIES [J].
KVALHEIM, OM ;
OYGARD, K ;
GRAHLNIELSEN, O .
ANALYTICA CHIMICA ACTA, 1983, 150 (01) :145-152
[24]  
Leardi R, 2000, J CHEMOMETR, V14, P187, DOI 10.1002/1099-128X(200005/06)14:3<187::AID-CEM593>3.0.CO
[25]  
2-0
[26]   Multivariate statistical process control of batch processes based on three-way models [J].
Louwerse, DJ ;
Smilde, AK .
CHEMICAL ENGINEERING SCIENCE, 2000, 55 (07) :1225-1235
[27]  
Maesschalck RD, 1999, CHEMOM INTELL LAB SY, V47, P65, DOI DOI 10.1016/S0169-7439(98)00159-2
[28]  
Massart D. L., 1998, DATA HANDLING SCI TE, DOI DOI 10.1016/S0922-3487(97)80055-X
[29]   Monitoring chemical changes of dry-cured parma ham during processing by surface autofluorescence spectroscopy [J].
Moller, JKS ;
Parolari, G ;
Gabba, L ;
Christensen, J ;
Skibsted, LH .
JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY, 2003, 51 (05) :1224-1230
[30]   Acceptance areas for multivariate classification derived by projection methods [J].
Pomerantsev, Alexey L. .
JOURNAL OF CHEMOMETRICS, 2008, 22 (11-12) :601-609