Evaluation of the effect of data pre-treatment procedures on classical pattern recognition and principal components analysis:: a case study for the geographical classification of tea

被引:44
作者
Moreda-Piñeiro, A
Marcos, A
Fisher, A
Hill, SJ
机构
[1] Univ Plymouth, Plymouth Environm Res Ctr, Dept Environm Sci, Plymouth PL4 8AA, Devon, England
[2] Univ Santiago de Compostela, Fac Chem, Dept Analyt Chem Nutr & Bromatol, Santiago De Compostela 15706, Spain
来源
JOURNAL OF ENVIRONMENTAL MONITORING | 2001年 / 3卷 / 04期
关键词
D O I
10.1039/b103658k
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
A simple transformation that uses the half-range and central value has been used as a data pre-treatment procedure for principal component analysis (PCA) and pattern recognition techniques. The results obtained have been compared with the results from classical normalisation of data (mean normalisation, maximum normalisation and range normalisation), autoscaling and the minimum-maximum transformation. Three data sets were used in the study. The first was formed by determining 17 elements in 53 tea samples (901 pieces of data). The second and third data sets arose from two long-term drift studies performed to examine instrumental stability at standard and robust conditions. The instruments used were an inductively coupled plasma atomic emission spectrometer and an inductively coupled plasma mass spectrometer. Each drift diagnosis experiment consisted of replicate determinations of a test solution containing 15 analytes at 10 mg l(-1) over 8 h without recalibration. Twenty-nine emission lines were determined 99 times, thus, each data set was formed by 2881 pieces of data. Data pre-treatment was applied to the three data sets prior to the use of principal component analysis, cluster analysis, linear discrimination analysis and soft independent modelling of class analogy. The study revealed that the half-range and central value transformation resulted in a better classification of the tea samples than that achieved using the classical normalisation. The loadings in the PCA for the long-term stability study, under both standard and robust conditions, were found to be similar to the drift trends only when the minimum-maximum transformation and the mean or maximum normalisations were used as data pre-treatments.
引用
收藏
页码:352 / 360
页数:9
相关论文
共 22 条
[1]   Effect of data preprocessing methods in near-infrared diffuse reflectance spectroscopy for the determination of the active compound in a pharmaceutical preparation [J].
Blanco, M ;
Coello, J ;
Iturriaga, H ;
Maspoch, S ;
delaPezuela, C .
APPLIED SPECTROSCOPY, 1997, 51 (02) :240-246
[2]  
BRERETON RG, 1990, CHEM APPL MATH STAT, P241
[3]  
BROWN SD, 1992, ANAL CHEM, V64, pR22
[4]  
*CAMO ASA, 1998, UNSCR US MAN, P45
[5]   On-line application of the orthogonal projection approach (OPA) and the soft independent modelling of class analogy approach (SIMCA) for the detection of the end point of a polymorph conversion reaction by near infrared spectroscopy (NIR) [J].
De Braekeleer, K ;
De Maesschalck, R ;
Hailey, PA ;
Sharp, DCA ;
Massart, DL .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 1999, 46 (02) :103-116
[6]   Evaluation of data pretreatment and model building methods for the determination of glucose from near-infrared single-beam spectra [J].
Ding, Q ;
Small, GW ;
Arnold, MA .
APPLIED SPECTROSCOPY, 1999, 53 (04) :402-414
[7]   Application and comparisons of chemometric techniques for calibration modelling using electrochemical/ICP-MS data for trace elements in UHQ water and humic acid matrices [J].
Donachie, A ;
Walmsley, AD ;
Haswell, SJ .
ANALYTICA CHIMICA ACTA, 1999, 378 (1-3) :235-243
[8]   Multivariate calibration modelling using electrochemical/inductively coupled plasma mass spectrometry data for trace elements in ultrahigh quality water and humic acid matrices [J].
Donachie, A ;
Walmsley, AD ;
Haswell, SJ .
ANALYTICAL COMMUNICATIONS, 1996, 33 (09) :293-296
[9]  
EINAX JW, 1997, CHEMOMETRICS ENV ANA, P140
[10]  
EINAX JW, 1997, CHEMOMETRICS ENV ANA, P78