An automated procedure to predict the number of components in spectroscopic data

被引:127
作者
Elbergali, A [1 ]
Nygren, J [1 ]
Kubista, M [1 ]
机构
[1] Chalmers Univ Technol, Dept Biochem & Biophys, SE-41390 Gothenburg, Sweden
关键词
chemometrics; indicator functions; spectroscopic data;
D O I
10.1016/S0003-2670(98)00640-0
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
We have compared various statistical methods to estimate the number of components that contribute to a set of spectra. The methods are tested both on simulated and on experimental data. No assumptions are made about noise level, since this in most experimental situations is unknown. For tests that formally require such information we have devised novel criteria for their predictions. The criteria have been integrated with the NIPALS algorithm to create a routine that in an automated way predicts the number of components. We find that the methods almost always predict the correct number of components when the quality of data is high. Also for multi-component samples and at high-noise levels most of these methods make satisfactory predictions. Those that gave the overall best results were the factor indicator function (IND) and the imbedded error function (IE). The F-test also worked well, but it has the disadvantage that a significance level must be chosen rather arbitrarily. The residual standard deviation (RSD), the root mean square (RMS), the chi-squared and the residual percentage variance (RPV) tests also gave satisfactory results. Less good were the eigenvalue (EV) and the reduced eigenvalue (REV). The ability of all indicators to predict the number of components was significantly improved when the degree of digitalization of the spectra was increased. (C) 1999 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:143 / 158
页数:16
相关论文
共 31 条
[1]  
[Anonymous], MULTIVAR BEHAV RES
[2]  
[Anonymous], 1989, MULTIVARIATE CALIBRA
[3]   TESTS OF SIGNIFICANCE IN FACTOR ANALYSIS [J].
Bartlett, M. S. .
BRITISH JOURNAL OF PSYCHOLOGY-STATISTICAL SECTION, 1950, 3 :77-85
[4]  
COX D, 1984, ANAL SURVIVAL DATA C
[5]   CROSS-VALIDATORY CHOICE OF THE NUMBER OF COMPONENTS FROM A PRINCIPAL COMPONENT ANALYSIS [J].
EASTMENT, HT ;
KRZANOWSKI, WJ .
TECHNOMETRICS, 1982, 24 (01) :73-77
[6]   INFLUENCE OF PREDICTED ELUTION REGIONS ON THE PERFORMANCE OF METHODS FOR EVOLUTIONARY FACTOR-ANALYSIS AS APPLIED TO HIGH-PERFORMANCE LIQUID-CHROMATOGRAPHY [J].
ELBERGALI, AK ;
BRERETON, RG .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 1995, 27 (01) :55-71
[7]   RESOLVABILITY INDEXES AND VARIABLE SELECTION IN FACTOR-ANALYSIS OF DIODE-ARRAY HIGH-PERFORMANCE LIQUID-CHROMATOGRAPHY - DEVELOPMENT OF AN APPROACH FOR MULTI-PEAK CLUSTERS AND APPLICATION TO MIXTURES OF CHLOROPHYLL DEGRADATION PRODUCTS [J].
ELBERGALI, AK ;
BRERETON, RG ;
RAHMANI, A .
ANALYST, 1995, 120 (08) :2207-2216
[8]   INFLUENCE OF NOISE, PEAK POSITION AND SPECTRAL SIMILARITIES ON RESOLVABILITY OF DIODE-ARRAY HIGH-PERFORMANCE LIQUID-CHROMATOGRAPHY BY EVOLUTIONARY FACTOR-ANALYSIS [J].
ELBERGALI, AK ;
BRERETON, RG .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 1994, 23 (01) :97-106
[9]   Influence of the method of calculation of noise thresholds on wavelength selection in window factor analysis of diode array high-performance liquid chromatography [J].
Elbergali, AK ;
Brereton, RG ;
Rahmani, A .
ANALYST, 1996, 121 (05) :585-590
[10]   Critical evaluation of two F-tests for selecting the number of factors in abstract factor analysis [J].
Faber, K ;
Kowalski, BR .
ANALYTICA CHIMICA ACTA, 1997, 337 (01) :57-71