Feature selection using the Kalman filter for classification of multivariate data

被引:17
作者
Wu, W
Rutan, SC
Baldovin, A
Massart, DL
机构
[1] FREE UNIV BRUSSELS,INST PHARMACEUT,CHEMOAC,B-1090 BRUSSELS,BELGIUM
[2] VIRGINIA COMMONWEALTH UNIV,DEPT CHEM,RICHMOND,VA 23284
关键词
chemometrics; classification; feature selection; Kalman filter; infrared spectrometry; LEAST-SQUARES REGRESSION; WAVELENGTH SELECTION; MODEL; PLS;
D O I
10.1016/S0003-2670(96)00347-9
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
A Kalman filter is developed as a feature selection method and classifier for multivariate data. Three near-infrared (NIR) data sets and a pollution data set are analyzed. For the two most difficult data sets (data sets 1 and 3), the Kalman filter successfully selects the wavelengths which lead to very good results with a correct classification rate (CCR) equal to one. These results are much better than the best results obtained from regularized discriminant analysis (RDA) using Fourier transform Fl, principal component regression (PCA) and univariate feature selection methods as the variable reduction methods. For the second data set which consists of more than two classes, the Kalman filter gives similar results (CCR=1) to those of RDA. For the pollution data set (data set 4), the Kalman filter gives similar results to partial least-squares (PLS) using fewer variables. The disadvantage of the Kalman filter is that it needs more memory and more computing time than PLS. The potential hazards of overfitting have been considered.
引用
收藏
页码:11 / 22
页数:12
相关论文
共 20 条
[11]  
NORUSIS MJ, 1992, SPSS WINDOWS BASE SY, P347
[12]  
RUTAN SC, 1991, ANAL CHEM, V63, P1103
[13]   ADAPTIVE KALMAN FILTERING USED TO COMPENSATE FOR MODEL ERRORS IN MULTICOMPONENT METHODS [J].
RUTAN, SC ;
BROWN, SD .
ANALYTICA CHIMICA ACTA, 1984, 160 (JUN) :99-119
[14]   FAST ONLINE DIGITAL FILTERING [J].
RUTAN, SC .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 1989, 6 (03) :191-201
[15]  
RUTAN SC, 1990, J CHEMOMETR, V4, P103
[16]   Application of wavelet packet transform in pattern recognition of near-IR data [J].
Walczak, B ;
vandenBogaert, B ;
Massart, DL .
ANALYTICAL CHEMISTRY, 1996, 68 (10) :1742-1747
[17]   Comparison of regularized discriminant analysis, linear discriminant analysis and quadratic discriminant analysis, applied to NIR data [J].
Wu, W ;
Mallet, Y ;
Walczak, B ;
Penninckx, W ;
Massart, DL ;
Heuerding, S ;
Erni, F .
ANALYTICA CHIMICA ACTA, 1996, 329 (03) :257-265
[18]   Artificial neural networks in classification of NIR spectral data: Design of the training set [J].
Wu, W ;
Walczak, B ;
Massart, DL ;
Heuerding, S ;
Erni, F ;
Last, IR ;
Prebble, KA .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 1996, 33 (01) :35-46
[19]   SPECTRAL TRANSFORMATION AND WAVELENGTH SELECTION IN NEAR-INFRARED SPECTRA CLASSIFICATION [J].
WU, W ;
WALCZAK, B ;
MASSART, DL ;
PREBBLE, KA ;
LAST, IR .
ANALYTICA CHIMICA ACTA, 1995, 315 (03) :243-255
[20]  
WU W, IN PRESS ANAL CHIM A