Feature selection using the Kalman filter for classification of multivariate data

被引:17
作者
Wu, W
Rutan, SC
Baldovin, A
Massart, DL
机构
[1] FREE UNIV BRUSSELS,INST PHARMACEUT,CHEMOAC,B-1090 BRUSSELS,BELGIUM
[2] VIRGINIA COMMONWEALTH UNIV,DEPT CHEM,RICHMOND,VA 23284
关键词
chemometrics; classification; feature selection; Kalman filter; infrared spectrometry; LEAST-SQUARES REGRESSION; WAVELENGTH SELECTION; MODEL; PLS;
D O I
10.1016/S0003-2670(96)00347-9
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
A Kalman filter is developed as a feature selection method and classifier for multivariate data. Three near-infrared (NIR) data sets and a pollution data set are analyzed. For the two most difficult data sets (data sets 1 and 3), the Kalman filter successfully selects the wavelengths which lead to very good results with a correct classification rate (CCR) equal to one. These results are much better than the best results obtained from regularized discriminant analysis (RDA) using Fourier transform Fl, principal component regression (PCA) and univariate feature selection methods as the variable reduction methods. For the second data set which consists of more than two classes, the Kalman filter gives similar results (CCR=1) to those of RDA. For the pollution data set (data set 4), the Kalman filter gives similar results to partial least-squares (PLS) using fewer variables. The disadvantage of the Kalman filter is that it needs more memory and more computing time than PLS. The potential hazards of overfitting have been considered.
引用
收藏
页码:11 / 22
页数:12
相关论文
共 20 条
[1]  
BALDOVIN A, IN PRESS ANALYST
[2]   PREDICTIVE ABILITY OF REGRESSION-MODELS .2. SELECTION OF THE BEST PREDICTIVE PLS MODEL [J].
BARONI, M ;
CLEMENTI, S ;
CRUCIANI, G ;
COSTANTINO, G ;
RIGANELLI, D ;
OBERRAUCH, E .
JOURNAL OF CHEMOMETRICS, 1992, 6 (06) :347-356
[3]   THE KALMAN FILTER IN ANALYTICAL-CHEMISTRY [J].
BROWN, SD .
ANALYTICA CHIMICA ACTA, 1986, 181 :1-29
[4]  
CENTNER V, IN PRESS ANAL CHEM
[5]   INTERMEDIATE LEAST-SQUARES REGRESSION METHOD [J].
FRANK, IE .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 1987, 1 (03) :233-242
[6]   Wavelength selection method for multicomponent spectrophotometric determinations using partial least squares [J].
Frenich, AG ;
JouanRimbaud, D ;
Massart, DL ;
Kuttatharmmakul, S ;
Galera, MM ;
Vidal, JLM .
ANALYST, 1995, 120 (12) :2787-2792
[7]   PARTIAL LEAST-SQUARES REGRESSION - A TUTORIAL [J].
GELADI, P ;
KOWALSKI, BR .
ANALYTICA CHIMICA ACTA, 1986, 185 :1-17
[8]   COMPARISON OF MULTIVARIATE METHODS BASED ON LATENT VECTORS AND METHODS BASED ON WAVELENGTH SELECTION FOR THE ANALYSIS OF NEAR-INFRARED SPECTROSCOPIC DATA [J].
JOUANRIMBAUD, D ;
WALCZAK, B ;
MASSART, DL ;
LAST, IR ;
PREBBLE, KA .
ANALYTICA CHIMICA ACTA, 1995, 304 (03) :285-295
[9]   INTERACTIVE VARIABLE SELECTION (IVS) FOR PLS .1. THEORY AND ALGORITHMS [J].
LINDGREN, F ;
GELADI, P ;
RANNAR, S ;
WOLD, S .
JOURNAL OF CHEMOMETRICS, 1994, 8 (05) :349-363
[10]  
Marten H., 1991, MULTIVARIATE CALIBRA