New near infrared wavelength selection algorithm based on Monte-Carlo method

被引:0
作者
Hong M. [1 ,2 ,3 ]
Wen Q. [3 ]
Wen Z. [1 ,2 ]
机构
[1] Key Laboratory of Micro/Nano Devices and System Technology, Chongqing University
[2] Microsystem Research Center of Chongqing University
[3] School of Software Engineering, Chongqing University
来源
Guangxue Xuebao/Acta Optica Sinica | 2010年 / 30卷 / 12期
关键词
Monte-Carlo; Near infrared; Spectroscopy; Wavelength selection;
D O I
10.3788/AOS20103012.3637
中图分类号
学科分类号
摘要
Based on the feature of the (NIR) spectra, this paper analyses the method of wavelength selection using the partial least squaresc (PLS) regression coefficients and points out the existing problems, then proposes a new method for selecting wavelengths. It normalizes the PLS regression coefficients into the probability of the selected corresponding wavelengths, then a Monte-Carlo simulation based on the aforementioned probability is calculated. Some PLS models are constructed and evaluated using different random wavelengths combinations. The model with minimum predictive error is retained and the corresponding wavelength combinations are selected. This procedure can be iterated using the previous selected wavelengths to select fewer and fewer wavelengths. This method is tested on 3 NIR datasets and compared with the PLS-based uninformative variable elimination (UVE-PLS) and genetic algorithm (GA). Experimental results show that this method could select fewer wavelengths without sacrificing the complexity and predictive ability of the PLS model and could effectively improve the accuracy and stability of the wavelength selection.
引用
收藏
页码:3637 / 3642
页数:5
相关论文
共 21 条
  • [11] Mitchell T.J., Beauchamp J.J., Bayesian variable selection in linear regression, Journal of the American Statistical Association, 83, 404, pp. 1023-1032, (1988)
  • [12] Chen T., Martin E., Bayesian linear regression and variable selection for spectroscopic calibration, Analytica Chimica Acta, 631, pp. 13-21, (2009)
  • [13] Frenich A.G., Jouan-Rimbaud D., Massart D.L., Et al., Wavelength selection method for multicomponent spectrophotometric determinations using partial least squares, Analyst., 120, pp. 2787-2792, (1995)
  • [14] Teofilo R.F., Martins J.P.A., Ferreira M.M.C., Sorting variables by using informative vectors as a strategy for feature selection in multivariate regression, Journal of Chemometrics, 23, 1, pp. 32-48, (2009)
  • [15] Centner V., Massart D.L., de Noord O.E., Et al., Elimination of uninformative variables for multivariate calibration, Analytical Chemistry, 68, 21, pp. 3851-3858, (1996)
  • [16] Stone M., Brooks R.J., Continuum regression: cross-validated sequentially constructed prediction embracing ordinary least squares, partial least squares and principal components regression, Journal of the Royal Statistical Society. Series B(Methodological), 52, 2, pp. 237-269, (1990)
  • [17] Martens H.A., Dardenne P., Validation and verification of regression in small data sets, Chemometrics and Intelligent Laboratory Systems, 44, 1-2, pp. 99-121, (1998)
  • [18] Xu Q.S., Liang Y.Z., Monte Carlo cross validation, Chemometrics and Intelligent Laboratory Systems, 56, 1, pp. 1-11, (2001)
  • [19] Validation of regression models: methods and examples, Technometrics, 19, 4, pp. 415-428, (1977)
  • [20] Kalivas J.H., Two data sets of near infrared spectra, Chemometrics and Intelligent Laboratory Systems, 37, 2, pp. 255-259, (1997)