Parallel genetic algorithm co-optimization of spectral pre-processing and wavelength selection for PLS regression

被引:49
作者
Devos, Olivier [1 ]
Duponchel, Ludovic [1 ]
机构
[1] Univ Lille 1 Sci & Technol, CNRS, Lab Spectrochim Infrarouge & Raman LASIR, UMR 8516, F-59655 Villeneuve Dascq, France
关键词
Genetic algorithm; Variable selection; Spectral pre-processing; PLS; NIRS; NEAR-INFRARED SPECTROSCOPY; VARIABLE SELECTION; MULTIVARIATE CALIBRATION; EVOLUTIONARY OPTIMIZATION; NEURAL-NETWORKS; CHEMOMETRICS; MODELS; CHEMISTRY;
D O I
10.1016/j.chemolab.2011.01.008
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Spectral pre-processing and variable selection are often used to produce PLS regression models with better prediction abilities. We proposed here to optimize simultaneously the spectral pre-processing and the variable selection for PLS regression. The method is based on parallel genetic algorithm with a unique chromosome coding both for pre-processing and variable selections. A pool of 31 pre-processing functions with various settings is tested. In the same chromosome several pre-processing steps can be combined. Three near infrared spectroscopic datasets have been used to evaluate the methodology. The efficacy of the co-optimization is evaluated by comparing the prediction ability of the PLS models with those after pre-processing optimization only. The effect of the number of successive pre-processing steps has been also tested. Concerning the different datasets used here, one can observe two different behaviors. In a first case the GA co-optimization procedure is found to perform well, leading to important improvement of the prediction ability especially when three consecutive pre-processing techniques are applied. In a second case, only the preprocessing optimization is enough to obtain an optimal model. All these models are optimal and more accurate compared to the classical models (build with the "trial and error" methods). (C) 2011 Elsevier B.V. All rights reserved.
引用
收藏
页码:50 / 58
页数:9
相关论文
共 29 条
  • [21] Mutual information for the selection of relevant variables in spectrometric nonlinear modelling
    Rossi, F
    Lendasse, A
    François, D
    Wertz, V
    Verleysen, M
    [J]. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2006, 80 (02) : 215 - 226
  • [22] Multi-Core Computation in Chemometrics: Case Studies of Voltammetric and NIR Spectrometric Analyses
    Soares, Anderson da Silva
    Galvao, Roberto K. H.
    Araujo, Mario Cesar U.
    Soares, Sofacles F. C.
    Pinto, Luiz Alberto
    [J]. JOURNAL OF THE BRAZILIAN CHEMICAL SOCIETY, 2010, 21 (09) : 1626 - 1634
  • [23] Theoretical justification of wavelength selection in PLS calibration development of a new algorithm
    Spiegelman, CH
    McShane, MJ
    Goetz, MJ
    Motamedi, M
    Yue, QL
    Coté, GL
    [J]. ANALYTICAL CHEMISTRY, 1998, 70 (01) : 35 - 44
  • [24] Wavelength selection for multivariate calibration using Tikhonov regularization
    Stout, Forrest
    Kalivas, John H.
    Heberger, Karoly
    [J]. APPLIED SPECTROSCOPY, 2007, 61 (01) : 85 - 95
  • [25] Review of Bayesian neural networks with an application to near infrared spectroscopy
    Thodberg, HH
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 1996, 7 (01): : 56 - 72
  • [26] Bayesian nonlinear model selection and neural networks: A conjugate prior approach
    Vila, JP
    Wagner, V
    Neveu, P
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 2000, 11 (02): : 265 - 278
  • [27] Evolutionary optimisation: a tutorial
    Wehrens, R
    Buydens, LMC
    [J]. TRAC-TRENDS IN ANALYTICAL CHEMISTRY, 1998, 17 (04) : 193 - 203
  • [28] Direct orthogonal signal correction
    Westerhuis, JA
    de Jong, S
    Smilde, AK
    [J]. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2001, 56 (01) : 13 - 25
  • [29] Variables selection methods in near-infrared spectroscopy
    Zou Xiaobo
    Zhao Jiewen
    Povey, Malcolm J. W.
    Holmes, Mel
    Mao Hanpin
    [J]. ANALYTICA CHIMICA ACTA, 2010, 667 (1-2) : 14 - 32