Gaussian Process Regression and Its Application in Near-Infrared Spectroscopy Analysis

被引:7
|
作者
Feng Ai-ming [1 ]
Fang Li-min [1 ]
Lin Min [1 ]
机构
[1] China Jiliang Univ, Coll Metrol & Measurement Engn, Hangzhou 310018, Peoples R China
关键词
Gaussian process; Near-infrared spectroscopy; Monte Carlo cross validation; Uninformative variable elimination; Quantitative analysis; CALIBRATION; VARIABLES; SELECTION;
D O I
10.3964/j.issn.1000-0593(2011)06-1514-04
中图分类号
O433 [光谱学];
学科分类号
0703 ; 070302 ;
摘要
Gaussian process (GP) is applied in the present paper as a chemometric method to explore the complicated relationship between the near infrared (NIR) spectra and ingredients. After the outliers were detected by Monte Carlo cross validation (MCCV) method and removed from dataset, different preprocessing methods, such as multiplicative scatter correction (MSC), smoothing and derivate, were tried for the best performance of the models. Furthermore, uninformative variable elimination (UVE) was introduced as a variable selection technique and the characteristic wavelengths obtained were further employed as input for modeling. A public dataset with 80 NIR spectra of corn was introduced as an example for evaluating the new algorithm. The optimal models for oil, starch and protein were obtained by the GP regression method. The performance of the final models were evaluated according to the root mean square error of calibration (RMSEC), root mean square error of cross-validation (RMSECV), root mean square error of prediction (RMSEP) and correlation coefficient (r). The models give good calibration ability with r values above 0.99 and the prediction ability is also satisfactory with r values higher than 0.96. The overall results demonstrate that GP algorithm is an effective chemometric method and is promising for the NIR analysis.
引用
收藏
页码:1514 / 1517
页数:4
相关论文
共 11 条
  • [1] [Anonymous], 1996, ADV NEURAL INFORM PR
  • [2] BURNS DA, 2005, HDB NEAR INFRARED AN
  • [3] Elimination of uninformative variables for multivariate calibration
    Centner, V
    Massart, DL
    deNoord, OE
    deJong, S
    Vandeginste, BM
    Sterna, C
    [J]. ANALYTICAL CHEMISTRY, 1996, 68 (21) : 3851 - 3858
  • [4] Bayesian linear regression and variable selection for spectroscopic calibration
    Chen, Tao
    Martin, Elaine
    [J]. ANALYTICA CHIMICA ACTA, 2009, 631 (01) : 13 - 21
  • [5] Fang LM, 2008, CHINESE J ANAL CHEM, V36, P815
  • [6] Detection of the Main Quality Indicators in Red Wine with Infrared Spectroscopy Based on FastICA and Neural Network
    Fang Li-min
    Lin Min
    [J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2009, 29 (08) : 2083 - 2086
  • [7] Gaussian processes:: A method for automatic QSAR Modeling of ADME properties
    Obrezanova, Olga
    Csanyi, Gabor
    Gola, Joelle M. R.
    Segall, Matthew D.
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2007, 47 (05) : 1847 - 1857
  • [8] YAN Yan-lu, 2005, NEAR INFRARED SPECTR, P1
  • [9] Comprehensive comparison of eight statistical modelling methods used in quantitative structure-retention relationship studies for liquid chromatographic retention times of peptides generated by protease digestion of the Escherichia coli proteome
    Zhou, Peng
    Tian, Feifei
    Lv, Fenglin
    Shang, Zhicai
    [J]. JOURNAL OF CHROMATOGRAPHY A, 2009, 1216 (15) : 3107 - 3116
  • [10] Variables selection methods in near-infrared spectroscopy
    Zou Xiaobo
    Zhao Jiewen
    Povey, Malcolm J. W.
    Holmes, Mel
    Mao Hanpin
    [J]. ANALYTICA CHIMICA ACTA, 2010, 667 (1-2) : 14 - 32