Kernel-based calibration methods combined with multivariate feature selection to improve accuracy of near-infrared spectroscopic analysis

被引:14
作者
Lee, Junghye [1 ]
Chang, Kyeol [2 ]
Jun, Chi-Hyuck [1 ]
Cho, Rae-Kwang [3 ]
Chung, Hoeil [2 ]
Lee, Hyeseon [1 ]
机构
[1] Pohang Univ Sci & Technol, Dept Ind & Management Engn, Pohang 790784, South Korea
[2] Hanyang Univ, Res Inst Convergence Basic Sci, Dept Chem, Seoul 133791, South Korea
[3] Kyungpook Natl Univ, Sch Appl Biosci, Coll Agr & Life Sci, Taegu 702701, South Korea
基金
新加坡国家研究基金会;
关键词
Filter method; Kernel partial least squares regression; Kernel support vector regression; Variable importance in projection score; Weight vector coefficient; PARTIAL LEAST-SQUARES; SUPPORT VECTOR REGRESSION; VARIABLE SELECTION; NIR SPECTROSCOPY; GENETIC ALGORITHM; LOW-DENSITY; PLS; PREDICTION; CLASSIFICATION; ELIMINATION;
D O I
10.1016/j.chemolab.2015.08.009
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We present kernel-based calibration models combined with multivariate feature selection for complex quantitative near-infrared (NIR) spectroscopic analysis of three different types of sample. Because the spectra include hundreds of features (variables), an optimal selection of features that provide relevant information for target analysis improves the accuracy of spectroscopic analysis. For this purpose, we combined feature selection with kernel partial least squares regression and kernel support vector regression (K-SVR) by evaluating ranking of the features based on their variable importance in projection scores and weight vector coefficients, respectively. Then, the methods were applied to identify components in three datasets of NIR spectra. The kernel-based models without feature selection and the kernel-based models with other feature selection methods were also used for comparison. K-SVR combined with feature selection was effective when the spectral features of samples were complex and recognition of minute spectral variation was necessary for modeling. The combination of feature selection and kernel calibration model can improve the accuracy of spectral analysis by keeping optimal features. (C) 2015 Elsevier B.V. All rights reserved.
引用
收藏
页码:139 / 146
页数:8
相关论文
共 73 条
[1]  
Abe S, 2010, ADV PATTERN RECOGNIT, P1, DOI 10.1007/978-1-84996-098-4
[2]   Variable selection in wavelet regression models [J].
Alsberg, BK ;
Woodward, AM ;
Winson, MK ;
Rowland, JJ ;
Kell, DB .
ANALYTICA CHIMICA ACTA, 1998, 368 (1-2) :29-44
[3]   Determination of diesel quality parameters using support vector regression and near infrared spectroscopy for an in-line blending optimizer system [J].
Alves, Julio Cesar L. ;
Henriques, Claudete B. ;
Poppi, Ronei J. .
FUEL, 2012, 97 :710-717
[4]  
[Anonymous], SUPPORT VECTOR MACHI
[5]  
[Anonymous], 2000, The Nature of Statistical Learning Theory
[6]   Support vector machine regression (SVR/LS-SVM)-an alternative to neural networks (ANN) for analytical chemistry? Comparison of nonlinear methods on near infrared (NIR) spectroscopy data [J].
Balabin, Roman M. ;
Lomakina, Ekaterina I. .
ANALYST, 2011, 136 (08) :1703-1712
[7]   Neural network (ANN) approach to biodiesel analysis: Analysis of biodiesel density, kinematic viscosity, methanol and water contents using near infrared (NIR) spectroscopy [J].
Balabin, Roman M. ;
Lomakina, Ekaterina I. ;
Safieva, Ravilya Z. .
FUEL, 2011, 90 (05) :2007-2015
[8]  
Basak D., 2007, Efficient Learning Machines: Theories, Concepts, and Applications for Engineers and System Designers, V11, P203, DOI DOI 10.1007/978-1-4302-5990-9_4
[9]   Apples Nutraceutic Properties Evaluation Through a Visible and Near-Infrared Portable System [J].
Beghi, R. ;
Spinardi, A. ;
Bodria, L. ;
Mignani, I. ;
Guidetti, R. .
FOOD AND BIOPROCESS TECHNOLOGY, 2013, 6 (09) :2547-2554
[10]  
Bishop Christopher, 2006, Pattern Recognition and Machine Learning, DOI 10.1117/1.2819119