Molecular spectroscopic wavelength selection using combined interval partial least squares and correlation coefficient optimization

被引:18
作者
Jiang, Weiwei [1 ]
Lu, Changhua [1 ,2 ]
Zhang, Yujun [2 ]
Ju, Wei [1 ]
Wang, Jizhou [1 ,3 ]
Xiao, Mingxia [1 ]
机构
[1] Hefei Univ Technol, Sch Comp Sci & Informat Engn, Hefei 230009, Anhui, Peoples R China
[2] Chinese Acad Sci, Anhui Inst Opt Fine Mech, Hefei 230031, Anhui, Peoples R China
[3] Hefei Univ, Dept Elect, Hefei 230061, Anhui, Peoples R China
基金
国家高技术研究发展计划(863计划);
关键词
NEAR-INFRARED SPECTROSCOPY; VARIABLE SELECTION; GENETIC ALGORITHM; NIR SPECTROSCOPY; REGRESSION; CLASSIFICATION; CHEMOMETRICS; PROTEIN; SIPLS; IPLS;
D O I
10.1039/c9ay00898e
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Wavelength selection plays a vital role in employing near-infrared spectroscopy for analyzing samples. Existing wavelength selection algorithms present certain drawbacks that can be mitigated by combining algorithms. In this study, we employed a combination of algorithms to quantitatively analyze corn components using near-infrared spectroscopy data. We combined Savitzky-Golay (SG) preprocessing, the correlation coefficient (CC) method, and synergy interval partial least squares (siPLS) algorithms to propose CC-SiPLS and CC-SG-SiPLS methods. The results of applying full-spectrum partial least squares (PLS), correlation coefficient partial least squares (CC-PLS), synergy interval partial least squares (SiPLS), CC-SiPLS, and CC-SG-SiPLS methods to the near-infrared spectral wavelength selection were compared. The results showed that the mathematical models established from the spectral data after wavelength selection using CC, SiPLS, CC-SiPLS, and CC-SG-SiPLS were simplified, and the numbers of wavelengths were 33.6% (CC) and 14.3% (SiPLS), 11.1% (CC-SiPLS), and 6.3% (CC-SG-SiPLS) of that using the full spectrum. The accuracy of predicting the oil content of corn was improved compared to PLS. The CC-SG-SIPLS wavelength selection algorithm combined with the preprocessing method reduced the number of wavelengths from 700 to 44 and the model complexity was the most simplified. The root mean square error in prediction (RMSEP) and relative percent deviation (RPD) were 0.0552 and 2.5706, respectively, demonstrating adequate prediction accuracy. This result indicates that a combination strategy provides an effective way for multiple waveband selection, and that CC-SG-SiPLS can provide high analysis accuracy using molecular absorption bands composed of several wavelength intervals. Thus, this algorithm is an effective and robust wavelength selection strategy.
引用
收藏
页码:3108 / 3116
页数:9
相关论文
共 27 条
[11]   Vis-NIR wavelength selection for non-destructive discriminant analysis of breed screening of transgenic sugarcane [J].
Guo, Haosong ;
Chen, Jiemei ;
Pan, Tao ;
Wang, Jihua ;
Cao, Gan .
ANALYTICAL METHODS, 2014, 6 (21) :8810-8816
[12]  
Heman A., 2016, Eng. Agric. Environ. Food, V9, P280, DOI [10.1016/j.eaef.2016.02.002, DOI 10.1016/J.EAEF.2016.02.002]
[13]   Rapid detection of three quality parameters and classification of wine based on Vis-NIR spectroscopy with wavelength selection by ACO and CARS algorithms [J].
Hu, Leqian ;
Yin, Chunling ;
Ma, Shuai ;
Liu, Zhimin .
SPECTROCHIMICA ACTA PART A-MOLECULAR AND BIOMOLECULAR SPECTROSCOPY, 2018, 205 :574-581
[14]   Variable selection for multivariate calibration using a genetic algorithm: prediction of additive concentrations in polymer films from Fourier transform-infrared spectral data [J].
Leardi, R ;
Seasholtz, MB ;
Pell, RJ .
ANALYTICA CHIMICA ACTA, 2002, 461 (02) :189-200
[15]   Authenticity identification and classification of Rhodiola species in traditional Tibetan medicine based on Fourier transform near-infrared spectroscopy and chemometrics analysis [J].
Li, Tao ;
Su, Chen .
SPECTROCHIMICA ACTA PART A-MOLECULAR AND BIOMOLECULAR SPECTROSCOPY, 2018, 204 :131-140
[16]   Quantitative analysis of glycated albumin in serum based on ATR-FTIR spectrum combined with SiPLS and SVM [J].
Li, Yuanpeng ;
Li, Fucui ;
Yang, Xinhao ;
Guo, Liu ;
Huang, Furong ;
Chen, Zhenqiang ;
Chen, Xingdan ;
Zheng, Shifu .
SPECTROCHIMICA ACTA PART A-MOLECULAR AND BIOMOLECULAR SPECTROSCOPY, 2018, 201 :249-257
[17]   Evaluating the reliability of spectral variables selected by subsampling methods [J].
Lin, Zhaozhou ;
Pan, Xiaoning ;
Xu, Bing ;
Zhang, Jiayu ;
Shi, Xinyuan ;
Qiao, Yanjiang .
JOURNAL OF CHEMOMETRICS, 2015, 29 (02) :87-95
[18]   Recent Advances in Wavelength Selection Techniques for Hyperspectral Image Processing in the Food Industry [J].
Liu, Dan ;
Sun, Da-Wen ;
Zeng, Xin-An .
FOOD AND BIOPROCESS TECHNOLOGY, 2014, 7 (02) :307-323
[19]   An Optimal Selection Method of Samples of Calibration Set and Validation Set for Spectral Multivariate Analysis [J].
Liu Wei ;
Zhao Zhong ;
Yuan Hong-fu ;
Song Chun-feng ;
Li Xiao-yu .
SPECTROSCOPY AND SPECTRAL ANALYSIS, 2014, 34 (04) :947-951
[20]   Interval partial least-squares regression (iPLS):: A comparative chemometric study with an example from near-infrared spectroscopy [J].
Norgaard, L ;
Saudland, A ;
Wagner, J ;
Nielsen, JP ;
Munck, L ;
Engelsen, SB .
APPLIED SPECTROSCOPY, 2000, 54 (03) :413-419