Molecular spectroscopic wavelength selection using combined interval partial least squares and correlation coefficient optimization

被引:18
作者
Jiang, Weiwei [1 ]
Lu, Changhua [1 ,2 ]
Zhang, Yujun [2 ]
Ju, Wei [1 ]
Wang, Jizhou [1 ,3 ]
Xiao, Mingxia [1 ]
机构
[1] Hefei Univ Technol, Sch Comp Sci & Informat Engn, Hefei 230009, Anhui, Peoples R China
[2] Chinese Acad Sci, Anhui Inst Opt Fine Mech, Hefei 230031, Anhui, Peoples R China
[3] Hefei Univ, Dept Elect, Hefei 230061, Anhui, Peoples R China
基金
国家高技术研究发展计划(863计划);
关键词
NEAR-INFRARED SPECTROSCOPY; VARIABLE SELECTION; GENETIC ALGORITHM; NIR SPECTROSCOPY; REGRESSION; CLASSIFICATION; CHEMOMETRICS; PROTEIN; SIPLS; IPLS;
D O I
10.1039/c9ay00898e
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Wavelength selection plays a vital role in employing near-infrared spectroscopy for analyzing samples. Existing wavelength selection algorithms present certain drawbacks that can be mitigated by combining algorithms. In this study, we employed a combination of algorithms to quantitatively analyze corn components using near-infrared spectroscopy data. We combined Savitzky-Golay (SG) preprocessing, the correlation coefficient (CC) method, and synergy interval partial least squares (siPLS) algorithms to propose CC-SiPLS and CC-SG-SiPLS methods. The results of applying full-spectrum partial least squares (PLS), correlation coefficient partial least squares (CC-PLS), synergy interval partial least squares (SiPLS), CC-SiPLS, and CC-SG-SiPLS methods to the near-infrared spectral wavelength selection were compared. The results showed that the mathematical models established from the spectral data after wavelength selection using CC, SiPLS, CC-SiPLS, and CC-SG-SiPLS were simplified, and the numbers of wavelengths were 33.6% (CC) and 14.3% (SiPLS), 11.1% (CC-SiPLS), and 6.3% (CC-SG-SiPLS) of that using the full spectrum. The accuracy of predicting the oil content of corn was improved compared to PLS. The CC-SG-SIPLS wavelength selection algorithm combined with the preprocessing method reduced the number of wavelengths from 700 to 44 and the model complexity was the most simplified. The root mean square error in prediction (RMSEP) and relative percent deviation (RPD) were 0.0552 and 2.5706, respectively, demonstrating adequate prediction accuracy. This result indicates that a combination strategy provides an effective way for multiple waveband selection, and that CC-SG-SiPLS can provide high analysis accuracy using molecular absorption bands composed of several wavelength intervals. Thus, this algorithm is an effective and robust wavelength selection strategy.
引用
收藏
页码:3108 / 3116
页数:9
相关论文
共 27 条
  • [11] Vis-NIR wavelength selection for non-destructive discriminant analysis of breed screening of transgenic sugarcane
    Guo, Haosong
    Chen, Jiemei
    Pan, Tao
    Wang, Jihua
    Cao, Gan
    [J]. ANALYTICAL METHODS, 2014, 6 (21) : 8810 - 8816
  • [12] Heman A., 2016, Eng. Agric. Environ. Food, V9, P280, DOI [10.1016/j.eaef.2016.02.002, DOI 10.1016/J.EAEF.2016.02.002]
  • [13] Rapid detection of three quality parameters and classification of wine based on Vis-NIR spectroscopy with wavelength selection by ACO and CARS algorithms
    Hu, Leqian
    Yin, Chunling
    Ma, Shuai
    Liu, Zhimin
    [J]. SPECTROCHIMICA ACTA PART A-MOLECULAR AND BIOMOLECULAR SPECTROSCOPY, 2018, 205 : 574 - 581
  • [14] Variable selection for multivariate calibration using a genetic algorithm: prediction of additive concentrations in polymer films from Fourier transform-infrared spectral data
    Leardi, R
    Seasholtz, MB
    Pell, RJ
    [J]. ANALYTICA CHIMICA ACTA, 2002, 461 (02) : 189 - 200
  • [15] Authenticity identification and classification of Rhodiola species in traditional Tibetan medicine based on Fourier transform near-infrared spectroscopy and chemometrics analysis
    Li, Tao
    Su, Chen
    [J]. SPECTROCHIMICA ACTA PART A-MOLECULAR AND BIOMOLECULAR SPECTROSCOPY, 2018, 204 : 131 - 140
  • [16] Quantitative analysis of glycated albumin in serum based on ATR-FTIR spectrum combined with SiPLS and SVM
    Li, Yuanpeng
    Li, Fucui
    Yang, Xinhao
    Guo, Liu
    Huang, Furong
    Chen, Zhenqiang
    Chen, Xingdan
    Zheng, Shifu
    [J]. SPECTROCHIMICA ACTA PART A-MOLECULAR AND BIOMOLECULAR SPECTROSCOPY, 2018, 201 : 249 - 257
  • [17] Evaluating the reliability of spectral variables selected by subsampling methods
    Lin, Zhaozhou
    Pan, Xiaoning
    Xu, Bing
    Zhang, Jiayu
    Shi, Xinyuan
    Qiao, Yanjiang
    [J]. JOURNAL OF CHEMOMETRICS, 2015, 29 (02) : 87 - 95
  • [18] Recent Advances in Wavelength Selection Techniques for Hyperspectral Image Processing in the Food Industry
    Liu, Dan
    Sun, Da-Wen
    Zeng, Xin-An
    [J]. FOOD AND BIOPROCESS TECHNOLOGY, 2014, 7 (02) : 307 - 323
  • [19] An Optimal Selection Method of Samples of Calibration Set and Validation Set for Spectral Multivariate Analysis
    Liu Wei
    Zhao Zhong
    Yuan Hong-fu
    Song Chun-feng
    Li Xiao-yu
    [J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2014, 34 (04) : 947 - 951
  • [20] Interval partial least-squares regression (iPLS):: A comparative chemometric study with an example from near-infrared spectroscopy
    Norgaard, L
    Saudland, A
    Wagner, J
    Nielsen, JP
    Munck, L
    Engelsen, SB
    [J]. APPLIED SPECTROSCOPY, 2000, 54 (03) : 413 - 419