Molecular spectroscopic wavelength selection using combined interval partial least squares and correlation coefficient optimization

Cited by: 18
Authors
Jiang, Weiwei [1]
Lu, Changhua [1,2]
Zhang, Yujun [2]
Ju, Wei [1]
Wang, Jizhou [1,3]
Xiao, Mingxia [1]
Affiliations
[1] Hefei Univ Technol, Sch Comp Sci & Informat Engn, Hefei 230009, Anhui, Peoples R China
[2] Chinese Acad Sci, Anhui Inst Opt Fine Mech, Hefei 230031, Anhui, Peoples R China
[3] Hefei Univ, Dept Elect, Hefei 230061, Anhui, Peoples R China
Funding
National High Technology Research and Development Program of China (863 Program);
Keywords
NEAR-INFRARED SPECTROSCOPY; VARIABLE SELECTION; GENETIC ALGORITHM; NIR SPECTROSCOPY; REGRESSION; CLASSIFICATION; CHEMOMETRICS; PROTEIN; SIPLS; IPLS;
DOI
10.1039/c9ay00898e
Chinese Library Classification (CLC) number
O65 [Analytical Chemistry];
Discipline classification code
070302 ; 081704 ;
Abstract
Wavelength selection plays a vital role in near-infrared (NIR) spectroscopic analysis of samples. Existing wavelength selection algorithms each have drawbacks that can be mitigated by combining them. In this study, we used a combination of algorithms to quantitatively analyze corn components from near-infrared spectroscopy data. We combined Savitzky-Golay (SG) preprocessing, the correlation coefficient (CC) method, and synergy interval partial least squares (SiPLS) to propose the CC-SiPLS and CC-SG-SiPLS methods. We compared the results of applying full-spectrum partial least squares (PLS), correlation coefficient partial least squares (CC-PLS), SiPLS, CC-SiPLS, and CC-SG-SiPLS to near-infrared wavelength selection. Wavelength selection with CC, SiPLS, CC-SiPLS, and CC-SG-SiPLS simplified the resulting models: the numbers of wavelengths retained were 33.6% (CC), 14.3% (SiPLS), 11.1% (CC-SiPLS), and 6.3% (CC-SG-SiPLS) of the full spectrum. The accuracy of predicting the oil content of corn improved relative to full-spectrum PLS. The CC-SG-SiPLS wavelength selection algorithm, combined with preprocessing, reduced the number of wavelengths from 700 to 44 and yielded the simplest model. Its root mean square error of prediction (RMSEP) and relative percent deviation (RPD) were 0.0552 and 2.5706, respectively, demonstrating adequate prediction accuracy. These results indicate that a combination strategy is an effective way to select multiple wavebands, and that CC-SG-SiPLS can provide high analysis accuracy using molecular absorption bands composed of several wavelength intervals. Thus, this algorithm is an effective and robust wavelength selection strategy.
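As a rough illustration of the correlation coefficient (CC) step and the two reported metrics, the following is a minimal sketch and not the authors' implementation. It assumes that CC selection keeps wavelengths whose absolute Pearson correlation with the reference values meets a threshold, and that RPD is the sample standard deviation of the reference values divided by RMSEP (a common convention; the abstract gives no formula). The function names, the threshold, and the toy data are hypothetical.

```python
import math


def pearson(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0  # constant input: treat as uncorrelated
    return sxy / math.sqrt(sxx * syy)


def cc_select(spectra, reference, threshold):
    """Indices of wavelengths with |r| >= threshold (assumed CC criterion).

    spectra: list of samples, each a list of absorbances per wavelength.
    reference: measured property (e.g. oil content) per sample.
    """
    selected = []
    for j in range(len(spectra[0])):
        column = [row[j] for row in spectra]
        if abs(pearson(column, reference)) >= threshold:
            selected.append(j)
    return selected


def rmsep(y_true, y_pred):
    """Root mean square error of prediction."""
    sq = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    return math.sqrt(sq / len(y_true))


def rpd(y_true, y_pred):
    """Assumed definition: sample SD of reference values over RMSEP."""
    n = len(y_true)
    mean = sum(y_true) / n
    sd = math.sqrt(sum((t - mean) ** 2 for t in y_true) / (n - 1))
    return sd / rmsep(y_true, y_pred)


# Hypothetical toy data: 4 samples, 3 wavelengths.
toy_spectra = [
    [0.10, 0.50, 0.31],
    [0.20, 0.40, 0.29],
    [0.30, 0.55, 0.33],
    [0.40, 0.45, 0.30],
]
toy_oil = [1.0, 2.0, 3.0, 4.0]
kept = cc_select(toy_spectra, toy_oil, threshold=0.9)

# Share of wavelengths retained in the abstract's best case (44 of 700).
retained_percent = round(100 * 44 / 700, 1)
```

In a fuller pipeline, the retained wavelengths would then feed an interval-based PLS model; that step is omitted here because the abstract does not specify interval counts or the number of latent variables.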
Pages: 3108-3116
Page count: 9