A three-stage wavelength selection algorithm for near-infrared spectroscopy calibration

被引:0
作者
Feng, Xi-Yao [1 ]
Chen, Zheng-Guang [1 ]
Yi, Shu-Juan [2 ]
Wang, Peng-Hui [3 ]
机构
[1] Heilongjiang Bayi Agr Univ, Coll Informat & Elect Engn, Daqing 163319, Peoples R China
[2] Heilongjiang Bayi Agr Univ, Coll Engn, Daqing 163319, Peoples R China
[3] Daqing Oilfield Environm Monitoring Stn, Daqing 163319, Peoples R China
基金
中国国家自然科学基金;
关键词
Near-infrared spectroscopy; Wavelength selection; Correlation coefficient; Stepwise regression; PARTIAL LEAST-SQUARES; VARIABLE SELECTION; REGRESSION; MODEL;
D O I
10.1016/j.saa.2024.125029
中图分类号
O433 [光谱学];
学科分类号
0703 ; 070302 ;
摘要
The near-infrared spectral data is highly high dimensional and contains redundant information, it is necessary to identify the most representative characteristic wavelengths before modeling to improve model accuracy and reliability. At present, there are many methods for selecting the characteristic wavelengths of NIR spectroscopy, but the collinearity among wavelengths is still a main issue that leads to poor model effects. Therefore, this study proposes a three-stage wavelength selection algorithm (Stage III) to reduce redundancy in NIR spectral data and collinearity between wavelength variables, resulting in a simpler and more accurate predictive model. The research uses a public NIR data set of corn samples as its subject. Initially, the wavelengths with the higher correlation coefficients are chosen after calculating the relationship coefficients between every wavelength vector and the concentration vector. On this basis, the correlation coefficients between the vectors of each wavelength point are calculated, and those wavelength points with smaller correlation coefficients with other wavelength points are selected. Ultimately, the stepwise regression analysis selects the wavelengths that provide substantial value to the model as the variables for modeling, leading to the development of a multiple linear regression model. The results show that the model using the three-stage wavelength selection algorithm outperforms those using the full spectrum, Stages I and Stage II, and the coefficient of determination of the test set of the Stage III-MLR model achieved an accuracy of 0.9360. Instead of the successive projections algorithm (SPA), uninformative variable elimination (UVE), and competitive adaptive reweighted sampling (CARS), Stage III is
引用
收藏
页数:9
相关论文
共 25 条
[1]   The successive projections algorithm for variable selection in spectroscopic multicomponent analysis [J].
Araújo, MCU ;
Saldanha, TCB ;
Galvao, RKH ;
Yoneyama, T ;
Chame, HC ;
Visani, V .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2001, 57 (02) :65-73
[2]   Elimination of uninformative variables for multivariate calibration [J].
Centner, V ;
Massart, DL ;
deNoord, OE ;
deJong, S ;
Vandeginste, BM ;
Sterna, C .
ANALYTICAL CHEMISTRY, 1996, 68 (21) :3851-3858
[3]  
Chen Bin Chen Bin, 2005, Transactions of the Chinese Society of Agricultural Engineering, V21, P99
[4]   A method for calibration and validation subset partitioning [J].
Galvao, RKH ;
Araujo, MCU ;
José, GE ;
Pontes, MJC ;
Silva, EC ;
Saldanha, TCB .
TALANTA, 2005, 67 (04) :736-740
[5]   PARTIAL LEAST-SQUARES REGRESSION - A TUTORIAL [J].
GELADI, P ;
KOWALSKI, BR .
ANALYTICA CHIMICA ACTA, 1986, 185 :1-17
[6]  
Harrell FE, 2015, SPRINGER SER STAT, P359, DOI 10.1007/978-3-319-19425-7_15
[7]   Using an optimal CC-PLSR-RBFNN model and NIR spectroscopy for the starch content determination in corn [J].
Jiang, Hao ;
Lu, Jiangang .
SPECTROCHIMICA ACTA PART A-MOLECULAR AND BIOMOLECULAR SPECTROSCOPY, 2018, 196 :131-140
[8]   Molecular spectroscopic wavelength selection using combined interval partial least squares and correlation coefficient optimization [J].
Jiang, Weiwei ;
Lu, Changhua ;
Zhang, Yujun ;
Ju, Wei ;
Wang, Jizhou ;
Xiao, Mingxia .
ANALYTICAL METHODS, 2019, 11 (24) :3108-3116
[9]   Leveraging multiple linear regression for wavelength selection [J].
Lemos, Tony ;
Kalivas, John H. .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2017, 168 :121-127
[10]   Key wavelengths screening using competitive adaptive reweighted sampling method for multivariate calibration [J].
Li, Hongdong ;
Liang, Yizeng ;
Xu, Qingsong ;
Cao, Dongsheng .
ANALYTICA CHIMICA ACTA, 2009, 648 (01) :77-84