Near-Infrared Wavelength-Selection Method Based on Joint Mutual Information and Weighted Bootstrap Sampling

被引:26
作者
Wang, Kai [1 ,2 ]
Du, Wenli [1 ,2 ]
Long, Jian [1 ,2 ]
机构
[1] East China Univ Sci & Technol, Sch Informat Sci & Engn, Minist Educ, Shanghai 200237, Peoples R China
[2] East China Univ Sci & Technol, Key Lab Adv Control & Optimizat Chem Proc, Minist Educ, Shanghai 200237, Peoples R China
基金
中国国家自然科学基金;
关键词
Mutual information; Random variables; Entropy; Predictive models; Spectroscopy; Indexes; Input variables; near infrared (NIR) model; wavelength selection; weighted bootstrap sampling; PARTIAL LEAST-SQUARES; VARIABLE SELECTION; CALIBRATION MODELS; REGRESSION-MODEL; NIR SPECTROSCOPY; ELIMINATION; ALGORITHM; OPTIMIZES;
D O I
10.1109/TII.2020.2972351
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Near-infrared (NIR) spectroscopy is widely used to estimate product quality and other key variables. Eliminating redundant variables is very important in constructing a high-quality NIR model. This article proposes a new wavelength-selection method for NIR spectroscopy based on joint mutual information and weighted bootstrap sampling (WBS). The method considers the combination effect of variables and involves the dynamic selection of wavelength in each iteration to increase the model prediction accuracy. The index based on joint mutual information is used to determine the importance of variables and thus accurately reflects the variable-combination effect. WBS is further used to dynamically adjust the importance of candidate variables, i.e., to increase the weights of samples with poor prediction results and decrease those of samples with accurate prediction. This process ensures that the subsequently selected wavelength focuses on inaccurately estimated samples. The performance of this method is demonstrated through three NIR datasets of gasoline, shootout, and diesel fuels. The proposed method is found to have better accuracy than the traditional partial-least-squares method, variable iterative space shrinkage approach, and several other wavelength-selection methods.
引用
收藏
页码:5884 / 5894
页数:11
相关论文
共 29 条
[11]   A new method for wavelength interval selection that intelligently optimizes the locations, widths and combinations of the intervals [J].
Deng, Bai-Chuan ;
Yun, Yong-Huan ;
Ma, Pan ;
Lin, Chen-Chen ;
Ren, Da-Bing ;
Liang, Yi-Zeng .
ANALYST, 2015, 140 (06) :1876-1885
[12]   A novel variable selection approach that iteratively optimizes variable space using weighted binary matrix sampling [J].
Deng, Bai-chuan ;
Yun, Yong-huan ;
Liang, Yi-zeng ;
Yi, Lun-zhao .
ANALYST, 2014, 139 (19) :4836-4845
[13]  
GILBERT V, 2016, IEEE PHOTONICS J, V8
[14]   Practical Determination of Solid Fat Content in Fats and Oils by Single-Wavelength Near-Infrared Analysis [J].
Grossi, Marco ;
Valli, Enrico ;
Glicerina, Virginia Teresa ;
Rocculi, Pietro ;
Toschi, Tullia Gallina ;
Ricco, Bruno .
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2020, 69 (02) :585-592
[15]   A novel adaptive algorithm with near-infrared spectroscopy and its application in online gasoline blending processes [J].
He, Kaixun ;
Qian, Feng ;
Cheng, Hui ;
Du, Wenli .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2015, 140 :117-125
[16]   Online updating of NIR model and its industrial application via adaptive wavelength selection and local regression strategy [J].
He, Kaixun ;
Cheng, Hui ;
Du, Wenli ;
Qian, Feng .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2014, 134 :79-88
[17]   On a partial least squares regression model for asymmetric data with a chemical application in mining [J].
Huerta, Mauricio ;
Leiva, Victor ;
Liu, Shuangzhe ;
Rodriguez, Marcelo ;
Villegas, Danny .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2019, 190 :55-68
[18]   A PLS regression model using NIR spectroscopy for on-line monitoring of the biodiesel production reaction [J].
Killner, Mario H. M. ;
Rohwedder, Jarbas J. R. ;
Pasquini, Celio .
FUEL, 2011, 90 (11) :3268-3273
[19]  
Leardi R., 2010, J CHEMOMETR, V14, P643, DOI DOI 10.1002/1099-128X(200009/12)14:5/6
[20]   Key wavelengths screening using competitive adaptive reweighted sampling method for multivariate calibration [J].
Li, Hongdong ;
Liang, Yizeng ;
Xu, Qingsong ;
Cao, Dongsheng .
ANALYTICA CHIMICA ACTA, 2009, 648 (01) :77-84