Near-Infrared Wavelength-Selection Method Based on Joint Mutual Information and Weighted Bootstrap Sampling

被引:26
作者
Wang, Kai [1 ,2 ]
Du, Wenli [1 ,2 ]
Long, Jian [1 ,2 ]
机构
[1] East China Univ Sci & Technol, Sch Informat Sci & Engn, Minist Educ, Shanghai 200237, Peoples R China
[2] East China Univ Sci & Technol, Key Lab Adv Control & Optimizat Chem Proc, Minist Educ, Shanghai 200237, Peoples R China
基金
中国国家自然科学基金;
关键词
Mutual information; Random variables; Entropy; Predictive models; Spectroscopy; Indexes; Input variables; near infrared (NIR) model; wavelength selection; weighted bootstrap sampling; PARTIAL LEAST-SQUARES; VARIABLE SELECTION; CALIBRATION MODELS; REGRESSION-MODEL; NIR SPECTROSCOPY; ELIMINATION; ALGORITHM; OPTIMIZES;
D O I
10.1109/TII.2020.2972351
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Near-infrared (NIR) spectroscopy is widely used to estimate product quality and other key variables. Eliminating redundant variables is very important in constructing a high-quality NIR model. This article proposes a new wavelength-selection method for NIR spectroscopy based on joint mutual information and weighted bootstrap sampling (WBS). The method considers the combination effect of variables and involves the dynamic selection of wavelength in each iteration to increase the model prediction accuracy. The index based on joint mutual information is used to determine the importance of variables and thus accurately reflects the variable-combination effect. WBS is further used to dynamically adjust the importance of candidate variables, i.e., to increase the weights of samples with poor prediction results and decrease those of samples with accurate prediction. This process ensures that the subsequently selected wavelength focuses on inaccurately estimated samples. The performance of this method is demonstrated through three NIR datasets of gasoline, shootout, and diesel fuels. The proposed method is found to have better accuracy than the traditional partial-least-squares method, variable iterative space shrinkage approach, and several other wavelength-selection methods.
引用
收藏
页码:5884 / 5894
页数:11
相关论文
共 29 条
[1]   A Tutorial on Near Infrared Spectroscopy and Its Calibration [J].
Agelet, Lidia Esteve ;
Hurburgh, Charles R., Jr. .
CRITICAL REVIEWS IN ANALYTICAL CHEMISTRY, 2010, 40 (04) :246-260
[2]   Error Covariance Penalized Regression: A novel multivariate model combining penalized regression with multivariate error structure [J].
Allegrini, Franco ;
Braga, Jez W. B. ;
Moreira, Alessandro C. O. ;
Olivieri, Alejandro C. .
ANALYTICA CHIMICA ACTA, 2018, 1011 :20-27
[3]   Mutual information-based feature selection for intrusion detection systems [J].
Amiri, Fatemeh ;
Yousefi, MohammadMahdi Rezaei ;
Lucas, Caro ;
Shakery, Azadeh ;
Yazdani, Nasser .
JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2011, 34 (04) :1184-1199
[4]   Comparison of linear and nonlinear calibration models based on near infrared (NIR) spectroscopy data for gasoline properties prediction [J].
Balabin, Roman M. ;
Safieva, Ravilya Z. ;
Lomakina, Ekaterma I. .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2007, 88 (02) :183-188
[5]   Support vector machine regression (SVR/LS-SVM)-an alternative to neural networks (ANN) for analytical chemistry? Comparison of nonlinear methods on near infrared (NIR) spectroscopy data [J].
Balabin, Roman M. ;
Lomakina, Ekaterina I. .
ANALYST, 2011, 136 (08) :1703-1712
[6]   Neural network (ANN) approach to biodiesel analysis: Analysis of biodiesel density, kinematic viscosity, methanol and water contents using near infrared (NIR) spectroscopy [J].
Balabin, Roman M. ;
Lomakina, Ekaterina I. ;
Safieva, Ravilya Z. .
FUEL, 2011, 90 (05) :2007-2015
[7]   A variable selection method based on uninformative variable elimination for multivariate calibration of near-infrared spectra [J].
Cai, Wensheng ;
Li, Yankun ;
Shao, Xueguang .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2008, 90 (02) :188-194
[8]   Elimination of uninformative variables for multivariate calibration [J].
Centner, V ;
Massart, DL ;
deNoord, OE ;
deJong, S ;
Vandeginste, BM ;
Sterna, C .
ANALYTICAL CHEMISTRY, 1996, 68 (21) :3851-3858
[9]   Recursive Wavelength-Selection Strategy to Update Near-Infrared Spectroscopy Model with an Industrial Application [J].
Chen, Mulang ;
Khare, Swanand ;
Huang, Biao ;
Zhang, Haitao ;
Lau, Eric ;
Feng, Enbo .
INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 2013, 52 (23) :7886-7895
[10]   A bootstrapping soft shrinkage approach for variable selection in chemical modeling [J].
Deng, Bai-Chuan ;
Yun, Yong-Huan ;
Cao, Dong-Sheng ;
Yin, Yu-Long ;
Wang, Wei-Ting ;
Lu, Hong-Mei ;
Luo, Qian-Yi ;
Liang, Yi-Zeng .
ANALYTICA CHIMICA ACTA, 2016, 908 :63-74