A selective ensemble preprocessing strategy for near-infrared spectral quantitative analysis of complex samples

被引:94
作者
Bian, Xihui [1 ,2 ]
Wang, Kaiyi [1 ]
Tan, Erxuan [3 ]
Diwu, Pengyao [1 ]
Zhang, Fei [1 ]
Guo, Yugao [1 ]
机构
[1] Tiangong Univ, Sch Chem & Chem Engn, State Key Lab Separat Membranes & Membrane Proc, Tianjin 300387, Peoples R China
[2] Radboud Univ Nijmegen, IMM, Dept Analyt Chem, PV 61,POB 9010, NL-6500 GL Nijmegen, Netherlands
[3] Qinghai Univ, Sch Chem Engn, Xining 810016, Peoples R China
关键词
Preprocessing method; Ensemble; Full factorial design; Multivariate calibration; Near-infrared spectroscopy; Partial least squares; MULTIVARIATE CALIBRATION; SPECTROSCOPY; IDENTIFICATION; MODELS; DISCRIMINATION; WAVELET;
D O I
10.1016/j.chemolab.2019.103916
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Preprocessing of raw near-infrared (NIR) spectra is typically required prior to multivariate calibration since the measured spectra of complex samples are often subject to overwhelming background, light scattering, varying noises and other unexpected factors. Various preprocessing methods have been developed aimed at removing or reducing the interference of these effects. However, it is usually difficult to determine the best preprocessing method for a given data. Instead of selecting the best one, a selective ensemble preprocessing strategy is proposed for NIR spectral quantitative analysis. Firstly, numerous preprocessing methods and their combinations are obtained by full factorial design in order of baseline correction, scattering correction, smoothing and scaling. Then partial least squares (PLS) model is built for each preprocessing method. The models which have better predictions than PLS are selected and their predictions are averaged as the final prediction. The performance of the proposed method was tested with corn, blood and edible blend oil samples. Results demonstrate that the selective ensemble preprocessing method can give comparative or even better results than the traditional selected best preprocessing method. Therefore, in the framework of selective ensemble preprocessing, more accurate calibration can be obtained without searching the best preprocessing method.
引用
收藏
页数:8
相关论文
共 49 条
[1]   DBPPred-PDSD: Machine learning approach for prediction of DNA-binding proteins using Discrete Wavelet Transform and optimized integrated features space [J].
Ali, Farman ;
Kabir, Muhammad ;
Arif, Muhammad ;
Swati, Zar Nawab Khan ;
Khan, Zaheer Ullah ;
Ullah, Matee ;
Yu, Dong-Jun .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2018, 182 :21-30
[2]   STANDARD NORMAL VARIATE TRANSFORMATION AND DE-TRENDING OF NEAR-INFRARED DIFFUSE REFLECTANCE SPECTRA [J].
BARNES, RJ ;
DHANOA, MS ;
LISTER, SJ .
APPLIED SPECTROSCOPY, 1989, 43 (05) :772-777
[3]   A local pre-processing method for near-infrared spectra, combined with spectral segmentation and standard normal variate transformation [J].
Bi, Yiming ;
Yuan, Kailong ;
Xiao, Weiqiang ;
Wu, Jizhong ;
Shi, Chunyun ;
Xia, Jun ;
Chu, Guohai ;
Zhang, Guangxin ;
Zhou, Guojun .
ANALYTICA CHIMICA ACTA, 2016, 909 :30-40
[4]   Spectral quantitative analysis of complex samples based on the extreme learning machine [J].
Bian, Xi-Hui ;
Li, Shu-Juan ;
Fan, Meng-Ran ;
Guo, Yu-Gao ;
Chang, Na ;
Wang, Jiang-Jiang .
ANALYTICAL METHODS, 2016, 8 (23) :4674-4679
[5]   Ensemble calibration for the spectral quantitative analysis of complex samples [J].
Bian, Xihui ;
Diwu, Pengyao ;
Liu, Yirui ;
Liu, Peng ;
Li, Qian ;
Tan, Xiaoyao .
JOURNAL OF CHEMOMETRICS, 2018, 32 (11)
[6]   Robust boosting neural networks with random weights for multivariate calibration of complex samples [J].
Bian, Xihui ;
Diwu, Pengyao ;
Zhang, Caixia ;
Lin, Ligang ;
Chen, Guohui ;
Tan, Xiaoyao ;
Guo, Yugao ;
Cheng, Bowen .
ANALYTICA CHIMICA ACTA, 2018, 1009 :20-26
[7]   Rapid identification of milk samples by high and low frequency unfolded partial least squares discriminant analysis combined with near-infrared spectroscopy [J].
Bian, Xihui ;
Zhang, Caixia ;
Liu, Peng ;
Wei, Junfu ;
Tan, Xiaoyao ;
Lin, Ligang ;
Chang, Na ;
Guo, Yugao .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2017, 170 :96-101
[8]   High and low frequency unfolded partial least squares regression based on empirical mode decomposition for quantitative analysis of fuel oil samples [J].
Bian, Xihui ;
Li, Shujuan ;
Lin, Ligang ;
Tan, Xiaoyao ;
Fan, Qingjie ;
Li, Ming .
ANALYTICA CHIMICA ACTA, 2016, 925 :16-22
[9]   Rapid Determination of Metabolites in Bio-fluid Samples by Raman Spectroscopy and Optimum Combinations of Chemometric Methods [J].
Bian Xihui ;
Chen Da ;
Cai Wensheng ;
Grant, Edward ;
Shao Xueguang .
CHINESE JOURNAL OF CHEMISTRY, 2011, 29 (11) :2525-2532
[10]  
Brown C.D., 2000, THESIS