A hybrid variable selection method combining Fisher's linear discriminant combined population analysis and an improved binary cuckoo search algorithm

被引:2
作者
Chen, Shuobo [1 ]
Du, Kang [1 ]
Shan, Baoming [1 ]
Xu, Qilei [1 ]
Zhang, Fangkun [1 ]
机构
[1] Qingdao Univ Sci & Technol, Coll Automat & Elect Engn, Qingdao 266061, Peoples R China
基金
中国国家自然科学基金;
关键词
SOFT SHRINKAGE APPROACH; LEAST-SQUARES REGRESSION; MODEL; SPECTROSCOPY;
D O I
10.1039/d3ay01942j
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
In this paper, a novel hybrid variable selection method for model building by near-infrared (NIR) spectroscopy is proposed for composition measurement in industrial processes. A double-layer structure is designed for variable selection by combining Fisher's linear discriminant combined population analysis (FCPA) and an improved binary cuckoo search algorithm (IBCS). The Fisher classifier combined with model population analysis is used to select the variable interval wherein the useful variables are roughly located even when strong multicollinearity exists among spectral variables. Opposition-based learning (OBL) and jumping genes (JG) are introduced to improve the binary cuckoo search algorithm for the fine selection of key variables, thus avoiding the loss of excellent solutions due to randomness and the local optimum. Different variable selection methods were used to select variables for beer, corn, and diesel fuel datasets, and the partial least squares (PLS) algorithms were used to build calibration models to predict the original extract concentration of beer, the protein and starch content of corn, and the boiling point of diesel fuel, respectively. The results showed that the proposed PLS modeling method based on FCPA-IBCS has higher fitting accuracy and smaller prediction errors. In this paper, a novel hybrid variable selection method for model building by near-infrared (NIR) spectroscopy is proposed for composition measurement in industrial processes.
引用
收藏
页码:1021 / 1033
页数:13
相关论文
共 36 条
  • [1] Characterization of connective tissues using near-infrared spectroscopy and imaging
    Afara, Isaac O.
    Shaikh, Rubina
    Nippolainen, Ervin
    Querido, William
    Torniainen, Jari
    Sarin, Jaakko K.
    Kandel, Shital
    Pleshko, Nancy
    Toyras, Juha
    [J]. NATURE PROTOCOLS, 2021, 16 (02) : 1297 - 1329
  • [2] An integrated approach to the simultaneous selection of variables, mathematical pre-processing and calibration samples in partial least-squares multivariate calibration
    Allegrini, Franco
    Olivieri, Alejandro C.
    [J]. TALANTA, 2013, 115 : 755 - 760
  • [3] The principal problem with principal components regression
    Artigue, Heidi
    Smith, Gary
    [J]. COGENT MATHEMATICS & STATISTICS, 2019, 6
  • [4] Discretized butterfly optimization algorithm for variable selection in the rapid determination of cholesterol by near-infrared spectroscopy
    Bian, Xihui
    Zhao, Zizhen
    Liu, Jianwen
    Liu, Peng
    Shi, Huibing
    Tan, Xiaoyao
    [J]. ANALYTICAL METHODS, 2023, 15 (39) : 5190 - 5198
  • [5] Near infrared spectroscopic variable selection by a novel swarm intelligence algorithm for rapid quantification of high order edible blend oil
    Bian, Xihui
    Zhang, Rongling
    Liu, Peng
    Xiang, Yang
    Wang, Shuyu
    Tan, Xiaoyao
    [J]. SPECTROCHIMICA ACTA PART A-MOLECULAR AND BIOMOLECULAR SPECTROSCOPY, 2023, 284
  • [6] A bootstrapping soft shrinkage approach for variable selection in chemical modeling
    Deng, Bai-Chuan
    Yun, Yong-Huan
    Cao, Dong-Sheng
    Yin, Yu-Long
    Wang, Wei-Ting
    Lu, Hong-Mei
    Luo, Qian-Yi
    Liang, Yi-Zeng
    [J]. ANALYTICA CHIMICA ACTA, 2016, 908 : 63 - 74
  • [7] A new method for wavelength interval selection that intelligently optimizes the locations, widths and combinations of the intervals
    Deng, Bai-Chuan
    Yun, Yong-Huan
    Ma, Pan
    Lin, Chen-Chen
    Ren, Da-Bing
    Liang, Yi-Zeng
    [J]. ANALYST, 2015, 140 (06) : 1876 - 1885
  • [8] Identification of informative spectral ranges for predicting major chemical constituents in corn using NIR spectroscopy
    Fatemi, Ali
    Singh, Vijay
    Kamruzzaman, Mohammed
    [J]. FOOD CHEMISTRY, 2022, 383
  • [9] A Bootstrapping Soft Shrinkage Approach and Interval Random Variables Selection Hybrid Model for Variable Selection in Near-Infrared Spectroscopy
    Gamal Al-Kaf, Hasan Ali
    Mohammed Alduais, Nayef Abdulwahab
    Saad, Abdul-Malik H. Y.
    Chia, Kim Seng
    Mohsen, Abdulqader M.
    Alhussian, Hitham
    Haidar Mahdi, Ammar Abdo Mohammed
    Wan Salam, Wan Saiful-Islam
    [J]. IEEE ACCESS, 2020, 8 : 168036 - 168052
  • [10] García S, 2008, J MACH LEARN RES, V9, P2677