An efficient wavelength selection method based on the maximal information coefficient for multivariate spectral calibration

被引:12
作者
Huang, Xin [1 ]
Luo, Yi-Ping [1 ]
Xia, Li [2 ]
机构
[1] Hunan City Univ, Dept Stat, Yiyang 413000, Peoples R China
[2] Hunan City Univ, Sch Chem & Environm Engn, Yiyang 413000, Peoples R China
关键词
Multivariate calibration; Maximal information coefficient; PLS regression; Wavelength selection; LEAST-SQUARES REGRESSION; VARIABLE SELECTION; ELIMINATION;
D O I
10.1016/j.chemolab.2019.103872
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Spectral data on the modern spectroscopic instrument are commonly of high co-linearity and contain a large number of spectral variables, which may cause the poor predictive performance of the developed model. To address this problem, a novel method for wavelength selection, named maximal information coefficient screening combined with PLS regression (MICPLS), is proposed. MIC can capture a wide range of relationships between feature variables and target variable, including both functional and non-functional relationships. By employing the simple but effective MIC screening, MICPLS can rapidly and efficiently select strongly correlated spectral variables related to the response. The performance of MICPLS is investigated with two real spectroscopic datasets. Compared with commonly used methods for variable selection based on informative criterions including variable importance in projection (VIP) and selectivity ratio (SR), the results show the proposed method can improve the prediction performance, which is a good and promising strategy for variable selection in spectral analysis.
引用
收藏
页数:6
相关论文
共 30 条
  • [1] Variable selection in regression-a tutorial
    Andersen, C. M.
    Bro, R.
    [J]. JOURNAL OF CHEMOMETRICS, 2010, 24 (11-12) : 728 - 737
  • [2] A variable selection method based on uninformative variable elimination for multivariate calibration of near-infrared spectra
    Cai, Wensheng
    Li, Yankun
    Shao, Xueguang
    [J]. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2008, 90 (02) : 188 - 194
  • [3] Elimination of uninformative variables for multivariate calibration
    Centner, V
    Massart, DL
    deNoord, OE
    deJong, S
    Vandeginste, BM
    Sterna, C
    [J]. ANALYTICAL CHEMISTRY, 1996, 68 (21) : 3851 - 3858
  • [4] A novel variable selection approach that iteratively optimizes variable space using weighted binary matrix sampling
    Deng, Bai-chuan
    Yun, Yong-huan
    Liang, Yi-zeng
    Yi, Lun-zhao
    [J]. ANALYST, 2014, 139 (19) : 4836 - 4845
  • [5] Fan Jianqing, 2006, ARXIVMATH0602133
  • [6] McTwo: a two-step feature selection algorithm based on maximal information coefficient
    Ge, Ruiquan
    Zhou, Manli
    Luo, Youxi
    Meng, Qinghan
    Mai, Guoqin
    Ma, Dongli
    Wang, Guoqing
    Zhou, Fengfeng
    [J]. BMC BIOINFORMATICS, 2016, 17
  • [7] Using elastic net regression to perform spectrally relevant variable selection
    Giglio, Cannon
    Brown, Steven D.
    [J]. JOURNAL OF CHEMOMETRICS, 2018, 32 (08)
  • [8] Hasegawa T., 2001, HDB VIBRATIONAL SPEC, P2293
  • [9] Screening for linearly and nonlinearly related variables in predictive cheminformatic models
    Hemmateenejad, Bahram
    Baumann, Knut
    [J]. JOURNAL OF CHEMOMETRICS, 2018, 32 (04)
  • [10] Hopkins D.W., 2003, NIR news, V14, P10, DOI DOI 10.1255/NIRN.735