A Subset Selection Algorithm for Multivariate Modeling Based on the Spectral Variations

被引:0
|
作者
Li, Zhe [1 ,2 ]
Feng, Jinchao [3 ]
Jia, Kebin [4 ]
机构
[1] Beijing Lab Adv Informat Networks, Beijing, Peoples R China
[2] Beijing Univ Technol, Fac Informat, Beijing, Peoples R China
[3] Beijing Univ Technol, Fac Informat, Beijing Key Lab Computat Intelligence & Intellige, Beijing, Peoples R China
[4] Beijing Univ Technol, Beijing Adv Innovat Ctr Future Internet Technol, Beijing, Peoples R China
来源
2018 2ND INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING AND BIOINFORMATICS (ICBEB 2018) | 2018年
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Sample subset selection; Kennard-Stone algorithm; SPXY algorithm; PLS regression; Multivariate calibration; NIR spectroscopy; NEAR-INFRARED SPECTROSCOPY; NEURAL-NETWORKS; NIR SPECTROSCOPY; PARTICLE-SIZE; CLASSIFICATION; TEMPERATURE; CALIBRATION; PREDICTION; VARIABLES; DESIGN;
D O I
10.1145/3278198.3278205
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This paper proposes a subset selection method, named sample set partitioning based on joint x-y-z distances (SPXYZ) algorithm, for multivariate modelling. The proposed method is a modified version of the original sample set partitioning based on joint x-y distances (SPXY) algorithm. The contributions from the dependent variable (z) space for parameters that cause the systematic error in measured spectra, including external factors and inherent characteristics, are added to the original SPXY algorithm. Here, the z differences denotes the variability in the dimension of external disturbances and inherent characteristics. Based on two real world datasets, SPXYZ is employed with partial least-squares (PLS) to demonstrate the advantages of subset selection by adding the contributions from external factor, i.e., temperature and inherent characteristic, i.e., background components. We compare the prediction performance of SPXYZ-PLS model with other three PLS models using random sampling (RS), Kennard-Stone (KS) and SPXY. The prediction performance results from experimental studies showed that the prediction performance of SPXYZ-PLS is significantly better than the other models. Therefore, the proposed method is an alternative method of subset selection for calibration modeling.
引用
收藏
页码:154 / 159
页数:6
相关论文
共 50 条
  • [1] A Spectral Wavelength Selection Algorithm Based on RMSECV Curve
    Zhou Yan
    Cao Hui
    Ju Lin-cang
    SPECTROSCOPY AND SPECTRAL ANALYSIS, 2011, 31 (02) : 492 - 495
  • [2] A novel variable selection algorithm based on neural network for near-infrared spectral modeling
    Zhang, Pengfei
    Xu, Zhuopin
    Ma, Huimin
    Zheng, Lei
    Li, Xiaohong
    Zhang, Zhiyi
    Wu, Yuejin
    Wang, Qi
    ANALYTICA CHIMICA ACTA, 2024, 1330
  • [3] Modeling for SSC and firmness detection of persimmon based on NIR hyperspectral imaging by sample partitioning and variables selection
    Wei, Xuan
    He, Jincheng
    Zheng, Shuhe
    Ye, Dapeng
    INFRARED PHYSICS & TECHNOLOGY, 2020, 105
  • [4] Research on Spectral Region Selection of Near Infrared Spectra Based on Genetic Algorithm
    Yong, Zhang
    2017 NINTH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS (IHMSC 2017), VOL 2, 2017, : 185 - 188
  • [5] Modeling of Soil Cation Exchange Capacity Based on Chemometrics, Various Spectral Transformations, and Multivariate Approaches in Some Soils of Arid Zones
    Mustafa, Abdel-Rahman A.
    Abdelsamie, Elsayed A.
    Mohamed, Elsayed Said
    Rebouh, Nazih Y.
    Shokr, Mohamed S.
    SUSTAINABILITY, 2024, 16 (16)
  • [6] An advanced ACO algorithm for feature subset selection
    Kashef, Shima
    Nezamabadi-pour, Hossein
    NEUROCOMPUTING, 2015, 147 : 271 - 279
  • [7] A wavelength selection method based on randomization test for near-infrared spectral analysis
    Xu, Heng
    Liu, Zhichao
    Cai, Wensheng
    Shao, Xueguang
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2009, 97 (02) : 189 - 193
  • [8] Wavelength Selection Algorithm Based on Minimum Correlation Coefficient for Multivariate Calibration
    Cheng Jie-hong
    Chen Zheng-guang
    Yi Shu-juan
    SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42 (03) : 719 - 725
  • [9] An efficient wavelength selection method based on the maximal information coefficient for multivariate spectral calibration
    Huang, Xin
    Luo, Yi-Ping
    Xia, Li
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2019, 194
  • [10] An efficient method of wavelength interval selection based on random frog for multivariate spectral calibration
    Yun, Yong-Huan
    Li, Hong-Dong
    Wood, Leslie R. E.
    Fan, Wei
    Wang, Jia-Jun
    Cao, Dong-Sheng
    Xu, Qing-Song
    Liang, Yi-Zeng
    SPECTROCHIMICA ACTA PART A-MOLECULAR AND BIOMOLECULAR SPECTROSCOPY, 2013, 111 : 31 - 36