A Subset Selection Algorithm for Multivariate Modeling Based on the Spectral Variations

被引:0
|
作者
Li, Zhe [1 ,2 ]
Feng, Jinchao [3 ]
Jia, Kebin [4 ]
机构
[1] Beijing Lab Adv Informat Networks, Beijing, Peoples R China
[2] Beijing Univ Technol, Fac Informat, Beijing, Peoples R China
[3] Beijing Univ Technol, Fac Informat, Beijing Key Lab Computat Intelligence & Intellige, Beijing, Peoples R China
[4] Beijing Univ Technol, Beijing Adv Innovat Ctr Future Internet Technol, Beijing, Peoples R China
来源
2018 2ND INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING AND BIOINFORMATICS (ICBEB 2018) | 2018年
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Sample subset selection; Kennard-Stone algorithm; SPXY algorithm; PLS regression; Multivariate calibration; NIR spectroscopy; NEAR-INFRARED SPECTROSCOPY; NEURAL-NETWORKS; NIR SPECTROSCOPY; PARTICLE-SIZE; CLASSIFICATION; TEMPERATURE; CALIBRATION; PREDICTION; VARIABLES; DESIGN;
D O I
10.1145/3278198.3278205
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This paper proposes a subset selection method, named sample set partitioning based on joint x-y-z distances (SPXYZ) algorithm, for multivariate modelling. The proposed method is a modified version of the original sample set partitioning based on joint x-y distances (SPXY) algorithm. The contributions from the dependent variable (z) space for parameters that cause the systematic error in measured spectra, including external factors and inherent characteristics, are added to the original SPXY algorithm. Here, the z differences denotes the variability in the dimension of external disturbances and inherent characteristics. Based on two real world datasets, SPXYZ is employed with partial least-squares (PLS) to demonstrate the advantages of subset selection by adding the contributions from external factor, i.e., temperature and inherent characteristic, i.e., background components. We compare the prediction performance of SPXYZ-PLS model with other three PLS models using random sampling (RS), Kennard-Stone (KS) and SPXY. The prediction performance results from experimental studies showed that the prediction performance of SPXYZ-PLS is significantly better than the other models. Therefore, the proposed method is an alternative method of subset selection for calibration modeling.
引用
收藏
页码:154 / 159
页数:6
相关论文
共 50 条
  • [21] A Stable Gene Subset Selection Algorithm for Cancers
    Xie, Juanying
    Gao, Hongchao
    HEALTH INFORMATION SCIENCE (HIS 2015), 2015, 9085 : 111 - 122
  • [22] A new spectral variable selection pattern using competitive adaptive reweighted sampling combined with successive projections algorithm
    Tang, Guo
    Huang, Yue
    Tian, Kuangda
    Song, Xiangzhong
    Yan, Hong
    Hu, Jing
    Xiong, Yanmei
    Min, Shungeng
    ANALYST, 2014, 139 (19) : 4894 - 4902
  • [23] A New Proposed Feature Subset Selection Algorithm Based on Maximization of Gain Ratio
    Nagpal, Arpita
    Gaur, Deepti
    BIG DATA ANALYTICS, BDA 2015, 2015, 9498 : 181 - 197
  • [24] Wrapper Feature Subset Selection for Dimension Reduction Based on Ensemble Learning Algorithm
    Panthong, Rattanawadee
    Srivihok, Anongnart
    THIRD INFORMATION SYSTEMS INTERNATIONAL CONFERENCE 2015, 2015, 72 : 162 - 169
  • [25] Joint mutual information-based input variable selection for multivariate time series modeling
    Han, Min
    Ren, Weijie
    Liu, Xiaoxin
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2015, 37 : 250 - 257
  • [26] Spectral Multivariate Calibration with Wavelength Selection Using Variants of Tikhonov Regularization
    Ottaway, Joshua
    Kalivas, John H.
    Andries, Erik
    APPLIED SPECTROSCOPY, 2010, 64 (12) : 1388 - 1395
  • [27] Efficient wavenumber selection based on spectral fluctuation dividing and correlation-based clustering for calibration modeling
    Miyano, Takuya
    Fujiwara, Koichi
    Kano, Manabu
    Tanabe, Hideaki
    Nakagawa, Hiroshi
    Watanabe, Tomoyuki
    Minami, Hidemi
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2015, 148 : 85 - 94
  • [28] Feature subset selection by gravitational search algorithm optimization
    Han, XiaoHong
    Chang, XiaoMing
    Quan, Long
    Xiong, XiaoYan
    Li, JingXia
    Zhang, ZhaoXia
    Liu, Yi
    INFORMATION SCIENCES, 2014, 281 : 128 - 146
  • [29] MABUSE: A margin optimization based feature subset selection algorithm using boosting principles
    Garcia-Pedrajas, Nicolas
    Cerruela-Garcia, Gonzalo
    KNOWLEDGE-BASED SYSTEMS, 2022, 253
  • [30] Subset selection from large datasets for Kriging modeling
    Rennen, Gijs
    STRUCTURAL AND MULTIDISCIPLINARY OPTIMIZATION, 2009, 38 (06) : 545 - 569