A Subset Selection Algorithm for Multivariate Modeling Based on the Spectral Variations
被引:0
|
作者:
Li, Zhe
论文数: 0引用数: 0
h-index: 0
机构:
Beijing Lab Adv Informat Networks, Beijing, Peoples R China
Beijing Univ Technol, Fac Informat, Beijing, Peoples R ChinaBeijing Lab Adv Informat Networks, Beijing, Peoples R China
Li, Zhe
[1
,2
]
Feng, Jinchao
论文数: 0引用数: 0
h-index: 0
机构:
Beijing Univ Technol, Fac Informat, Beijing Key Lab Computat Intelligence & Intellige, Beijing, Peoples R ChinaBeijing Lab Adv Informat Networks, Beijing, Peoples R China
Feng, Jinchao
[3
]
Jia, Kebin
论文数: 0引用数: 0
h-index: 0
机构:
Beijing Univ Technol, Beijing Adv Innovat Ctr Future Internet Technol, Beijing, Peoples R ChinaBeijing Lab Adv Informat Networks, Beijing, Peoples R China
Jia, Kebin
[4
]
机构:
[1] Beijing Lab Adv Informat Networks, Beijing, Peoples R China
[2] Beijing Univ Technol, Fac Informat, Beijing, Peoples R China
[3] Beijing Univ Technol, Fac Informat, Beijing Key Lab Computat Intelligence & Intellige, Beijing, Peoples R China
[4] Beijing Univ Technol, Beijing Adv Innovat Ctr Future Internet Technol, Beijing, Peoples R China
来源:
2018 2ND INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING AND BIOINFORMATICS (ICBEB 2018)
|
2018年
This paper proposes a subset selection method, named sample set partitioning based on joint x-y-z distances (SPXYZ) algorithm, for multivariate modelling. The proposed method is a modified version of the original sample set partitioning based on joint x-y distances (SPXY) algorithm. The contributions from the dependent variable (z) space for parameters that cause the systematic error in measured spectra, including external factors and inherent characteristics, are added to the original SPXY algorithm. Here, the z differences denotes the variability in the dimension of external disturbances and inherent characteristics. Based on two real world datasets, SPXYZ is employed with partial least-squares (PLS) to demonstrate the advantages of subset selection by adding the contributions from external factor, i.e., temperature and inherent characteristic, i.e., background components. We compare the prediction performance of SPXYZ-PLS model with other three PLS models using random sampling (RS), Kennard-Stone (KS) and SPXY. The prediction performance results from experimental studies showed that the prediction performance of SPXYZ-PLS is significantly better than the other models. Therefore, the proposed method is an alternative method of subset selection for calibration modeling.