A Variable Selection Method of the Significance Multivariate Correlation Competitive Population Analysis for Near-Infrared Spectroscopy in Chemical Modeling
Cited by: 8
Authors:
Wang, Yuxi [1]
Jia, Zhenhong [1]
Yang, Jie [2]
Affiliations:
[1] Xinjiang Univ, Coll Informat Sci & Engn, Urumqi 830046, Peoples R China
[2] Shanghai Jiao Tong Univ, Inst Image Proc & Pattern Recognit, Shanghai 200240, Peoples R China
Keywords:
Spectrochemical analysis;
variable selection;
the significant multivariate correlation;
weighted bootstrap sampling;
model population analysis;
Monte Carlo sampling;
analytical techniques;
partial least squares method;
PARTIAL LEAST-SQUARES;
REGRESSION;
SHRINKAGE;
CALIBRATION;
PROJECTION;
STRATEGY;
SPACE;
OPTIMIZATION;
PERSPECTIVE;
WAVELENGTHS;
DOI:
10.1109/ACCESS.2019.2954115
Chinese Library Classification (CLC) number:
TP [Automation technology, computer technology];
Subject classification code:
0812;
Abstract:
The high dimensionality of spectral datasets makes it difficult to select the optimal subset of variables. This paper presents a new method for variable selection called the significant multivariate competitive population analysis (SMCPA), which combines ideas of significant multivariate correlation (SMC) and model population analysis, and employs weighted bootstrap sampling (WBS) and exponential decline function (EDF) competition methods. In this study, the values of the SMC distributions are used as an index for evaluating the importance of each wavelength. Then, based on the importance level of each wavelength, SMCPA sequentially selects N subsets of spectral wavelengths through N Monte Carlo sampling runs in an iterative and competitive procedure. In each sampling run, a fixed ratio of samples is used to build a calibrated partial least-squares (PLS) model, and SMC is then performed to obtain the score and threshold values. Next, based on the SMC scores, the key variables are selected in two steps: compulsory selection by the exponential decline function and competitive selection by adaptive weighted sampling. Finally, cross-validation (CV) is applied to select the optimal subset, that is, the one with the lowest root mean square error. The method is tested on three NIR spectral datasets and compared against three high-performance variable selection methods. The experimental results show that the proposed algorithm has the highest efficiency and the best selection effect of the methods compared, and can usually locate the optimal combination of key wavelength variables in a dataset. The evaluation result after PLS modeling is also the best.
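The abstract gives no formulas, so the following is only a minimal sketch of the two-step selection it describes (compulsory EDF cut, then competitive weighted bootstrap sampling). It assumes the exponentially decreasing function has the form commonly used in CARS-style methods, r_i = a * exp(-k * i) with r_1 = 1 and r_N = 2/p; the paper may use a different form. The function names `edf_ratio`, `weighted_bootstrap`, and `select_step` are hypothetical, and the scores are random placeholders rather than real SMC values, which would be recomputed from a PLS model in every Monte Carlo run.

```python
import math
import random


def edf_ratio(i, n_runs, n_vars):
    """Fraction of variables retained at run i (1-based).

    Assumed CARS-style form: r_i = a * exp(-k * i), with a and k chosen
    so that r_1 = 1 and r_N = 2 / n_vars.
    """
    k = math.log(n_vars / 2) / (n_runs - 1)
    a = (n_vars / 2) ** (1 / (n_runs - 1))
    return a * math.exp(-k * i)


def weighted_bootstrap(indices, weights, rng):
    """Draw len(indices) times with replacement, proportional to weights,
    and keep the distinct variables that were drawn at least once."""
    drawn = rng.choices(indices, weights=weights, k=len(indices))
    return sorted(set(drawn))


def select_step(scores, active, i, n_runs, n_vars, rng):
    """One iteration: compulsory EDF cut, then competitive weighted sampling."""
    n_keep = max(2, round(edf_ratio(i, n_runs, n_vars) * n_vars))
    # Compulsory step: keep only the top-scoring variables still active.
    ranked = sorted(active, key=lambda j: scores[j], reverse=True)[:n_keep]
    # Competitive step: higher-scoring variables are more likely to survive.
    weights = [scores[j] for j in ranked]
    return weighted_bootstrap(ranked, weights, rng)


# Illustration with placeholder scores (assumed strictly positive).
rng = random.Random(0)
n_vars, n_runs = 100, 20
scores = [rng.random() + 0.01 for _ in range(n_vars)]
active = list(range(n_vars))
subsets = []
for i in range(1, n_runs + 1):
    active = select_step(scores, active, i, n_runs, n_vars, rng)
    subsets.append(active)
```

In a fuller implementation, each of the N subsets collected in `subsets` would be scored by cross-validated RMSE of a PLS model, and the subset with the lowest error would be returned.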
Tian, Han (China Univ Min & Technol, Sch Informat & Control Engn, Xuzhou 221116, Peoples R China; Huai An Vocat Coll Informat Technol, Huaian 223003, Peoples R China)
Li, Ming (China Univ Min & Technol, Sch Informat & Control Engn, Xuzhou 221116, Peoples R China)
Wang, Yue (Huai An Vocat Coll Informat Technol, Huaian 223003, Peoples R China)
Sheng, Dinggao (Huai An Vocat Coll Informat Technol, Huaian 223003, Peoples R China)
Liu, Jun (Huai An Vocat Coll Informat Technol, Huaian 223003, Peoples R China)
Zhang, Linna (Huaiyin Inst Technol, Huaian 223003, Peoples R China)