Improved leaps and bounds variable selection algorithm based on principal component analysis

被引:1
作者
Zhang, Wenjun [1 ]
Wang, Xin [1 ]
Chen, Lin [1 ]
机构
[1] Hebei Univ Technol, Sch Chem Engn, Dept Appl Chem, Tianjin 300130, Peoples R China
关键词
Feature selection; Variable selection; Multiple linear regression; Leaps and bounds; FEATURE SUBSET-SELECTION; GENETIC ALGORITHMS; EVOLUTIONARY; QSAR;
D O I
10.1016/j.chemolab.2014.09.017
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, a new variable selection algorithm is described, based on leaps and bounds regression. The algorithm removes the limit of the traditional algorithm that the descriptors must be less than the samples, by replacing the original variables in a subset evaluation with a small number of principal components. Two different sizes of variables data sets were employed to investigate the performance of the new algorithm. The result shows that the improved algorithm can obtain optimal or good sub-optimal subsets when a different number of principal components are used. (C) 2014 Elsevier B.V. All rights reserved.
引用
收藏
页码:76 / 83
页数:8
相关论文
共 40 条
[1]  
Al-Ani A., 2005, INT J COMPUTATIONAL, V2, P53
[2]  
[Anonymous], 1 AFR EUR C CHEM RAB
[3]  
[Anonymous], DRAGON 6 0 SOFTWARE
[4]  
[Anonymous], 1984, OLSHEN STONE CLASSIF, DOI 10.2307/2530946
[5]  
[Anonymous], 1989, MULTIVARIATE CALIBRA
[6]  
[Anonymous], 1995, 1995 IEEE INT C
[7]   The successive projections algorithm for variable selection in spectroscopic multicomponent analysis [J].
Araújo, MCU ;
Saldanha, TCB ;
Galvao, RKH ;
Yoneyama, T ;
Chame, HC ;
Visani, V .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2001, 57 (02) :65-73
[8]   Cross-validation as the objective function for variable-selection techniques [J].
Baumann, K .
TRAC-TRENDS IN ANALYTICAL CHEMISTRY, 2003, 22 (06) :395-406
[9]   Benchmarking Ligand-Based Virtual High-Throughput Screening with the PubChem Database [J].
Butkiewicz, Mariusz ;
Lowe, Edward W., Jr. ;
Mueller, Ralf ;
Mendenhall, Jeffrey L. ;
Teixeira, Pedro L. ;
Weaver, C. David ;
Meiler, Jens .
MOLECULES, 2013, 18 (01) :735-756
[10]   Waveband selection for hyperspectral data: optimal feature selection [J].
Casasent, D ;
Chen, XW .
OPTICAL PATTERN RECOGNITION XIV, 2003, 5106 :259-270