Identifying critical variables of principal components for unsupervised feature selection

被引:60
作者
Mao, KZ [1 ]
机构
[1] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 2263, Singapore
来源
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS | 2005年 / 35卷 / 02期
关键词
backward elimination; forward selection; principal components analysis (PCA); unsupervised feature selection;
D O I
10.1109/TSMCB.2004.843269
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Principal components analysis (PCA) is probably the best-known approach to unsupervised dimensionality reduction. However, axes of the lower-dimensional space, i.e., principal components (PCs), are a set of new variables carrying no clear physical meanings. Thus, interpretation of results obtained in the lower-dimensional PCA space and data acquisition for test samples still involve all of the original measurements. To deal with this problem, we develop two algorithms to link the physically meaningless PCs back to a subset of original measurements. The main idea of the algorithms is to evaluate and select feature subsets based on their capacities to reproduce sample projections on principal axes. The strength of the new algorithms is that the computaion complexity involved is significantly reduced, compared with the data structural similarity-based feature evaluation [20].
引用
收藏
页码:339 / 344
页数:6
相关论文
共 24 条
[1]  
[Anonymous], 2003, Statistical pattern recognition
[2]  
Blake C.L., 1998, UCI repository of machine learning databases
[3]   Dimensionality reduction of unsupervised data [J].
Dash, M ;
Liu, H ;
Yao, L .
NINTH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 1997, :532-539
[4]  
DEVANEY M., 1997, P 14 INT C MACH LEAR, P92
[5]  
Devijver P., 1982, PATTERN RECOGN
[6]  
DUCH W, DATASETS USED CLASSI
[7]  
Dy J. G., 2000, ICML '00, P247, DOI DOI 10.5555/645529.657797
[8]   Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring [J].
Golub, TR ;
Slonim, DK ;
Tamayo, P ;
Huard, C ;
Gaasenbeek, M ;
Mesirov, JP ;
Coller, H ;
Loh, ML ;
Downing, JR ;
Caligiuri, MA ;
Bloomfield, CD ;
Lander, ES .
SCIENCE, 1999, 286 (5439) :531-537
[9]   Feature selection: Evaluation, application, and small sample performance [J].
Jain, A ;
Zongker, D .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1997, 19 (02) :153-158
[10]   Statistical pattern recognition: A review [J].
Jain, AK ;
Duin, RPW ;
Mao, JC .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2000, 22 (01) :4-37