Comparison of algorithms that select features for pattern classifiers

被引:604
作者
Kudo, M
Sklansky, J
机构
[1] Hokkaido Univ, Grad Sch Engn, Div Syst & Informat Engn, Sapporo, Hokkaido 0608628, Japan
[2] Univ Calif Irvine, Dept Elect Engn, Irvine, CA 92697 USA
[3] Natl Sci Fdn, Japan US Cooperat Sci Program, Washington, DC 20550 USA
[4] Japan Soc Promot Sci, Tokyo, Japan
关键词
feature selection; monotonicity; genetic algorithms; leave-one-out method; k-nearest-neighbor method;
D O I
10.1016/S0031-3203(99)00041-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A comparative study of algorithms for large-scale feature selection (where the number of features is over 50) is carried out. In the study, the goodness of a feature subset is measured by leave-one-out correct-classification rate of a nearest-neighbor (1-NN) classifier and many practical problems are used. A unified way is given to compare algorithms having dissimilar objectives. Based on the results of many experiments, we give guidelines for the use of feature selection algorithms. Especially, it is shown that sequential floating search methods are suitable for small- and medium-scale problems and genetic algorithms are suitable for large-scale problems. (C) 1999 Pattern Recognition Society. Published by Elsevier Science Ltd. All rights reserved.
引用
收藏
页码:25 / 41
页数:17
相关论文
共 16 条
[1]  
[Anonymous], 1996, UCI REPOSITORY MACHI
[2]  
FERRI FJ, 1994, PATTERN RECOGN, V4, P403
[3]   FEATURE-SELECTION FOR AUTOMATIC CLASSIFICATION OF NON-GAUSSIAN DATA [J].
FOROUTAN, I ;
SKLANSKY, J .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1987, 17 (02) :187-198
[4]  
HOLZ HJ, 1994, PATTERN RECOGN, V4, P473
[5]   Feature selection: Evaluation, application, and small sample performance [J].
Jain, A ;
Zongker, D .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1997, 19 (02) :153-158
[6]  
Kittler J., 1978, Pattern Recognition and Signal Processing, P41
[7]   FEATURE-SELECTION BASED ON THE STRUCTURAL INDEXES OF CATEGORIES [J].
KUDO, M ;
SHIMBO, M .
PATTERN RECOGNITION, 1993, 26 (06) :891-901
[8]  
Kudo M., 1998, LECT NOTES COMPUTER, V1451, P548
[9]  
NARENDRA P, 1977, IEEE T COMPUT, V26, P917, DOI 10.1109/TC.1977.1674939
[10]   FLOATING SEARCH METHODS IN FEATURE-SELECTION [J].
PUDIL, P ;
NOVOVICOVA, J ;
KITTLER, J .
PATTERN RECOGNITION LETTERS, 1994, 15 (11) :1119-1125