Feature selection with kernel class separability

被引:154
作者
Wang, Lei [1 ]
机构
[1] Australian Natl Univ, Res Sch Informat Sci & Engn, Dept Informat Engn, Canberra, ACT 0200, Australia
基金
澳大利亚研究理事会;
关键词
kernel class separability; feature selection; Support Vector Machines; Kernel Fisher Discriminant Analysis; pattern classification;
D O I
10.1109/TPAMI.2007.70799
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Classification can often benefit from efficient feature selection. However, the presence of linearly nonseparable data, quick response requirement, small sample problem, and noisy features makes the feature selection quite challenging. In this work, a class separability criterion is developed in a high-dimensional kernel space, and feature selection is performed by the maximization of this criterion. To make this feature selection approach work, the issues of automatic kernel parameter tuning, numerical stability, and regularization for multiparameter optimization are addressed. Theoretical analysis uncovers the relationship of this criterion to the radius- margin bound of the Support Vector Machines (SVMs), the Kernel Fisher Discriminant Analysis (KFDA), and the kernel alignment criterion, thus providing more insight into feature selection with this criterion. This criterion is applied to a variety of selection modes using different search strategies. Extensive experimental study demonstrates its efficiency in delivering fast and robust feature selection.
引用
收藏
页码:1534 / 1546
页数:13
相关论文
共 20 条
[1]  
AN LTH, 2002, P 1 INT WORKSH GLOB, P131
[2]  
Bo LF, 2006, NEURAL COMPUT, V18, P961, DOI 10.1162/089976606775774642
[3]   Efficient leave-one-out cross-validation of kernel Fisher discriminant classifiers [J].
Cawley, GC ;
Talbot, NLC .
PATTERN RECOGNITION, 2003, 36 (11) :2585-2592
[4]  
CHANG CC, 2001, LIBSVM LIB SUPPORT V, P15
[5]   Choosing multiple parameters for support vector machines [J].
Chapelle, O ;
Vapnik, V ;
Bousquet, O ;
Mukherjee, S .
MACHINE LEARNING, 2002, 46 (1-3) :131-159
[6]  
Cherkassky V, 1997, IEEE Trans Neural Netw, V8, P1564, DOI 10.1109/TNN.1997.641482
[7]  
Cristianini N, 2002, ADV NEUR IN, V14, P367
[8]   Approximate statistical tests for comparing supervised classification learning algorithms [J].
Dietterich, TG .
NEURAL COMPUTATION, 1998, 10 (07) :1895-1923
[9]  
Duda R.O., 2001, Pattern Classification, V2nd
[10]  
Fukunaga K., 1990, INTRO STAT PATTERN R