A stochastic algorithm for feature selection in pattern recognition

被引:0
作者
Gadat, Sebastien
Younes, Laurent
机构
[1] ENS Cachan, CMLA, F-94235 Cachan, France
[2] Johns Hopkins Univ, Ctr Imaging Sci, Baltimore, MD 21218 USA
关键词
stochastic learning algorithms; Robbins-Monro application; pattern recognition; classification algorithm; feature selection;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We introduce a new model addressing feature selection from a large dictionary of variables that can be computed from a signal or an image. Features are extracted according to an efficiency criterion, on the basis of specified classification or recognition tasks. This is done by estimating a probability distribution P on the complete dictionary, which distributes its mass over the more efficient, or informative, components. We implement a stochastic gradient descent algorithm, using the probability as a state variable and optimizing a multi-task goodness of fit criterion for classifiers based on variable randomly chosen according to P. We then generate classifiers from the optimal distribution of weights learned on the training set. The method is first tested on several pattern recognition problems including face detection, handwritten digit recognition, spam classification and micro-array analysis. We then compare our approach with other step-wise algorithms like random forests or recursive feature elimination.
引用
收藏
页码:509 / 547
页数:39
相关论文
共 52 条
[41]  
SCHOLKOPF B, 1995, P 1 INT C KNOWL DISC, P252
[42]  
Scholkopf B., 2001, Learning with kernels: support vector machines, regularization, optimization, and beyond
[43]  
SIMARD P, 1998, P 12 INT C PATT REC, P262
[44]  
Singhi S. K., 2006, Proceedings of the 23rd international conference on machine learning, DOI [DOI 10.1145/1143844.1143951, 10.1145/1143844.1143951]
[45]  
Sun Y, 2006, P 23 INT C MACH LEAR, DOI DOI 10.1145/1143844.1143959
[46]  
VAPNIK V, 2002, STAT ENG INFORM SCI
[47]  
Vapnik V.N., 1998, Adaptive and Learning Systems for Signal Processing, Communications, and Control
[48]  
Weston J, 2001, ADV NEUR IN, V13, P668
[49]  
Weston J., 2003, Journal of Machine Learning Research, V3, P1439, DOI 10.1162/153244303322753751
[50]  
WU Z, 2006, FEATURE EXTRACTION F