A stochastic algorithm for feature selection in pattern recognition

被引:0
作者
Gadat, Sebastien
Younes, Laurent
机构
[1] ENS Cachan, CMLA, F-94235 Cachan, France
[2] Johns Hopkins Univ, Ctr Imaging Sci, Baltimore, MD 21218 USA
关键词
stochastic learning algorithms; Robbins-Monro application; pattern recognition; classification algorithm; feature selection;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We introduce a new model addressing feature selection from a large dictionary of variables that can be computed from a signal or an image. Features are extracted according to an efficiency criterion, on the basis of specified classification or recognition tasks. This is done by estimating a probability distribution P on the complete dictionary, which distributes its mass over the more efficient, or informative, components. We implement a stochastic gradient descent algorithm, using the probability as a state variable and optimizing a multi-task goodness of fit criterion for classifiers based on variable randomly chosen according to P. We then generate classifiers from the optimal distribution of weights learned on the training set. The method is first tested on several pattern recognition problems including face detection, handwritten digit recognition, spam classification and micro-array analysis. We then compare our approach with other step-wise algorithms like random forests or recursive feature elimination.
引用
收藏
页码:509 / 547
页数:39
相关论文
共 52 条
[1]   Shape quantization and recognition with randomized trees [J].
Amit, Y ;
Geman, D .
NEURAL COMPUTATION, 1997, 9 (07) :1545-1588
[2]   A computational model for visual selection [J].
Amit, Y ;
Geman, D .
NEURAL COMPUTATION, 1999, 11 (07) :1691-1715
[3]  
[Anonymous], 1990, APPL MATH
[4]  
[Anonymous], FEATURE EXTRACTION F
[5]   Convergence with probability one of stochastic approximation algorithms whose average is cooperative [J].
Benaïm, M .
NONLINEARITY, 2000, 13 (03) :601-616
[6]   A dynamical system approach to stochastic approximations [J].
Benaim, M .
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 1996, 34 (02) :437-472
[7]  
Bins J, 2001, EIGHTH IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOL II, PROCEEDINGS, P159, DOI 10.1109/ICCV.2001.937619
[8]  
Blake C.L., 1998, UCI repository of machine learning databases
[9]   TRAINING A 3-NODE NEURAL NETWORK IS NP-COMPLETE [J].
BLUM, AL ;
RIVEST, RL .
NEURAL NETWORKS, 1992, 5 (01) :117-127
[10]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32