A Novel Margin Based Algorithm for Feature Extraction

被引:0
作者
Cheng Yang
Liwei Wang
Jufu Feng
机构
[1] Peking University,Key Laboratory of Machine Perception, MOE, School of Electronics Engineering and Computer Science
来源
New Generation Computing | 2009年 / 27卷
关键词
Feature Extraction; Margin Maximization; Data Reweighting; Pattern Recognition;
D O I
暂无
中图分类号
学科分类号
摘要
Margin based feature extraction has become a hot topic in machine learning and pattern recognition. In this paper, we present a novel feature extraction method called Adaptive Margin Maximization (AMM) in which margin is defined to measure the discrimination ability of the features. The motivation comes principally from the iterative weight modification mechanism of the powerful boosting algorithms. In our AMM, the samples are dynamically weighted and features are learned sequentially. After one new feature is learned by maximizing the weighted total margin of data, the weights are updated so that the samples with smaller margins receive larger weights. The feature learned in the next round will thus try to concentrate more on these “hard” samples adaptively. We show that when the data are projected onto the feature space learned by AMM, most examples have large margins, and therefore the nearest neighbor classifier yields small generalization error. This is in contrast to existing margin maximization based feature extraction approaches, in which the goal is to maximize the total margin. Extensive experimental results on benchmark datasets demonstrate the effectiveness of our method.
引用
收藏
页码:285 / 305
页数:20
相关论文
共 47 条
[1]  
Belhumeur P.N.(1997)“Eigenfaces vs Fisherfaces: Recognition Using Class Specific Linear Projection” IEEE Trans. Pattern Anal. Mach. Intell. 19 711-720
[2]  
Hespanha P.(1998)“Boosting the margin: a new explanation for the effectiveness of voting methods” Ann. Stat 26 1651-1686
[3]  
Kriegman D.J.(2000)“A New LDA-based Face Recognition System Which Can Solve the Small Sample Size Problem”. Pattern Recognition 33 1713-1726
[4]  
Schapire R. E.(2004)“Generalizing Discriminant Analysis Using the Generalized Singular Value Decomposition” IEEE Trans. Pattern Anal. Mach. Intell. 26 995-1006
[5]  
Freund Y.(2007)“A Real Generalization of Discrete AdaBoost” Artificial Intelligence 171 25-41
[6]  
Bartlett P.(2007)“Fast Principal Component Analysis Using Fixed-point algorithm” Pattern Recognition Letters 28 1151-1155
[7]  
Lee W.S.(2007)“Dimensionality Reduction of Multimodal Labeled Data by Local Fisher Discriminant Analysis” J. Mach. Learn. Res. 8 1027-1061
[8]  
Chen L.(2009)“Distance Metric Learning for Large Margin Nearest Neighbor Classification” Journal of Machine Learning Research 10 207-244
[9]  
Liao H.(2005)“The equivalence of two-dimensional PCA to line-based PCA” Pattern Recognition Letters 26 57-60
[10]  
Ko M.(2006)“On Image Matrix Based Feature Extraction Algorithms” IEEE Trans. Systems Man and Cybernetics 36 194-197