Support Vector Machines and the Bayes Rule in Classification

被引:0
作者
Yi Lin
机构
[1] University of Wisconsin,Department of Statistics
[2] Madison,undefined
来源
Data Mining and Knowledge Discovery | 2002年 / 6卷
关键词
support vector machine; classification; the Bayes rule; reproducing kernel; reproducing kernel Hilbert space; regularization methods;
D O I
暂无
中图分类号
学科分类号
摘要
The Bayes rule is the optimal classification rule if the underlying distribution of the data is known. In practice we do not know the underlying distribution, and need to “learn” classification rules from the data. One way to derive classification rules in practice is to implement the Bayes rule approximately by estimating an appropriate classification function. Traditional statistical methods use estimated log odds ratio as the classification function. Support vector machines (SVMs) are one type of large margin classifier, and the relationship between SVMs and the Bayes rule was not clear. In this paper, it is shown that the asymptotic target of SVMs are some interesting classification functions that are directly related to the Bayes rule. The rate of convergence of the solutions of SVMs to their corresponding target functions is explicitly established in the case of SVMs with quadratic or higher order loss functions and spline kernels. Simulations are given to illustrate the relation between SVMs and the Bayes rule in other cases. This helps understand the success of SVMs in many classification studies, and makes it easier to compare SVMs and traditional statistical methods.
引用
收藏
页码:259 / 275
页数:16
相关论文
共 10 条
[1]  
Burges C.J.C.(1998)A tutorial on support vector machines for pattern recognition Data Mining and Knowledge Discovery 2 121-167
[2]  
Cortes C.(1995)Support vector networks Machine Learning 20 273-297
[3]  
Vapnik V.N.(1990)Asymptotic analysis of penalized likelihood and related estimates The Annals of Statistics 18 1676-1695
[4]  
Cox D.D.(1997)On bias, variance, 0/1-loss, and the curse-of-dimensionality Data Mining and Knowledge Discovery 1 55-77
[5]  
O'sullivan F.(2000)Tensor product space ANOVA models The Annals of Statistics 28 734-755
[6]  
Friedman J.H.(2002)Support vector machines for classification in nonstandard situations Machine Learning 46 191-202
[7]  
Lin Y.(undefined)undefined undefined undefined undefined-undefined
[8]  
Lin Y.(undefined)undefined undefined undefined undefined-undefined
[9]  
Lee Y.(undefined)undefined undefined undefined undefined-undefined
[10]  
Wahba G.(undefined)undefined undefined undefined undefined-undefined