Multicategory proximal support vector machine classifiers

被引:216
作者
Fung, GM
Mangasarian, OL
机构
[1] Siemens Med Solut Inc, Comp Aided Diag & Therapy Solut, Malvern, PA 19355 USA
[2] Univ Calif San Diego, Dept Math, La Jolla, CA 92093 USA
[3] Univ Wisconsin, Dept Comp Sci, Madison, WI 53706 USA
基金
美国国家科学基金会;
关键词
multicategory data classification; support vector machines; proximal classifiers;
D O I
10.1007/s10994-005-0463-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Given a dataset, each element of which labeled by one of k labels, we construct by a very fast algorithm, a k-category proximal support vector machine (PSVM) classifier. Proximal support vector machines and related approaches (Fung & Mangasarian, 2001; Suykens & Vandewalle, 1999) can be interpreted as ridge regression applied to classification problems (Evgeniou, Pontil, & Poggio, 2000). Extensive computational results have shown the effectiveness of PSVM for two-class classification problems where the separating plane is constructed in time that can be as little as two orders of magnitude shorter than that of conventional support vector machines. When PSVM is applied to problems with more than two classes, the well known one-from-the-rest approach is a natural choice in order to take advantage of its fast performance. However, there is a drawback associated with this one-from-the-rest approach. The resulting two-class problems are often very unbalanced, leading in some cases to poor performance. We propose balancing the k classes and a novel Newton refinement modification to PSVM in order to deal with this problem. Computational results indicate that these two modifications preserve the speed of PSVM while often leading to significant test set improvement over a plain PSVM one-from-the-rest application. The modified approach is considerably faster than other one-from-the-rest methods that use conventional SVM formulations, while still giving comparable test set correctness.
引用
收藏
页码:77 / 97
页数:21
相关论文
共 37 条
[1]  
ANDERSON E, 1999, LAPACKS USERS GUIDE
[2]  
[Anonymous], US GUID
[3]  
Bennett K., 1993, OPTIM METHOD SOFTW, V3, P27
[4]  
BOTTOU L, 1994, INT C PATT RECOG, P77, DOI 10.1109/ICPR.1994.576879
[5]   Massive data discrimination via linear support vector machines [J].
Bradley, PS ;
Mangasarian, OL .
OPTIMIZATION METHODS & SOFTWARE, 2000, 13 (01) :1-10
[6]   Multicategory classification by support vector machines [J].
Bredensteiner, EJ ;
Bennett, KP .
COMPUTATIONAL OPTIMIZATION AND APPLICATIONS, 1999, 12 (1-3) :53-79
[7]   A tutorial on Support Vector Machines for pattern recognition [J].
Burges, CJC .
DATA MINING AND KNOWLEDGE DISCOVERY, 1998, 2 (02) :121-167
[8]   Hybrid misclassification minimization [J].
Chen, CH ;
Mangasarian, OL .
ADVANCES IN COMPUTATIONAL MATHEMATICS, 1996, 5 (2-3) :127-136
[9]  
Cherkassky V.S., 1998, LEARNING DATA CONCEP, V1st ed.
[10]  
*CPLEX OPT INC, 1992, US CPLEX TM LIN OPT