A fast training algorithm for SVM via clustering technique and Gabriel graph

被引:0
作者
Li, Xia [1 ]
Wang, Na [1 ]
Li, Shu-Yuan [1 ]
机构
[1] Shenzhen Univ, Coll Informat Engn, Shenzhen 518060, Guangdong, Peoples R China
来源
ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS: WITH ASPECTS OF CONTEMPORARY INTELLIGENT COMPUTING TECHNIQUES | 2007年 / 2卷
关键词
support vector machine; fast training; clustering; Gabriel graph;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The training time for Support vector machine (SVM) depends largely on the size of the training set, which makes it impractical for large data sets. This paper presents a new method to reduce the size by combining two supplementary algorithms. The training data is partitioned into several pair-wise disjoint clusters by using k-means clustering algorithm. Then, the representatives of these clusters can be edited by Gabriel graph algorithm, based on which we can approximately identify the support vectors and nonsupport vectors. After de-clustering the marginal boundary clusters represented by support vectors and deleting the internal clusters represented by non-support vectors, the number of training data can be significantly reduced, thereby speeding up the training process. The proposed method was tested on both the artificial data and real data. Experiment results show that replacing the training set with the edited set obtained from Gabriel graph algorithm and k-means clustering technique as the training set, significantly reduces the training time for SVM, yet the classification accuracy remains nearly undegraded.
引用
收藏
页码:403 / +
页数:3
相关论文
共 21 条
[1]  
Achlioptas D, 2002, ADV NEUR IN, V14, P335
[2]  
AGARWAL K, 2002, 8 ACM SIGKDD, P173
[3]  
[Anonymous], 2001, LIBSVM LIB SUPPORT V
[4]  
BLAKE L, UCI REPOSITORY MACHI
[5]  
Boley D, 2004, SIAM PROC S, P126
[6]  
Boser B. E., 1992, Proceedings of the Fifth Annual ACM Workshop on Computational Learning Theory, P144, DOI 10.1145/130385.130401
[7]   A tutorial on Support Vector Machines for pattern recognition [J].
Burges, CJC .
DATA MINING AND KNOWLEDGE DISCOVERY, 1998, 2 (02) :121-167
[8]  
CHIHCHUNG C, 2002, IEEE T NEURAL NETWOR, V13, P415
[9]  
CUTTING DR, 1992, SIGIR 92 : PROCEEDINGS OF THE FIFTEENTH ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, P318
[10]   Training invariant support vector machines [J].
Decoste, D ;
Schölkopf, B .
MACHINE LEARNING, 2002, 46 (1-3) :161-190