A fast training algorithm for SVM via clustering technique and Gabriel graph

被引:0
|
作者
Li, Xia [1 ]
Wang, Na [1 ]
Li, Shu-Yuan [1 ]
机构
[1] Shenzhen Univ, Coll Informat Engn, Shenzhen 518060, Guangdong, Peoples R China
来源
ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS: WITH ASPECTS OF CONTEMPORARY INTELLIGENT COMPUTING TECHNIQUES | 2007年 / 2卷
关键词
support vector machine; fast training; clustering; Gabriel graph;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The training time for Support vector machine (SVM) depends largely on the size of the training set, which makes it impractical for large data sets. This paper presents a new method to reduce the size by combining two supplementary algorithms. The training data is partitioned into several pair-wise disjoint clusters by using k-means clustering algorithm. Then, the representatives of these clusters can be edited by Gabriel graph algorithm, based on which we can approximately identify the support vectors and nonsupport vectors. After de-clustering the marginal boundary clusters represented by support vectors and deleting the internal clusters represented by non-support vectors, the number of training data can be significantly reduced, thereby speeding up the training process. The proposed method was tested on both the artificial data and real data. Experiment results show that replacing the training set with the edited set obtained from Gabriel graph algorithm and k-means clustering technique as the training set, significantly reduces the training time for SVM, yet the classification accuracy remains nearly undegraded.
引用
收藏
页码:403 / +
页数:3
相关论文
共 50 条
  • [31] A context-sensitive clustering technique based on graph-cut initialization and expectation-maximization algorithm
    Tyagi, Mayank
    Bovolo, Francesca
    Mehra, Ankit K.
    Chaudhuri, Subhasis
    Bruzzone, Lorenzo
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2008, 5 (01) : 21 - 25
  • [32] A Fast Handwritten Digit Recognition Algorithm Based on Improved SVM
    Li, Qiong
    Chen, Li
    PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON INFORMATION SCIENCES, MACHINERY, MATERIALS AND ENERGY (ICISMME 2015), 2015, 126 : 2093 - 2096
  • [33] Diluvian Clustering: A Fast, Effective Algorithm for Clustering Compositional and Other Data
    Ritchie, Nicholas W. M.
    MICROSCOPY AND MICROANALYSIS, 2015, 21 (05) : 1173 - 1183
  • [34] Research on Voiceprint Recognition Based on Weighted Clustering Recognition SVM Algorithm
    Wu, Yang
    Xu, Lihong
    Chen, Yandong
    Zhang, Xueyang
    2017 CHINESE AUTOMATION CONGRESS (CAC), 2017, : 1144 - 1148
  • [35] An Alternating Genetic Algorithm for Selecting SVM Model and Training Set
    Kawulok, Michal
    Nalepa, Jakub
    Dudzik, Wojciech
    PATTERN RECOGNITION (MCPR 2017), 2017, 10267 : 94 - 104
  • [36] Genetic algorithm–based training for semi-supervised SVM
    Mathias M. Adankon
    Mohamed Cheriet
    Neural Computing and Applications, 2010, 19 : 1197 - 1206
  • [37] Weighted bilateral K-means algorithm for fast co-clustering and fast spectral clustering
    Song, Kun
    Yao, Xiwen
    Nie, Feiping
    Li, Xuelong
    Xu, Mingliang
    PATTERN RECOGNITION, 2021, 109 (109)
  • [38] Fast SVM training using edge detection on very large datasets
    Li, Boyang
    Wang, Qiangwei
    Hu, Jinglu
    IEEJ TRANSACTIONS ON ELECTRICAL AND ELECTRONIC ENGINEERING, 2013, 8 (03) : 229 - 237
  • [39] Shrinkage Clustering: a fast and size-constrained clustering algorithm for biomedical applications
    Hu, Chenyue W.
    Li, Hanyang
    Qutub, Amina A.
    BMC BIOINFORMATICS, 2018, 19
  • [40] Shrinkage Clustering: a fast and size-constrained clustering algorithm for biomedical applications
    Chenyue W. Hu
    Hanyang Li
    Amina A. Qutub
    BMC Bioinformatics, 19