A fast training algorithm for SVM via clustering technique and Gabriel graph

被引:0
|
作者
Li, Xia [1 ]
Wang, Na [1 ]
Li, Shu-Yuan [1 ]
机构
[1] Shenzhen Univ, Coll Informat Engn, Shenzhen 518060, Guangdong, Peoples R China
来源
ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS: WITH ASPECTS OF CONTEMPORARY INTELLIGENT COMPUTING TECHNIQUES | 2007年 / 2卷
关键词
support vector machine; fast training; clustering; Gabriel graph;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The training time for Support vector machine (SVM) depends largely on the size of the training set, which makes it impractical for large data sets. This paper presents a new method to reduce the size by combining two supplementary algorithms. The training data is partitioned into several pair-wise disjoint clusters by using k-means clustering algorithm. Then, the representatives of these clusters can be edited by Gabriel graph algorithm, based on which we can approximately identify the support vectors and nonsupport vectors. After de-clustering the marginal boundary clusters represented by support vectors and deleting the internal clusters represented by non-support vectors, the number of training data can be significantly reduced, thereby speeding up the training process. The proposed method was tested on both the artificial data and real data. Experiment results show that replacing the training set with the edited set obtained from Gabriel graph algorithm and k-means clustering technique as the training set, significantly reduces the training time for SVM, yet the classification accuracy remains nearly undegraded.
引用
收藏
页码:403 / +
页数:3
相关论文
共 50 条
  • [41] Chameleon 2: An Improved Graph-Based Clustering Algorithm
    Barton, Tomas
    Bruna, Tomas
    Kordik, Pavel
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2019, 13 (01)
  • [42] A new SVM integrated Rough Type-II Fuzzy Clustering Technique
    Sarkar, Jnanendra Prasad
    Saha, Indrajit
    Maulik, Ujjwal
    2014 9TH INTERNATIONAL CONFERENCE ON INDUSTRIAL AND INFORMATION SYSTEMS (ICIIS), 2014, : 477 - 482
  • [43] A simple and fast algorithm for K-medoids clustering
    Park, Hae-Sang
    Jun, Chi-Hyuck
    EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (02) : 3336 - 3341
  • [44] A fast parallel clustering algorithm for molecular simulation trajectories
    Zhao, Yutong
    Sheong, Fu Kit
    Sun, Jian
    Sander, Pedro
    Huang, Xuhui
    JOURNAL OF COMPUTATIONAL CHEMISTRY, 2013, 34 (02) : 95 - 104
  • [45] A clustering successive POCS algorithm for fast point matching
    Lian, Wei
    Liang, Yan
    Pan, Quan
    Chen, Yong-Mei
    Zhang, Hong-Cai
    PROCEEDINGS OF 2006 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2006, : 3903 - +
  • [46] A fast consistent grid-based clustering algorithm
    Tarasenko, Anton S.
    Berikov, Vladimir B.
    Pestunov, Igor A.
    Rylov, Sergey A.
    Ruzankin, Pavel S.
    PATTERN ANALYSIS AND APPLICATIONS, 2024, 27 (04)
  • [47] Fast clustering algorithm based on MST of representative points
    Du, Hui
    Lu, Depeng
    Wang, Zhihe
    Ma, Cuntao
    Shi, Xinxin
    Wang, Xiaoli
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2023, 20 (09) : 15830 - 15858
  • [48] Fast agglomerative clustering using a k-nearest neighbor graph
    Franti, Pasi
    Virmajoki, Olli
    Hautamaki, Ville
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2006, 28 (11) : 1875 - 1881
  • [49] Joint Node Representation Learning and Clustering for Attributed Graph via Graph Diffusion Convolution
    Guo, Yiwei
    Kang, Le
    Wu, Mengqi
    Zhou, Lijuan
    Zhang, Zhihong
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [50] Fast Algorithm for Integrating Clustering with Ranking on Heterogeneous Graphs
    Yamazaki, Kotaro
    Sato, Tomoki
    Shiokawa, Hiroaki
    Kitagawa, Hiroyuki
    IIWAS2018: THE 20TH INTERNATIONAL CONFERENCE ON INFORMATION INTEGRATION AND WEB-BASED APPLICATIONS & SERVICES, 2014, : 24 - 32