Fast instance selection for speeding up support vector machines

被引:67
作者
Chen, Jingnian [1 ]
Zhang, Caiming [2 ]
Xue, Xiaoping [3 ]
Liu, Cheng-Lin [4 ]
机构
[1] Shandong Univ Finance & Econ, Dept Informat & Comp Sci, Jinan 250014, Peoples R China
[2] Shandong Univ, Sch Comp Sci & Technol, Jinan 250014, Peoples R China
[3] Tongji Univ, Sch Elect & Informat, Shanghai 201804, Peoples R China
[4] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China
基金
中国国家自然科学基金;
关键词
SVM; Classification; Multi-class; Instance selection; Clustering;
D O I
10.1016/j.knosys.2013.01.031
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Support vector machine (SVM) has shown prominent performance for binary classification. How to effectively apply it to massive datasets with large number of classes and instances is still a serious challenge. Instance selection methods have been proposed and shown significant efficacy for reducing the training complexity of SVM, but more or less trade off the generalization performance. This paper presents an instance selection method especially for multi-class problems. With cluster centers of positive class as reference points instances are selected for each one-versus-rest SVM model. The purpose of clustering here is to improve the efficiency of instance selection, other than to select instances directly from clusters as previous methods did. Experiments on a wide variety of datasets demonstrate that the proposed method selects fewer instances than most competitive algorithms and keeps the highest classification accuracy on most datasets. Additionally, experimental results show that this method also performs superiorly for binary problems. (C) 2013 Elsevier B.V. All rights reserved.
引用
收藏
页码:1 / 7
页数:7
相关论文
共 50 条
  • [21] An efficient instance selection algorithm to reconstruct training set for support vector machine
    Liu, Chuan
    Wang, Wenyong
    Wang, Meng
    Lv, Fengmao
    Konan, Martin
    KNOWLEDGE-BASED SYSTEMS, 2017, 116 : 58 - 73
  • [22] A New Convex Loss Function For Multiple Instance Support Vector Machines
    Kim, Sang-Baeg
    Bae, Jung-Man
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 9023 - 9029
  • [23] Selection of Support Vector Candidates Using Relative Support Distance for Sustainability in Large-Scale Support Vector Machines
    Ryu, Minho
    Lee, Kichun
    APPLIED SCIENCES-BASEL, 2020, 10 (19): : 1 - 14
  • [24] SVOIS: Support Vector Oriented Instance Selection for text classification
    Tsai, Chih-Fong
    Chang, Che-Wei
    INFORMATION SYSTEMS, 2013, 38 (08) : 1070 - 1083
  • [25] Fast classification for large data sets via random selection clustering and Support Vector Machines
    Li, Xiaoou
    Cervantes, Jair
    Yu, Wen
    INTELLIGENT DATA ANALYSIS, 2012, 16 (06) : 897 - 914
  • [26] Expected margin-based pattern selection for support vector machines
    Kim, Dongil
    Kang, Seokho
    Cho, Sungzoon
    EXPERT SYSTEMS WITH APPLICATIONS, 2020, 139 (139)
  • [27] Grey wolf optimization based parameter selection for support vector machines
    Eswaramoorthy, Sathish
    Sivakumaran, N.
    Sekaran, Sankaranarayanan
    COMPEL-THE INTERNATIONAL JOURNAL FOR COMPUTATION AND MATHEMATICS IN ELECTRICAL AND ELECTRONIC ENGINEERING, 2016, 35 (05) : 1513 - 1523
  • [28] Selection of Support Vector Machines based classifiers for credit risk domain
    Danenas, Paulius
    Garsva, Gintautas
    EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (06) : 3194 - 3204
  • [29] Fast training of Support Vector Machines with Gaussian kernel
    Fischetti, Matteo
    DISCRETE OPTIMIZATION, 2016, 22 : 183 - 194
  • [30] A Learning Approach for Fast Training of Support Vector Machines
    Guo, Jun
    Chen, Youguang
    Wang, Su
    Liu, Xiaoping
    IEEE: 2009 INTERNATIONAL CONFERENCE ON E-LEARNING, E-BUSINESS, ENTERPRISE INFORMATION SYSTEMS AND E-GOVERNMENT, 2009, : 122 - 125