Fast instance selection for speeding up support vector machines

Cited by: 68
Authors
Chen, Jingnian [1]
Zhang, Caiming [2]
Xue, Xiaoping [3]
Liu, Cheng-Lin [4]
Affiliations
[1] Shandong Univ Finance & Econ, Dept Informat & Comp Sci, Jinan 250014, Peoples R China
[2] Shandong Univ, Sch Comp Sci & Technol, Jinan 250014, Peoples R China
[3] Tongji Univ, Sch Elect & Informat, Shanghai 201804, Peoples R China
[4] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
SVM; Classification; Multi-class; Instance selection; Clustering;
DOI
10.1016/j.knosys.2013.01.031
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Support vector machines (SVMs) have shown prominent performance for binary classification, but applying them effectively to massive datasets with a large number of classes and instances remains a serious challenge. Instance selection methods have been proposed and have proved effective at reducing the training complexity of SVMs, but they trade off generalization performance to a greater or lesser degree. This paper presents an instance selection method designed especially for multi-class problems. Using cluster centers of the positive class as reference points, instances are selected for each one-versus-rest SVM model. The purpose of clustering here is to improve the efficiency of instance selection, rather than to select instances directly from clusters as previous methods did. Experiments on a wide variety of datasets demonstrate that the proposed method selects fewer instances than most competing algorithms while keeping the highest classification accuracy on most datasets. Additionally, experimental results show that the method also achieves superior performance on binary problems. © 2013 Elsevier B.V. All rights reserved.
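The abstract gives only the outline of the method, so the following is a minimal sketch of the general idea rather than the authors' algorithm. It assumes, purely for illustration, that every positive instance is kept and that negative instances are ranked by distance to the nearest positive-class cluster center. The function names, the parameters k and keep_fraction, the plain k-means routine, and the synthetic data are all hypothetical and do not come from the paper.

```python
import numpy as np

def kmeans(X, k, n_iter=20, seed=0):
    """Plain Lloyd's k-means; returns the (k, d) array of cluster centers."""
    rng = np.random.default_rng(seed)
    centers = X[rng.choice(len(X), size=k, replace=False)].copy()
    for _ in range(n_iter):
        # Assign each point to its nearest center.
        dists = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        for j in range(k):
            members = X[labels == j]
            if len(members) > 0:  # keep the old center if a cluster empties
                centers[j] = members.mean(axis=0)
    return centers

def select_for_one_vs_rest(X, y, positive_label, k=3, keep_fraction=0.2):
    """Return sorted indices of a reduced training set for one one-vs-rest model.

    Assumption (not from the paper): keep every positive instance, plus the
    fraction of negatives closest to any positive-class cluster center.
    """
    pos_idx = np.flatnonzero(y == positive_label)
    neg_idx = np.flatnonzero(y != positive_label)
    # Cluster centers of the positive class act as reference points.
    centers = kmeans(X[pos_idx], min(k, len(pos_idx)))
    # Distance from each negative instance to its nearest reference point.
    d = np.linalg.norm(
        X[neg_idx][:, None, :] - centers[None, :, :], axis=2
    ).min(axis=1)
    n_keep = max(1, int(np.ceil(keep_fraction * len(neg_idx))))
    nearest_neg = neg_idx[np.argsort(d)[:n_keep]]
    return np.sort(np.concatenate([pos_idx, nearest_neg]))

# Synthetic three-class data (hypothetical): one Gaussian blob per class.
rng = np.random.default_rng(0)
X = np.vstack([
    rng.normal(loc=c, scale=0.5, size=(50, 2))
    for c in ([0.0, 0.0], [3.0, 0.0], [0.0, 3.0])
])
y = np.repeat([0, 1, 2], 50)

# One reduced index set per one-vs-rest model.
selected = {int(c): select_for_one_vs_rest(X, y, c) for c in np.unique(y)}
```

Each entry of selected could then be used to train the binary SVM for that class on X[selected[c]] with labels y[selected[c]] == c. In this sketch, comparing negatives against a handful of centers instead of against every positive instance is what keeps the selection step cheap.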
Pages: 1-7
Page count: 7