Sample selection based on K-L divergence for effectively training SVM

被引:3
作者
Zhai, Junhai [1 ]
Li, Chang [2 ]
Li, Ta [2 ]
Wang, Xizhao [1 ]
机构
[1] Hebei Univ, Coll Math & Comp Sci, Key Lab Machine Learning & Computat Intelligence, Baoding, Peoples R China
[2] Hebei Univ, Coll Math & Comp Sci, Baoding, Peoples R China
来源
2013 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2013) | 2013年
基金
中国国家自然科学基金;
关键词
samples selection; PNN; K-L divergence; SVM; SUPPORT VECTOR MACHINES; RECOGNITION; CLASSIFIERS; REGRESSION;
D O I
10.1109/SMC.2013.823
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The computational time and space complexity of support vector machine (SVM) are O(n(3)) and O(n(2)) respectively, where n is the number of training samples. It is inefficient or impracticable to train an SVM on relatively large datasets. Actually, the removal of training samples that are not support vector (SVs) has no effect on constructing the optimal hyperplane. Based on this idea, this paper proposed a sample selection method which can efficiently choose the candidate SVs from original datasets. The selected samples are used to train SVM. The experimental results show that the proposed method is effective and efficient; it can efficiently reduce the computational complexity both of time and space especially on relatively large datasets.
引用
收藏
页码:4837 / 4842
页数:6
相关论文
共 25 条
[1]   Scaling Up Support Vector Machines Using Nearest Neighbor Condensation [J].
Angiulli, Fabrizio ;
Astorino, Annabella .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2010, 21 (02) :351-357
[2]  
[Anonymous], 2006, Elements of Information Theory
[3]  
[Anonymous], 0007 U WISC DAT MIN
[4]  
[Anonymous], 1999, ADV KERNEL METHODS S
[5]  
Boser B. E., 1992, Proceedings of the Fifth Annual ACM Workshop on Computational Learning Theory, P144, DOI 10.1145/130385.130401
[6]   SVM-KM:: speeding SVMs learning with a priori cluster selection and k-means [J].
de Almeida, MB ;
Braga, AD ;
Braga, JP .
SIXTH BRAZILIAN SYMPOSIUM ON NEURAL NETWORKS, VOL 1, PROCEEDINGS, 2000, :162-167
[7]   Large scale instance selection by means of federal instance selection [J].
de Haro-Garcia, Aida ;
Garcia-Pedrajas, Nicolas ;
Antonio Romero del Castillo, Juan .
DATA & KNOWLEDGE ENGINEERING, 2012, 75 :58-77
[8]  
Duda R.O., 2005, PATTERN CLASSIFICATI, V2
[9]  
Frank A., 2010, UCI machine learning repository, V213
[10]   Applications of support vector machines to speech recognition [J].
Ganapathiraju, A ;
Hamaker, JE ;
Picone, J .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2004, 52 (08) :2348-2355