An Improved SVM-KM Model For Imbalanced Datasets

被引:35
作者
Deng Weiguo [1 ]
Wang Li [1 ]
Wang Yiyang [1 ]
Qian Zhong [1 ]
机构
[1] Beihang Univ, Sch Econ & Management, Beijing, Peoples R China
来源
2012 INTERNATIONAL CONFERENCE ON INDUSTRIAL CONTROL AND ELECTRONICS ENGINEERING (ICICEE) | 2012年
关键词
support vector machine; k-means; different error costs; imbalanced datasets; SELECTION;
D O I
10.1109/ICICEE.2012.35
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Support vector machine is a widely used machine learning technique. SVM-KM model can speed SVM training by eliminating non support vectors, but imbalanced datasets will affect the classification accuracy. In this paper, we proposed an improved SVM-KM model, which assign different error costs to different classes. Based on the simulation results, the improved SVM-KM model performed best for imbalanced datasets.
引用
收藏
页码:100 / 103
页数:4
相关论文
共 18 条
[1]   Applying support vector machines to imbalanced datasets [J].
Akbani, R ;
Kwek, S ;
Japkowicz, N .
MACHINE LEARNING: ECML 2004, PROCEEDINGS, 2004, 3201 :39-50
[2]  
[Anonymous], 1999, Proceedings of the International Joint Conference on Artificial Intelligence
[3]  
[Anonymous], INT C UB INF MAN COM
[4]   FSVM-CIL: Fuzzy Support Vector Machines for Class Imbalance Learning [J].
Batuwita, Rukshan ;
Palade, Vasile .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2010, 18 (03) :558-571
[5]   SVM-KM:: speeding SVMs learning with a priori cluster selection and k-means [J].
de Almeida, MB ;
Braga, AD ;
Braga, JP .
SIXTH BRAZILIAN SYMPOSIUM ON NEURAL NETWORKS, VOL 1, PROCEEDINGS, 2000, :162-167
[6]   Clustering-Based Extraction of Border Training Patterns for Accurate SVM Classification of Hyperspectral Images [J].
Demir, Beguem ;
Erturk, Sarp .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2009, 6 (04) :840-844
[7]   SVM-RFE based feature selection for tandem mass spectrum quality assessment [J].
Ding, Jiarui ;
Shi, Jinhong ;
Wu, Fang-Xiang .
INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2011, 5 (01) :73-88
[8]   Hierarchically SVM classification based on support vector clustering method and its application to document categorization [J].
Hao, Pei-Yi ;
Chiang, Jung-Hsien ;
Tu, Yi-Kun .
EXPERT SYSTEMS WITH APPLICATIONS, 2007, 33 (03) :627-635
[9]   Learning from Imbalanced Data [J].
He, Haibo ;
Garcia, Edwardo A. .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2009, 21 (09) :1263-1284
[10]   Ranking and Selecting Terms for Text Categorization via SVM Discriminate Boundary [J].
Kuo, Tien-Fang ;
Yajima, Yasutoshi .
INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2010, 25 (02) :137-154