FSVM-CIL: Fuzzy Support Vector Machines for Class Imbalance Learning

Cited by: 310
Authors
Batuwita, Rukshan [1]
Palade, Vasile [1]
Affiliations
[1] Univ Oxford, Comp Lab, Oxford OX1 3QD, England
Keywords
Class imbalance learning (CIL); fuzzy support vector machines (FSVMs); outliers; support vector machines (SVMs); classification
DOI
10.1109/TFUZZ.2010.2042721
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Support vector machines (SVMs) are a popular machine learning technique that works effectively with balanced datasets. On imbalanced datasets, however, SVMs produce suboptimal classification models. The SVM algorithm is also sensitive to outliers and noise in the data. Therefore, although existing class imbalance learning (CIL) methods can make SVMs less sensitive to class imbalance, they can still suffer from outliers and noise. Fuzzy SVMs (FSVMs) are a variant of the SVM algorithm proposed to handle outliers and noise. In FSVMs, training examples are assigned different fuzzy-membership values based on their importance, and these membership values are incorporated into the SVM learning algorithm to make it less sensitive to outliers and noise. However, like the normal SVM algorithm, FSVMs can also suffer from class imbalance. In this paper, we present a method to improve FSVMs for CIL (called FSVM-CIL), which can be used to handle the class imbalance problem in the presence of outliers and noise. We thoroughly evaluated the proposed FSVM-CIL method on ten real-world imbalanced datasets and compared its performance with five existing CIL methods that are available for normal SVM training. Based on the overall results, we conclude that the proposed FSVM-CIL method is very effective for CIL, especially in the presence of outliers and noise in the datasets.
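The abstract describes assigning each training example a fuzzy-membership value and folding those values into SVM training, but it does not give the formulas. The sketch below is one plausible reading, not the paper's definitive method. It makes two assumptions that the abstract does not state: within each class, membership decays linearly with distance from the class center (so likely outliers get low values), and majority-class memberships are additionally scaled by the minority-to-majority size ratio (so the larger class does not dominate). The function name `fuzzy_memberships` and the parameter `delta` are hypothetical.

```python
import math


def fuzzy_memberships(points, labels, minority_label=1, delta=1e-6):
    """Return one membership value in (0, 1] per training example.

    Assumed scheme (not taken from the abstract):
      membership = within_class_decay * class_factor
    where within_class_decay falls linearly with distance from the
    class center and class_factor is 1 for the minority class and
    n_minority / n_class for every other class.
    """
    # Group example indices by class label.
    by_class = {}
    for i, y in enumerate(labels):
        by_class.setdefault(y, []).append(i)

    n_min = len(by_class[minority_label])
    memberships = [0.0] * len(points)

    for y, idx in by_class.items():
        dim = len(points[idx[0]])
        # Class center: coordinate-wise mean of the class's points.
        center = [sum(points[i][k] for i in idx) / len(idx) for k in range(dim)]
        dists = {i: math.dist(points[i], center) for i in idx}
        d_max = max(dists.values())
        # Class-level factor that counteracts the imbalance.
        class_factor = 1.0 if y == minority_label else n_min / len(idx)
        for i in idx:
            # delta keeps the farthest point's membership above zero.
            decay = 1.0 - dists[i] / (d_max + delta)
            memberships[i] = decay * class_factor
    return memberships


# Toy data: six majority points (one far outlier at (6, 6)) and three minority points.
X = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (1.0, 1.0), (0.5, 0.5), (6.0, 6.0),
     (5.0, 5.0), (5.0, 6.0), (5.0, 5.4)]
y = [0, 0, 0, 0, 0, 0, 1, 1, 1]
m = fuzzy_memberships(X, y, minority_label=1)
```

In this toy run the majority-class outlier at (6, 6) receives the smallest membership in its class, and no majority example exceeds the class factor of 3/6. Values like these could then be supplied as per-example weights to an SVM trainer that supports them, for instance through the `sample_weight` argument of scikit-learn's `SVC.fit`, which rescales the penalty `C` per sample.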
Pages: 558-571
Page count: 14