SVM classification for imbalanced data sets using a multiobjective optimization framework

Cited by: 12
Authors
Askan, Aysegul [1 ]
Sayin, Serpil [2 ]
Affiliations
[1] Garanti Teknol, TR-34212 Istanbul, Turkey
[2] Koc Univ, Coll Adm Sci & Econ, TR-34450 Istanbul, Turkey
Keywords
SVM; Imbalanced data; Multiobjective optimization; Efficient frontier; ROBUST;
DOI
10.1007/s10479-012-1300-5
Chinese Library Classification (CLC) number
C93 [Management]; O22 [Operations Research];
Discipline classification codes
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
Abstract
Classifying imbalanced data sets, in which negative instances outnumber positive instances, is a significant challenge. Such data sets are common in real-life problems, yet well-known classifiers have limited performance on them. Various approaches to the class imbalance problem have been proposed, using either data-level or algorithm-level modifications. Support Vector Machines (SVMs), despite their solid theoretical background, also suffer a dramatic decrease in performance when the data distribution is imbalanced. In this study, we propose an L1-norm SVM approach based on a three-objective optimization problem, so that the error sums for the two classes enter the formulation independently. Motivated by the inherently multiobjective nature of SVMs, the solution approach reduces the problem to two-criteria formulations and investigates the efficient frontier systematically. The results indicate that a comprehensive treatment of distinct positive and negative error levels may lead to performance improvements, at the cost of varying degrees of increased computational effort.
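The abstract does not state the formulation itself. As a rough sketch of what a three-objective L1-norm SVM with separate class error sums could look like, one plausible form is given below. The notation (w, b, ξ, I_+, I_-) is assumed here for illustration and is not taken from the paper.

```latex
% Illustrative sketch only; notation is assumed, not copied from the paper.
% Training data: (x_i, y_i) with y_i \in \{+1, -1\}.
% I_+ and I_- index the positive and negative instances.
% \xi_i is the slack (classification error) of instance i.
\begin{align*}
\min_{w,\,b,\,\xi}\quad
  & \Bigl( \lVert w \rVert_1,\ \sum_{i \in I_+} \xi_i,\ \sum_{i \in I_-} \xi_i \Bigr) \\
\text{s.t.}\quad
  & y_i \,(w^\top x_i + b) \ \ge\ 1 - \xi_i, && i \in I_+ \cup I_-, \\
  & \xi_i \ \ge\ 0, && i \in I_+ \cup I_-.
\end{align*}
```

In the standard soft-margin SVM the two error sums are added together under a single penalty parameter. Keeping them as separate objectives is what allows the trade-off between positive-class and negative-class errors to be explored along an efficient frontier.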
Pages: 191-203
Page count: 13
Related papers
50 in total
  • [1] SVM classification for imbalanced data sets using a multiobjective optimization framework
    Ayşegül Aşkan
    Serpil Sayın
    Annals of Operations Research, 2014, 216 : 191 - 203
  • [2] Improving SVM Classification on Imbalanced Data Sets in Distance Spaces
    Koeknar-Tezel, Suzan
    Latecki, Longin Jan
    2009 9TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, 2009, : 259 - +
  • [3] Using Evolutionary Multiobjective Techniques for Imbalanced Classification Data
    Garcia, Sandra
    Aler, Ricardo
    Maria Galvan, Ines
    ARTIFICIAL NEURAL NETWORKS-ICANN 2010, PT I, 2010, 6352 : 422 - 427
  • [4] Improving SVM classification on imbalanced time series data sets with ghost points
    Koeknar-Tezel, Suzan
    Latecki, Longin Jan
    KNOWLEDGE AND INFORMATION SYSTEMS, 2011, 28 (01) : 1 - 23
  • [5] Improving SVM classification on imbalanced time series data sets with ghost points
    Suzan Köknar-Tezel
    Longin Jan Latecki
    Knowledge and Information Systems, 2011, 28 : 1 - 23
  • [6] SVM ensemble training for imbalanced data classification using multi-objective optimization techniques
    Joanna Grzyb
    Michał Woźniak
    Applied Intelligence, 2023, 53 : 15424 - 15441
  • [7] SVM ensemble training for imbalanced data classification using multi-objective optimization techniques
    Grzyb, Joanna
    Wozniak, Michal
    APPLIED INTELLIGENCE, 2023, 53 (12) : 15424 - 15441
  • [8] SVM Classification for Imbalanced Data Using Conformal Kernel Transformation
    Zhang, Yong
    Fu, Panpan
    Liu, Wenzhe
    Zou, Li
    PROCEEDINGS OF THE 2014 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2014, : 2894 - 2900
  • [9] Imbalanced Data Sets Classification Based on SVM for Sand-Dust Storm Warning
    Xie, Yonghua
    Liu, Yurong
    Fu, Qingqiu
    DISCRETE DYNAMICS IN NATURE AND SOCIETY, 2015, 2015
  • [10] A Submodular Optimization Framework for Imbalanced Text Classification With Data Augmentation
    Alemayehu, Eyor
    Fang, Yi
    IEEE ACCESS, 2023, 11 : 41680 - 41696