Margin calibration in SVM class-imbalanced learning

被引:53
|
作者
Yang, Chan-Yun [1 ]
Yang, Jr-Syu [2 ]
Wang, Jian-Jun [3 ]
机构
[1] Technol & Sci Inst No Taiwan, Dept Mech Engn, Taipei 11202, Taiwan
[2] Tamkang Univ, Dept Mech & Electromech Engn, Tamsui 25137, Taipei County, Taiwan
[3] Southwest Univ, Sch Math & Stat, Chongqing 400715, Peoples R China
关键词
Margin; Cost-sensitive learning; Class-imbalanced learning; Support vector machines; Classification; SUPPORT VECTOR MACHINES; CLASSIFICATION; KERNEL; CONSISTENCY;
D O I
10.1016/j.neucom.2009.08.006
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Imbalanced dataset learning is an important practical issue in machine learning, even in support vector machines (SVMs). In this study, a well known reference model for solving the problem proposed by Veropoulos et al., is first studied. From the aspect of loss function, the reference cost sensitive prototype is identified as a penalty-regularized model. Intuitively, the loss function can change not only the penalty but also the margin to recover the biased decision boundary. This study focuses mainly on the effect from the margin and then extends the model to a more general modification. As proposed in the prototype, the modification first adopts an inversed proportional regularized penalty to re-weight the imbalanced classes. In addition to the penalty regularization, the modification then employs a margin compensation to lead the margin to be lopsided, which enables the decision boundary drift. Two regularization factors, the penalty and margin. are hence suggested for achieving an unbiased classification. The margin compensation, associating with the penalty regularization, is here utilized to calibrate and refine the biased decision boundary to further reduce the bias. With the area under the receiver operating characteristic curve (AuROC) for examining the performance, the modification shows relative higher scores than the reference model, even though the optimal performance is achieved by the reference model. Some useful characteristics found empirically are also included, which may be convenient for the future applications. All the theoretical descriptions and experimental validations show the proposed model's potential to compete for highly unbiased accuracy in a complex imbalanced dataset. (C) 2009 Elsevier B.V. All rights reserved.
引用
收藏
页码:397 / 411
页数:15
相关论文
共 50 条
  • [41] Polarimetry-Inspired Contrastive Learning for Class-Imbalanced PolSAR Image Classification
    Kuang, Zuzheng
    Bi, Haixia
    Li, Fan
    Xu, Chen
    Sun, Jian
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 19
  • [42] ABC: Auxiliary Balanced Classifier for Class-Imbalanced Semi-Supervised Learning
    Lee, Hyuck
    Shin, Seungjae
    Kim, Heeyoung
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [43] A Diversity-Based Method for Class-Imbalanced Cost-Sensitive Learning
    Dong, Shangyan
    Wu, Yongcheng
    PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON MATHEMATICS AND ARTIFICIAL INTELLIGENCE (ICMAI 2018), 2018, : 51 - 55
  • [44] An Integrated Class-Imbalanced Learning Scheme for Diagnosing Bearing Defects in Induction Motors
    Razavi-Far, Roozbeh
    Farajzadeh-Zanjani, Maryam
    Saif, Mehrdad
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2017, 13 (06) : 2758 - 2769
  • [45] SGBGAN: minority class image generation for class-imbalanced datasets
    Wan, Qian
    Guo, Wenhui
    Wang, Yanjiang
    MACHINE VISION AND APPLICATIONS, 2024, 35 (02)
  • [46] A novel oversampling technique for class-imbalanced learning based on SMOTE and natural neighbors
    Li, Junnan
    Zhu, Qingsheng
    Wu, Quanwang
    Fan, Zhu
    INFORMATION SCIENCES, 2021, 565 : 438 - 455
  • [47] ABAE: Auxiliary Balanced AutoEncoder for class-imbalanced semi-supervised learning
    Tang, Qianying
    Wei, Xiang
    Su, Qi
    Zhang, Shunli
    PATTERN RECOGNITION LETTERS, 2024, 182 : 118 - 124
  • [48] Performance of Machine Learning Algorithms for Class-Imbalanced Process Fault Detection Problems
    Lee, Taehyung
    Lee, Ki Bum
    Kim, Chang Ouk
    IEEE TRANSACTIONS ON SEMICONDUCTOR MANUFACTURING, 2016, 29 (04) : 436 - 445
  • [49] Region-dependent temperature scaling for certainty calibration and application to class-imbalanced token classification
    Dawkins, Hillary
    Nejadgholi, Isar
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022): (SHORT PAPERS), VOL 2, 2022, : 538 - 544
  • [50] Class prediction for high-dimensional class-imbalanced data
    Blagus, Rok
    Lusa, Lara
    BMC BIOINFORMATICS, 2010, 11 : 523