Instance-based entropy fuzzy support vector machine for imbalanced data

被引:0
作者
Poongjin Cho
Minhyuk Lee
Woojin Chang
机构
[1] Seoul National University,Department of Industrial Engineering
[2] Samsung Electronics,Big Data Analytics Group, Mobile Communications Business
来源
Pattern Analysis and Applications | 2020年 / 23卷
关键词
Fuzzy support vector machine; Imbalanced dataset; Entropy; Pattern recognition; Nearest neighbor;
D O I
暂无
中图分类号
学科分类号
摘要
Imbalanced classification has been a major challenge for machine learning because many standard classifiers mainly focus on balanced datasets and tend to have biased results toward the majority class. We modify entropy fuzzy support vector machine (EFSVM) and introduce instance-based entropy fuzzy support vector machine (IEFSVM). Both EFSVM and IEFSVM use the entropy information of k-nearest neighbors to determine the fuzzy membership value for each sample which prioritizes the importance of each sample. IEFSVM considers the diversity of entropy patterns for each sample when increasing the size of neighbors, k, while EFSVM uses single entropy information of the fixed size of neighbors for all samples. By varying k, we can reflect the component change of sample’s neighbors from near to far distance in the determination of fuzzy value membership. Numerical experiments on 35 public and 12 real-world imbalanced datasets are performed to validate IEFSVM, and area under the receiver operating characteristic curve (AUC) is used to compare its performance with other SVMs and machine learning methods. IEFSVM shows a much higher AUC value for datasets with high imbalance ratio, implying that IEFSVM is effective in dealing with the class imbalance problem.
引用
收藏
页码:1183 / 1202
页数:19
相关论文
共 50 条
  • [41] MULTI-CLASS FUZZY SUPPORT VECTOR MACHINE BASED ON DISMISSING MARGIN
    Yan, Wei-Yun
    He, Qiang
    [J]. PROCEEDINGS OF 2009 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-6, 2009, : 1139 - +
  • [42] APPLICATION OF SPORTS VIDEO IMAGE ANALYSIS BASED ON FUZZY SUPPORT VECTOR MACHINE
    Gao, Licheng
    Zhao, Yawen
    [J]. SCALABLE COMPUTING-PRACTICE AND EXPERIENCE, 2024, 25 (03): : 1790 - 1798
  • [43] The Research on the algorithm of nonlinear Support Vector Classification Machine based on Fuzzy theory
    Wang, Aimin
    Ge, Wenying
    Yang, Zhimin
    [J]. FIFTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 1, PROCEEDINGS, 2008, : 479 - +
  • [44] Ant-Based Feature and Instance Selection for Multiclass Imbalanced Data
    Villuendas-Rey, Yenny
    Yanez-Marquez, Cornelio
    Camacho-Nieto, Oscar
    [J]. IEEE ACCESS, 2024, 12 : 133952 - 133968
  • [45] Fuzzy support vector machine for PolSAR image classification
    Ke, Hongxia
    Liu, Guodong
    Pan, Guobing
    [J]. ADVANCES IN CIVIL INFRASTRUCTURE ENGINEERING, PTS 1 AND 2, 2013, 639-640 : 1162 - 1167
  • [46] A new fuzzy support vector machine with pinball loss
    Verma R.N.
    Deo R.
    Srivastava R.
    Subbarao N.
    Singh G.P.
    [J]. Discover Artificial Intelligence, 2023, 3 (01):
  • [47] A new Fuzzy Support Vector Machine for Credit Scoring
    Tang, Bo
    Xia, Min
    [J]. EMERGING SYSTEMS FOR MATERIALS, MECHANICS AND MANUFACTURING, 2012, 109 : 636 - +
  • [48] Fuzzy Support Vector Machine for Eye Expression Analysis
    Yin Fangping
    Li Wanbiao
    [J]. ICIC 2009: SECOND INTERNATIONAL CONFERENCE ON INFORMATION AND COMPUTING SCIENCE, VOL 4, PROCEEDINGS: MODELLING AND SIMULATION IN ENGINEERING, 2009, : 182 - +
  • [49] Fuzzy support vector machine with joint optimization of genetic algorithm and fuzzy c-means
    Li, Ming-Ai
    Wang, Ruo-Tu
    Wei, Li-Na
    [J]. TECHNOLOGY AND HEALTH CARE, 2021, 29 (05) : 921 - 937
  • [50] Detecting anomalous traffic in the controlled network based on cross entropy and support vector machine
    Han, Weijie
    Xue, Jingfeng
    Yan, Hui
    [J]. IET INFORMATION SECURITY, 2019, 13 (02) : 109 - 116