Instance-based entropy fuzzy support vector machine for imbalanced data

Cited by: 0
Authors
Poongjin Cho
Minhyuk Lee
Woojin Chang
Affiliations
[1] Seoul National University, Department of Industrial Engineering
[2] Samsung Electronics, Big Data Analytics Group, Mobile Communications Business
Source
Pattern Analysis and Applications | 2020 / Vol. 23
Keywords
Fuzzy support vector machine; Imbalanced dataset; Entropy; Pattern recognition; Nearest neighbor
DOI
Not available
Abstract
Imbalanced classification has been a major challenge for machine learning because many standard classifiers are designed for balanced datasets and tend to produce results biased toward the majority class. We modify the entropy fuzzy support vector machine (EFSVM) and introduce the instance-based entropy fuzzy support vector machine (IEFSVM). Both EFSVM and IEFSVM use the entropy information of the k-nearest neighbors to determine a fuzzy membership value for each sample, which sets the importance of that sample. IEFSVM considers the diversity of entropy patterns for each sample as the neighborhood size k increases, whereas EFSVM uses a single entropy value computed at one fixed neighborhood size for all samples. By varying k, we can reflect how the composition of a sample's neighbors changes from near to far distance when determining the fuzzy membership value. Numerical experiments on 35 public and 12 real-world imbalanced datasets are performed to validate IEFSVM, and the area under the receiver operating characteristic curve (AUC) is used to compare its performance with other SVMs and machine learning methods. IEFSVM shows a much higher AUC value for datasets with a high imbalance ratio, implying that IEFSVM is effective in dealing with the class imbalance problem.
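The idea in the abstract, of deriving a per-sample fuzzy membership from the class entropy of its k-nearest neighbors over several values of k, can be illustrated with a minimal Python sketch. This is not the authors' formulation: the function names, the averaging of entropies across `k_values`, the linear mapping `1 - beta * entropy`, and the choice to give minority samples full weight are all assumptions made for illustration only.

```python
import math


def knn_entropy(points, labels, i, k):
    """Binary class entropy (in bits) among the k nearest neighbors of sample i."""
    # Sort all other samples by Euclidean distance to sample i.
    dists = sorted(
        (math.dist(points[i], points[j]), j)
        for j in range(len(points))
        if j != i
    )
    neighbor_labels = [labels[j] for _, j in dists[:k]]
    p_pos = sum(1 for lab in neighbor_labels if lab == 1) / k
    entropy = 0.0
    for p in (p_pos, 1.0 - p_pos):
        if p > 0.0:
            entropy -= p * math.log2(p)
    return entropy


def fuzzy_memberships(points, labels, k_values, beta=0.5):
    """Hypothetical membership rule: majority samples (label 0) are
    down-weighted by their mean neighbor entropy over several k;
    minority samples (label 1) keep membership 1.0 (an assumption)."""
    memberships = []
    for i, lab in enumerate(labels):
        if lab == 1:
            memberships.append(1.0)
            continue
        mean_h = sum(
            knn_entropy(points, labels, i, k) for k in k_values
        ) / len(k_values)
        # Entropy in bits lies in [0, 1], so the result lies in [1 - beta, 1].
        memberships.append(1.0 - beta * mean_h)
    return memberships


# Toy data: five majority samples (0) and one minority sample (1).
points = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0), (1.0, 1.0), (4.0, 4.0), (4.0, 5.0)]
labels = [0, 0, 0, 0, 0, 1]
weights = fuzzy_memberships(points, labels, k_values=[1, 2], beta=0.5)
```

In this toy data, a majority sample whose neighborhood is mixed at some k receives a lower weight than one surrounded only by its own class. Weights of this kind could then be supplied to a weighted SVM, for example through the `sample_weight` argument of scikit-learn's `SVC.fit`, though that pairing is also an assumption and not something the record states.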
Pages: 1183 - 1202
Page count: 19
Related papers
50 in total
  • [21] A fast fuzzy support vector machine based on information granulation
    Shifei Ding
    Youzhen Han
    Junzhao Yu
    Yaxiang Gu
    Neural Computing and Applications, 2013, 23 : 139 - 144
  • [22] Fuzzy support vector machine based on affinity among samples
    Zhang, Xiang
    Xiao, Xiao-Ling
    Xu, Guang-You
    Ruan Jian Xue Bao/Journal of Software, 2006, 17 (05): : 951 - 958
  • [23] Detecting Advanced Persistent Threats Based on Entropy and Support Vector Machine
    Tan, Jiayu
    Wang, Jian
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2018, PT IV, 2018, 11337 : 153 - 165
  • [24] The research on the method of feature selection in support vector Machine based Entropy
    Zhu, Xiaoyan
    Tian, Xi
    Zhu, Xiaoxun
    PROGRESS IN POWER AND ELECTRICAL ENGINEERING, PTS 1 AND 2, 2012, 354-355 : 1192 - +
  • [25] Entropy based disease classification of proteomic mass spectrometry data of the human serum by a support vector machine
    Kristensen, T
    Kumar, G
    PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), VOLS 1-5, 2005, : 542 - 545
  • [26] Infrasound signal classification based on spectral entropy and support vector machine
    Li, Mei
    Liu, Xueyong
    Liu, Xu
    APPLIED ACOUSTICS, 2016, 113 : 116 - 120
  • [27] Algorithm of Fuzzy Support Vector Machine based on a Piecewise Linear Fuzzy Weight Method
    Yuan, Yong-bin
    Lan, Sheng
    Yu, Xu
    Yu, Miao
    INTERNATIONAL JOURNAL OF COGNITIVE INFORMATICS AND NATURAL INTELLIGENCE, 2018, 12 (02) : 62 - 76
  • [28] NEW FUZZY SUPPORT VECTOR MACHINE BASED ON MIXED KERNEL FUNCTION
    Lu, Yan-Ling
    Li, Lei
    Zhou, Meng-Meng
    Tian, Guo-Liang
    PROCEEDINGS OF 2009 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-6, 2009, : 526 - +
  • [29] Network intrusion detection model based on fuzzy support vector machine
    Long, Yanjun
    Ouyang, Jianquan
    Sun, Xinwen
    Journal of Networks, 2013, 8 (06) : 1387 - 1394
  • [30] Fuzzy Support Vector Machine for bankruptcy prediction
    Chaudhuri, Arindam
    De, Kajal
    APPLIED SOFT COMPUTING, 2011, 11 (02) : 2472 - 2486