Instance-based entropy fuzzy support vector machine for imbalanced data

Cited by: 0
Authors
Poongjin Cho
Minhyuk Lee
Woojin Chang
Affiliations
[1] Seoul National University, Department of Industrial Engineering
[2] Samsung Electronics, Big Data Analytics Group, Mobile Communications Business
Source
Pattern Analysis and Applications | 2020 / Vol. 23
Keywords
Fuzzy support vector machine; Imbalanced dataset; Entropy; Pattern recognition; Nearest neighbor
DOI
Not available
Abstract
Imbalanced classification has been a major challenge for machine learning because many standard classifiers are designed for balanced datasets and tend to produce results biased toward the majority class. We modify the entropy fuzzy support vector machine (EFSVM) and introduce the instance-based entropy fuzzy support vector machine (IEFSVM). Both EFSVM and IEFSVM use the entropy information of the k-nearest neighbors to determine a fuzzy membership value for each sample, which sets the importance of that sample. IEFSVM considers the diversity of entropy patterns for each sample as the number of neighbors, k, increases, whereas EFSVM uses a single entropy value computed from a fixed number of neighbors for all samples. By varying k, the change in the composition of a sample's neighbors from near to far distance is reflected in the fuzzy membership value. Numerical experiments on 35 public and 12 real-world imbalanced datasets are performed to validate IEFSVM, and the area under the receiver operating characteristic curve (AUC) is used to compare its performance with other SVMs and machine learning methods. IEFSVM shows a much higher AUC value for datasets with a high imbalance ratio, implying that IEFSVM is effective in dealing with the class imbalance problem.
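The idea of deriving a per-sample weight from neighborhood entropy at several values of k can be illustrated with a small sketch. This is not the paper's formula: the record does not give the membership function, so the averaging over k, the linear mapping from entropy to weight, and the names `fuzzy_memberships`, `k_values`, and `beta` are all assumptions made for illustration.

```python
import math


def class_entropy(labels):
    """Shannon entropy (natural log) of a list of binary labels (0 or 1)."""
    n = len(labels)
    p_pos = sum(1 for lab in labels if lab == 1) / n
    h = 0.0
    for p in (p_pos, 1.0 - p_pos):
        if p > 0.0:
            h -= p * math.log(p)
    return h


def neighbor_entropies(X, y, i, k_values):
    """Label entropy of sample i's k nearest neighbors, one value per k."""
    # Sort all other samples by Euclidean distance to sample i.
    order = sorted(
        (j for j in range(len(X)) if j != i),
        key=lambda j: math.dist(X[i], X[j]),
    )
    return [class_entropy([y[j] for j in order[:k]]) for k in k_values]


def fuzzy_memberships(X, y, k_values=(3, 5, 7), beta=0.5, minority=1):
    """Hypothetical weighting: minority samples keep weight 1; majority
    samples are down-weighted by their mean neighborhood entropy over
    k_values (an assumed rule, not the one defined in the paper)."""
    h_max = math.log(2)  # maximum entropy of a binary label distribution
    weights = []
    for i in range(len(X)):
        if y[i] == minority:
            weights.append(1.0)
        else:
            hs = neighbor_entropies(X, y, i, k_values)
            mean_h = sum(hs) / len(hs)
            weights.append(1.0 - beta * mean_h / h_max)
    return weights


# Toy data: three majority points in one cluster, plus one minority point
# with a single majority point right next to it.
X = [(0, 0), (0, 1), (1, 0), (5, 5), (5, 6)]
y = [0, 0, 0, 1, 0]
weights = fuzzy_memberships(X, y, k_values=(1, 2), beta=0.5)
```

In this toy example the majority point next to the minority point receives a lower weight than the majority points deep inside their own cluster. Weights of this kind could then be supplied to an SVM that accepts per-sample weights, for example through the `sample_weight` argument of scikit-learn's `SVC.fit`.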
Pages: 1183-1202
Page count: 19