Instance-based entropy fuzzy support vector machine for imbalanced data

Cited by: 0
Authors
Poongjin Cho
Minhyuk Lee
Woojin Chang
Affiliations
[1] Seoul National University, Department of Industrial Engineering
[2] Samsung Electronics, Big Data Analytics Group, Mobile Communications Business
Source
Pattern Analysis and Applications | 2020 / Vol. 23
Keywords
Fuzzy support vector machine; Imbalanced dataset; Entropy; Pattern recognition; Nearest neighbor
DOI
Not available
Abstract
Imbalanced classification remains a major challenge in machine learning because many standard classifiers assume balanced datasets and tend to produce results biased toward the majority class. We modify the entropy fuzzy support vector machine (EFSVM) and introduce the instance-based entropy fuzzy support vector machine (IEFSVM). Both EFSVM and IEFSVM use the entropy of each sample's k-nearest neighbors to determine a fuzzy membership value that sets the importance of that sample. IEFSVM considers how the entropy pattern of each sample changes as the neighborhood size k increases, whereas EFSVM uses a single entropy value computed with one fixed neighborhood size for all samples. By varying k, the fuzzy membership value reflects how the composition of a sample's neighbors changes from near to far. Numerical experiments on 35 public and 12 real-world imbalanced datasets validate IEFSVM, with the area under the receiver operating characteristic curve (AUC) used to compare its performance against other SVMs and machine learning methods. IEFSVM achieves a much higher AUC on datasets with a high imbalance ratio, which suggests that it is effective in dealing with the class imbalance problem.
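The idea of deriving a fuzzy membership value from neighbor entropy at several values of k can be illustrated with a minimal sketch. This is not the paper's formulation. The function names, the averaging of entropy over the chosen k values, the linear mapping controlled by `beta`, and the choice to give every minority sample (label 1) a membership of 1.0 are all assumptions made here for illustration.

```python
import math


def knn_entropy(points, labels, i, k):
    """Shannon entropy of the class labels among the k nearest neighbors of sample i."""
    # Squared Euclidean distance from sample i to every other sample.
    dists = sorted(
        (sum((a - b) ** 2 for a, b in zip(points[i], points[j])), j)
        for j in range(len(points))
        if j != i
    )
    neighbor_labels = [labels[j] for _, j in dists[:k]]
    p_pos = sum(1 for y in neighbor_labels if y == 1) / len(neighbor_labels)
    entropy = 0.0
    for p in (p_pos, 1.0 - p_pos):
        if p > 0.0:  # treat 0 * log(0) as 0
            entropy -= p * math.log(p)
    return entropy


def fuzzy_memberships(points, labels, ks=(3, 5), beta=0.5):
    """Hypothetical membership rule (an assumption, not the paper's formula).

    Minority samples (label 1) get 1.0. Majority samples get
    1 - beta * (mean entropy over ks) / ln 2, so samples whose neighborhoods
    stay mixed as k grows receive lower weight.
    """
    memberships = []
    for i, y in enumerate(labels):
        if y == 1:
            memberships.append(1.0)
            continue
        mean_entropy = sum(knn_entropy(points, labels, i, k) for k in ks) / len(ks)
        memberships.append(1.0 - beta * mean_entropy / math.log(2))
    return memberships
```

Under these assumptions, a majority sample surrounded only by majority samples at every k keeps a membership of 1.0, while a majority sample sitting next to minority samples is down-weighted toward 1 - beta. Values like these could then be supplied as per-sample weights when fitting an SVM, for example through the `sample_weight` argument of scikit-learn's `SVC.fit`.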
Pages: 1183 - 1202
Number of pages: 19
Related papers
50 in total
  • [1] Instance-based entropy fuzzy support vector machine for imbalanced data
    Cho, Poongjin
    Lee, Minhyuk
    Chang, Woojin
    PATTERN ANALYSIS AND APPLICATIONS, 2020, 23 (03) : 1183 - 1202
  • [2] Entropy-based fuzzy support vector machine for imbalanced datasets
    Fan, Qi
    Wang, Zhe
    Li, Dongdong
    Gao, Daqi
    Zha, Hongyuan
    KNOWLEDGE-BASED SYSTEMS, 2017, 115 : 87 - 99
  • [3] Application of Instance-Based Entropy Fuzzy Support Vector Machine in Peer-To-Peer Lending Investment Decision
    Cho, Poongjin
    Chang, Woojin
    Song, Jae Wook
    IEEE ACCESS, 2019, 7 : 16925 - 16939
  • [4] Fuzzy support vector machine for imbalanced data with borderline noise
    Liu, Jie
    FUZZY SETS AND SYSTEMS, 2021, 413 : 64 - 73
  • [5] Deep Learning-Based Imbalanced Classification With Fuzzy Support Vector Machine
    Wang, Ke-Fan
    An, Jing
    Wei, Zhen
    Cui, Can
    Ma, Xiang-Hua
    Ma, Chao
    Bao, Han-Qiu
    FRONTIERS IN BIOENGINEERING AND BIOTECHNOLOGY, 2022, 9
  • [6] Entropy-based matrix learning machine for imbalanced data sets
    Zhu, Changming
    Wang, Zhe
    PATTERN RECOGNITION LETTERS, 2017, 88 : 72 - 80
  • [7] Performance of Support Vector Machine in Imbalanced Data Set
    Novakovic, Jasmina
    Markovic, Suzana
    2020 19TH INTERNATIONAL SYMPOSIUM INFOTEH-JAHORINA (INFOTEH), 2020,
  • [8] Imbalanced Data Classification Based on Hybrid Resampling and Twin Support Vector Machine
    Cao, Lu
    Shen, Hong
    COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2017, 14 (03) : 579 - 595
  • [9] Kernel local outlier factor-based fuzzy support vector machine for imbalanced classification
    Wang, Kefan
    An, Jing
    Yu, Zibo
    Yin, Xingshu
    Ma, Chao
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2021, 33 (13)
  • [10] A fuzzy twin support vector machine based on information entropy for class imbalance learning
    Gupta, Deepak
    Richhariya, Bharat
    Borah, Parashjyoti
    NEURAL COMPUTING AND APPLICATIONS, 2019, 31 : 7153 - 7164