Article FLEX-SMOTE: Synthetic over-sampling technique that flexibly adjusts to different minority class distributions

被引:2
|
作者
Bunkhumpornpat, Chumphol [1 ,2 ]
Boonchieng, Ekkarat [1 ,2 ]
Chouvatut, Varin [1 ,2 ]
Lipsky, David [1 ,2 ]
机构
[1] Chiang Mai Univ, Fac Sci, Dept Comp Sci, Chiang Mai 50200, Thailand
[2] Chiang Mai Univ, Ctr Excellence Community Hlth Informat, Chiang Mai 50200, Thailand
来源
PATTERNS | 2024年 / 5卷 / 11期
关键词
Highlights;
D O I
10.1016/j.patter.2024.101073
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
THE BIGGER PICTURE Machine learning methods often encounter imbalanced classification problems, particularly when dealing with binary classification. This occurs when the distribution of classes in the training dataset is uneven, which can lead to bias in the trained model. Fraud detection, claim prediction, default prediction, spam filtering, disease screening, churn prediction, anomaly detection, and outlier identification tasks are a few examples of imbalanced classification issues. To enhance the performance and ensure the accuracy of a model, it is crucial to address the issue of class imbalance. Predictive modeling is complicated by imbalanced datasets, but this is to be expected, as the real world consists of biased cases. By preventing the dataset from becoming biased toward one class, balancing it makes it easier to train a model. To put it another way, the model will not continue to favor the majority class solely based on having more data.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] SMOTE: Synthetic minority over-sampling technique
    Chawla, Nitesh V.
    Bowyer, Kevin W.
    Hall, Lawrence O.
    Kegelmeyer, W. Philip
    2002, American Association for Artificial Intelligence (16):
  • [2] Handling Autism Imbalanced Data using Synthetic Minority Over-Sampling Technique (SMOTE)
    El-Sayed, Asmaa Ahmed
    Meguid, Nagwa Abdel
    Mahmood, Mahmood Abdel Manem
    Hefny, Hesham Ahmed
    PROCEEDINGS OF 2015 THIRD IEEE WORLD CONFERENCE ON COMPLEX SYSTEMS (WCCS), 2015,
  • [3] Safe-Level-SMOTE: Safe-Level-Synthetic Minority Over-Sampling TEchnique for Handling the Class Imbalanced Problem
    Bunkhumpornpat, Chumphol
    Sinapiromsaran, Krung
    Lursinsap, Chidchanok
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2009, 5476 : 475 - 482
  • [4] On the Use of Surrounding Neighbors for Synthetic Over-Sampling of the Minority Class
    Garcia, V.
    Sanchez, J. S.
    Mollineda, R. A.
    SMO 08: PROCEEDINGS OF THE 8TH WSEAS INTERNATIONAL CONFERENCE ON SIMULATION, MODELLING AND OPTIMIZATION, 2008, : 389 - +
  • [5] LVQ-SMOTE - Learning Vector Quantization based Synthetic Minority Over-sampling Technique for biomedical data
    Nakamura, Munehiro
    Kajiwara, Yusuke
    Otsuka, Atsushi
    Kimura, Haruhiko
    BIODATA MINING, 2013, 6
  • [6] DBSMOTE: Density-Based Synthetic Minority Over-sampling TEchnique
    Bunkhumpornpat, Chumphol
    Sinapiromsaran, Krung
    Lursinsap, Chidchanok
    APPLIED INTELLIGENCE, 2012, 36 (03) : 664 - 684
  • [7] DBSMOTE: Density-Based Synthetic Minority Over-sampling TEchnique
    Chumphol Bunkhumpornpat
    Krung Sinapiromsaran
    Chidchanok Lursinsap
    Applied Intelligence, 2012, 36 : 664 - 684
  • [8] RCSMOTE: Range-Controlled synthetic minority over-sampling technique for handling the class imbalance problem
    Soltanzadeh, Paria
    Hashemzadeh, Mahdi
    INFORMATION SCIENCES, 2021, 542 : 92 - 111
  • [9] Multi-fidelity model based on synthetic minority over-sampling technique
    Jiuxiang Song
    Jizhong Liu
    Multimedia Tools and Applications, 2024, 83 : 33123 - 33139
  • [10] Imbalanced data classification using improved synthetic minority over-sampling technique
    Anusha, Yamijala
    Visalakshi, R.
    Srinivas, Konda
    MULTIAGENT AND GRID SYSTEMS, 2023, 19 (02) : 117 - 131