Improved SMOTE algorithm for imbalanced dataset

被引:0
|
作者
Zheng Hengyu [1 ]
机构
[1] South China Univ Technol, Sch Automat Sci & Engn, Guangzhou, Peoples R China
关键词
SMOTE; Unbalanced dataset; SVM; Confusion Matrix;
D O I
10.1109/CAC51589.2020.9326603
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
When applying traditional classifiers to imbalanced dataset, the result might be bias towards the majority class, which leads to poor performance of classifiers. Synthetic Minority Oversampling Technique(SMOTE) is a popular algorithm to improve the classifier's performance through generating new minority samples and making dataset balanced. Based on SMOTE, two new over-sampling algorithms DSMOTE and ESMOTE are proposed in this paper. Being different with SMOTE which treats all minority samples equally, the two new over-sampling algorithms mainly synthesize new samples near the easily misclassified samples to improve the classification accuracy of minority class. Experiments show that DSMOTE and ESMOTE could both get better performance than SMOTE.
引用
收藏
页码:693 / 697
页数:5
相关论文
共 50 条
  • [31] A Classification Method of Imbalanced Big Data Based on Improved SMOTE and Stacked LSTM
    Xu, Wentao
    Journal of Network Intelligence, 2023, 8 (01): : 100 - 112
  • [32] An Improved Method of Detecting Macro Malware on an Imbalanced Dataset
    Mimura, Mamoru
    IEEE ACCESS, 2020, 8 : 204709 - 204717
  • [33] An Improved MAHAKIL Oversampling Method for Imbalanced Dataset Classification
    Zhang, Yong
    Zuo, Tingting
    Fang, Lichao
    Li, Jun
    Xing, Zongyi
    IEEE ACCESS, 2021, 9 : 16030 - 16040
  • [34] An effective convolutional neural network based on SMOTE and Gaussian mixture model for intrusion detection in imbalanced dataset
    Zhang, Hongpo
    Huang, Lulu
    Wu, Chase Q.
    Li, Zhanbo
    COMPUTER NETWORKS, 2020, 177
  • [35] An improved SMOTE based on center offset factor and synthesis strategy for imbalanced data classification
    Zhang, Ying
    Deng, Li
    Huang, Hefeng
    Wei, Bo
    JOURNAL OF SUPERCOMPUTING, 2024, 80 (15): : 22479 - 22519
  • [36] An Improved SMOTE Intelligent Algorithm for Data Set Reconstruction
    Xu, Yingcheng
    Feng, Wei
    Pei, Fei
    Wang, Haiyan
    BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2020, 127 : 128 - 129
  • [37] Imbalanced Data Classification Based on Improved Random-SMOTE and Feature Standard Deviation
    Zhang, Ying
    Deng, Li
    Wei, Bo
    MATHEMATICS, 2024, 12 (11)
  • [38] Improved SMOTE based LSDA for Class Imbalanced Fault Diagnosis in TE Industrial Process
    Zhang, Zi-Yang
    Xia, Tao
    2022 34TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2022, : 1758 - 1762
  • [39] Improved AdaBoost Model for User's QoE in Imbalanced Dataset
    Liu, Qifeng
    Wei, Xin
    Huang, Ruochen
    Meng, Hao
    Qian, Yi
    2016 8TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS & SIGNAL PROCESSING (WCSP), 2016,
  • [40] Enhanced SMOTE Algorithm for Classification of Imbalanced Big-Data using Random Forest
    Bhagat, Reshma C.
    Patil, Sachin S.
    2015 IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE (IACC), 2015, : 403 - 408