DSPOTE: Density-induced Selection Probability-based Oversampling TEchnique for Imbalanced Learning

Cited by: 0
Authors
Wei, Zhen [1]
Zhang, Li [1]
Zhao, Lei [1]
Affiliations
[1] Soochow Univ, Sch Comp Sci & Technol, Suzhou 215006, Peoples R China
Source
2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR) | 2022
Keywords
SAMPLING METHOD; SMOTE; CLASSIFICATION
DOI
10.1109/ICPR56361.2022.9956583
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In imbalanced learning, oversampling is widely used. However, existing oversampling methods have limitations; for example, the synthetic samples they generate may be uninformative or noisy. To better address imbalanced learning tasks, this paper proposes a novel oversampling method named Density-induced Selection Probability-based Oversampling TEchnique (DSPOTE). To increase the number of samples in the minority class, DSPOTE introduces a scheme for filtering noisy samples based on the Chebyshev distance and a way of calculating selection probabilities based on relative density. DSPOTE first filters out noisy samples and then identifies the borderline samples of the minority class. Next, it calculates a selection probability for every borderline sample and uses these probabilities to select borderline samples. Finally, it generates synthetic samples for the minority class from the selected borderline samples. Experimental results indicate that the method performs well in terms of the recall and AUC (Area Under the Curve) metrics when compared with eight other methods.
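The four stages named in the abstract (noise filtering, borderline detection, probability-based selection, synthetic generation) can be illustrated with a rough NumPy sketch. The abstract does not state the actual rules or formulas, so everything specific here is an assumption made for illustration and is not the authors' method: the function name `dspote_like_oversample`, the parameters `k` and `n_new`, the "all k neighbours are majority" noise rule, the "at least one majority neighbour" borderline rule, the weight `d_min / (d_min + d_maj)` standing in for relative density, and the SMOTE-style interpolation.

```python
import numpy as np

def chebyshev_dist(A, B):
    """Pairwise Chebyshev (L-infinity) distances between rows of A and rows of B."""
    return np.abs(A[:, None, :] - B[None, :, :]).max(axis=2)

def dspote_like_oversample(X, y, minority=1, k=3, n_new=10, seed=0):
    """Hypothetical four-stage sketch; the rules below are assumptions."""
    rng = np.random.default_rng(seed)
    X = np.asarray(X, dtype=float)
    y = np.asarray(y)
    min_idx = np.flatnonzero(y == minority)
    maj_idx = np.flatnonzero(y != minority)

    # k nearest neighbours of each minority sample in the whole data set
    D = chebyshev_dist(X[min_idx], X)
    D[np.arange(len(min_idx)), min_idx] = np.inf   # exclude the sample itself
    nn = np.argsort(D, axis=1)[:, :k]
    n_maj = (y[nn] != minority).sum(axis=1)

    # Stage 1 (assumed rule): noisy if all k neighbours are majority samples
    keep = n_maj < k
    # Stage 2 (assumed rule): borderline if at least one neighbour is majority
    border = keep & (n_maj > 0)
    clean, cand = min_idx[keep], min_idx[border]
    if len(cand) == 0 or len(clean) < 2:
        return np.empty((0, X.shape[1]))

    # Stage 3 (assumed formula): weight grows when a sample is far from its
    # own class relative to its distance from the majority class
    k_min = min(k, len(clean) - 1)
    D_min = chebyshev_dist(X[cand], X[clean])
    D_min[np.arange(len(cand)), np.searchsorted(clean, cand)] = np.inf
    nn_min = np.argsort(D_min, axis=1)[:, :k_min]
    d_min = np.sort(D_min, axis=1)[:, :k_min].mean(axis=1)
    d_maj = np.sort(chebyshev_dist(X[cand], X[maj_idx]), axis=1)[:, :k].mean(axis=1)
    w = d_min / (d_min + d_maj + 1e-12)
    p = w / w.sum() if w.sum() > 0 else np.full(len(cand), 1.0 / len(cand))

    # Stage 4: SMOTE-style interpolation from the selected borderline samples
    seeds = rng.choice(len(cand), size=n_new, p=p)
    partner = nn_min[seeds, rng.integers(0, k_min, size=n_new)]
    gap = rng.random((n_new, 1))
    return X[cand[seeds]] + gap * (X[clean[partner]] - X[cand[seeds]])
```

Calling `dspote_like_oversample(X, y, minority=1, k=3, n_new=10)` returns an array of `n_new` synthetic minority points, or an empty array when no borderline sample is found. Because each synthetic point is an interpolation between two retained minority samples, a minority point that was filtered out as noise never contributes to the output.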
Pages: 2165 - 2171
Number of pages: 7
Related papers
50 in total
  • [31] LSMOTE: A link-based Synthetic Minority Oversampling Technique for binary imbalanced datasets
    Cai, Qin-Nan
    Zhang, Zhong-Liang
    Wu, Yu-Heng
    Zhang, Xiu-Ming
    NEUROCOMPUTING, 2024, 608
  • [32] A Novel Oversampling Method for Imbalanced Datasets Based on Density Peaks Clustering
    Cao, Jie
    Shi, Yong
    TEHNICKI VJESNIK-TECHNICAL GAZETTE, 2021, 28 (06) : 1813 - 1819
  • [33] Fuzzy-synthetic minority oversampling technique: Oversampling based on fuzzy set theory for Android malware detection in imbalanced datasets
    Xu, Yanping
    Wu, Chunhua
    Zheng, Kangfeng
    Niu, Xinxin
    Yang, Yixian
    INTERNATIONAL JOURNAL OF DISTRIBUTED SENSOR NETWORKS, 2017, 13 (04):
  • [34] A new instance density-based synthetic minority oversampling method for imbalanced classification problems
    Ma, Chung-Kang
    Park, You-Jin
    ENGINEERING OPTIMIZATION, 2022, 54 (10) : 1743 - 1757
  • [35] EDOS: Entropy Difference-based Oversampling Approach for Imbalanced Learning
    Li, Lusi
    He, Haibo
    Li, Jie
    Li, Weijun
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [36] An Improved Oversampling Algorithm Based on the Samples' Selection Strategy for Classifying Imbalanced Data
    Xie, Wenhao
    Liang, Gongqian
    Dong, Zhonghui
    Tan, Baoyu
    Zhang, Baosheng
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2019, 2019
  • [37] MWMOTE-Majority Weighted Minority Oversampling Technique for Imbalanced Data Set Learning
    Barua, Sukarna
    Islam, Md. Monirul
    Yao, Xin
    Murase, Kazuyuki
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (02) : 405 - 425
  • [38] A Weakly Supervised Learning-Based Oversampling Framework for Class-Imbalanced Fault Diagnosis
    Qian, Min
    Li, Yan-Fu
    IEEE TRANSACTIONS ON RELIABILITY, 2022, 71 (01) : 429 - 442
  • [39] Improving imbalanced learning through a heuristic oversampling method based on k-means and SMOTE
    Douzas, Georgios
    Bacao, Fernando
    Last, Felix
    INFORMATION SCIENCES, 2018, 465 : 1 - 20
  • [40] Genetic Algorithm-Based Oversampling Technique to Learn from Imbalanced Data
    Saladi, Puneeth Srinivas Mohan
    Dash, Tirtharaj
    SOFT COMPUTING FOR PROBLEM SOLVING, SOCPROS 2017, VOL 1, 2019, 816 : 387 - 397