DSPOTE: Density-induced Selection Probability-based Oversampling TEchnique for Imbalanced Learning

被引:0
作者
Wei, Zhen [1 ]
Zhang, Li [1 ]
Zhao, Lei [1 ]
机构
[1] Soochow Univ, Sch Comp Sci & Technol, Suzhou 215006, Peoples R China
来源
2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR) | 2022年
关键词
SAMPLING METHOD; SMOTE; CLASSIFICATION;
D O I
10.1109/ICPR56361.2022.9956583
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In imbalanced learning, oversampling is incredibly prevalent. However, it is disappointing that existing oversampling methods have their own limitations, such as new synthetic samples may be uninformative or noisy. To better address imbalanced learning tasks, this paper proposes a novel oversampling method, named Density-induced Selection Probability-based Over-sampling TEchnique (DSPOTE). To increase the number of samples in the minority class, DSPOTE designs a novel scheme for filtering noisy samples based on the Chebychev distance and a new way of calculating selection probability based on relative density. DSPOTE first filters noisy samples and then gets borderline ones from the minority class. Next, DSPOTE calculates the selection probabilities for all borderline samples and applies these probabilities to pick up borderline samples. Finally, DSPOTE generates synthetic samples for the minority class based on the selected borderline ones. Experimental results indicate that our method has good performance in terms of metrics, recall and AUC (Area Under the Curve), when compared with other eight methods.
引用
收藏
页码:2165 / 2171
页数:7
相关论文
共 50 条
  • [41] Minimum Bayesian error probability-based gene subset selection
    Li, Jian
    Yu, Tian
    Wei, Jin-Mao
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2015, 12 (04) : 434 - 450
  • [42] SVDD boundary and DPC clustering technique-based oversampling approach for handling imbalanced and overlapped data
    Tao, Xinmin
    Chen, Wei
    Zhang, Xiaohan
    Guo, Wenjie
    Qi, Lin
    Fan, Zhiting
    KNOWLEDGE-BASED SYSTEMS, 2021, 234
  • [43] SPAW-SMOTE: Space Partitioning Adaptive Weighted Synthetic Minority Oversampling Technique For Imbalanced Data Set Learning
    Zhang, Qiang
    He, Junjiang
    Li, Tao
    Lan, Xiaolong
    Fang, Wenbo
    Li, Yihong
    COMPUTER JOURNAL, 2023, 67 (05) : 1747 - 1762
  • [44] A novel synthetic minority oversampling technique based on relative and absolute densities for imbalanced classification
    Liu, Ruijuan
    APPLIED INTELLIGENCE, 2023, 53 (01) : 786 - 803
  • [45] CMO-SMOTE: Misclassification Cost Minimization Oriented Synthetic Minority Oversampling Technique for Imbalanced Learning
    Zhou, Changsheng
    Liu, Bin
    Wang, Shihai
    2016 8TH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS (IHMSC), VOL. 2, 2016, : 353 - 358
  • [46] Clustering-based improved adaptive synthetic minority oversampling technique for imbalanced data classification
    Jin, Dian
    Xie, Dehong
    Liu, Di
    Gong, Murong
    INTELLIGENT DATA ANALYSIS, 2023, 27 (03) : 635 - 652
  • [47] A Synthetic Minority Oversampling Technique Based on Gaussian Mixture Model Filtering for Imbalanced Data Classification
    Xu, Zhaozhao
    Shen, Derong
    Kou, Yue
    Nie, Tiezheng
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (03) : 3740 - 3753
  • [48] A GAN and Feature Selection-Based Oversampling Technique for Intrusion Detection
    Liu, Xiaodong
    Li, Tong
    Zhang, Runzi
    Wu, Di
    Liu, Yongheng
    Yang, Zhen
    SECURITY AND COMMUNICATION NETWORKS, 2021, 2021
  • [49] CL-SR: Boosting Imbalanced Image Classification with Contrastive Learning and Synthetic Minority Oversampling Technique Based on Rough Set Theory Integration
    Gao, Xiaoling
    Jamil, Nursuriati
    Ramli, Muhammad Izzad
    APPLIED SCIENCES-BASEL, 2024, 14 (23):
  • [50] SA-CGAN: An oversampling method based on single attribute guided conditional GAN for multi-class imbalanced learning
    Dong, Yongfeng
    Xiao, Huaxin
    Dong, Yao
    NEUROCOMPUTING, 2022, 472 : 326 - 337