A NOVEL RULE-BASED OVERSAMPLING APPROACH FOR IMBALANCED DATA CLASSIFICATION

被引:0
|
作者
Zhang, Xiao [1 ]
Paz, Ivan [1 ]
Nebot, Angela [1 ]
机构
[1] Univ Politecn Cataluna, Soft Comp Res Grp, Intelligent Data Sci & Artificial Intelligence Re, Barcelona, Spain
来源
37TH ANNUAL EUROPEAN SIMULATION AND MODELLING CONFERENCE 2023, ESM 2023 | 2023年
关键词
Rule-based approach; Oversampling; Data synthesis; Imbalanced data; Classification; DATA-SETS; SMOTE;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
When confronted with imbalanced datasets, traditional classifiers frequently struggle to correctly categorize samples from the minority class, adversely impacting the overall predictive performance of machine learning models. Current oversampling techniques generally focus on data interpolation through neighbor selection, often neglecting to uncover underlying data structures and relationships. This study introduces a novel application for RuLer, an algorithm originally developed for identifying sound patterns in the artistic domain of live coding. When adapted for data oversampling (as Ad-RuLer), the algorithm shows significant promise in addressing the challenges associated with imbalanced class distribution. We undertake a thorough comparative evaluation of Ad-RuLer against established oversampling algorithms such as SMOTE, ADASYN, Tomek-links, Borderline-SMOTE, and KmeansSMOTE. The evaluation employs various classifiers including logistic regression, random forest, and XGBoost, and is conducted over six real-world biomedical datasets with varying degrees of imbalance.
引用
收藏
页码:208 / 212
页数:5
相关论文
共 50 条
  • [21] Hyperspectral Image Classification with Imbalanced Data Based on Oversampling and Convolutional Neural Network
    Cai, Lei
    Zhang, Geng
    AI IN OPTICS AND PHOTONICS (AOPC 2019), 2019, 11342
  • [22] A novel oversampling and feature selection hybrid algorithm for imbalanced data classification
    Fang Feng
    Kuan-Ching Li
    Erfu Yang
    Qingguo Zhou
    Lihong Han
    Amir Hussain
    Mingjiang Cai
    Multimedia Tools and Applications, 2023, 82 : 3231 - 3267
  • [23] A quantum-based oversampling method for classification of highly imbalanced and overlapped data
    Yang, Bei
    Tian, Guilan
    Luttrell, Joseph
    Gong, Ping
    Zhang, Chaoyang
    EXPERIMENTAL BIOLOGY AND MEDICINE, 2023, 248 (24) : 2500 - 2513
  • [24] Distance-based arranging oversampling technique for imbalanced data
    Dai, Qi
    Liu, Jian-wei
    Zhao, Jia-Liang
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (02) : 1323 - 1342
  • [25] An oversampling framework for imbalanced classification based on Laplacian eigenmaps
    Ye, Xiucai
    Li, Hongmin
    Imakura, Akira
    Sakurai, Tetsuya
    NEUROCOMPUTING, 2020, 399 : 107 - 116
  • [26] Multi-oversampling with Evidence Fusion for Imbalanced Data Classification
    Tian, Hongpeng
    Zhang, Zuowei
    Liu, Zhunga
    Zuo, Jingwei
    BELIEF FUNCTIONS: THEORY AND APPLICATIONS, BELIEF 2024, 2024, 14909 : 68 - 77
  • [27] Distance-based arranging oversampling technique for imbalanced data
    Qi Dai
    Jian-wei Liu
    Jia-Liang Zhao
    Neural Computing and Applications, 2023, 35 : 1323 - 1342
  • [28] A non-parameter oversampling approach for imbalanced data classification based on hybrid natural neighbors
    Lin, Junyue
    Liang, Lu
    APPLIED INTELLIGENCE, 2025, 55 (05)
  • [29] Improving interpolation-based oversampling for imbalanced data learning
    Zhu, Tuanfei
    Lin, Yaping
    Liu, Yonghe
    KNOWLEDGE-BASED SYSTEMS, 2020, 187
  • [30] Grouping-based Oversampling in Kernel Space for Imbalanced Data Classification
    Ren, Jinjun
    Wang, Yuping
    Cheung, Yiu-ming
    Gao, Xiao-Zhi
    Guo, Xiaofang
    PATTERN RECOGNITION, 2023, 133