A NOVEL RULE-BASED OVERSAMPLING APPROACH FOR IMBALANCED DATA CLASSIFICATION

被引:0
|
作者
Zhang, Xiao [1 ]
Paz, Ivan [1 ]
Nebot, Angela [1 ]
机构
[1] Univ Politecn Cataluna, Soft Comp Res Grp, Intelligent Data Sci & Artificial Intelligence Re, Barcelona, Spain
来源
37TH ANNUAL EUROPEAN SIMULATION AND MODELLING CONFERENCE 2023, ESM 2023 | 2023年
关键词
Rule-based approach; Oversampling; Data synthesis; Imbalanced data; Classification; DATA-SETS; SMOTE;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
When confronted with imbalanced datasets, traditional classifiers frequently struggle to correctly categorize samples from the minority class, adversely impacting the overall predictive performance of machine learning models. Current oversampling techniques generally focus on data interpolation through neighbor selection, often neglecting to uncover underlying data structures and relationships. This study introduces a novel application for RuLer, an algorithm originally developed for identifying sound patterns in the artistic domain of live coding. When adapted for data oversampling (as Ad-RuLer), the algorithm shows significant promise in addressing the challenges associated with imbalanced class distribution. We undertake a thorough comparative evaluation of Ad-RuLer against established oversampling algorithms such as SMOTE, ADASYN, Tomek-links, Borderline-SMOTE, and KmeansSMOTE. The evaluation employs various classifiers including logistic regression, random forest, and XGBoost, and is conducted over six real-world biomedical datasets with varying degrees of imbalance.
引用
收藏
页码:208 / 212
页数:5
相关论文
共 50 条
  • [41] MKC-SMOTE: A Novel Synthetic Oversampling Method for Multi-Class Imbalanced Data Classification
    Wang, Jiao
    Awang, Norhashidah
    IEEE ACCESS, 2024, 12 : 196929 - 196938
  • [42] Self-adaptive oversampling method based on the complexity of minority data in imbalanced datasets classification
    Tao, Xinmin
    Guo, Xinyue
    Zheng, Yujia
    Zhang, Xiaohan
    Chen, Zhiyu
    KNOWLEDGE-BASED SYSTEMS, 2023, 277
  • [43] Combining Random Subspace Approach with smote Oversampling for Imbalanced Data Classification
    Ksieniewicz, Pawel
    HYBRID ARTIFICIAL INTELLIGENT SYSTEMS, HAIS 2019, 2019, 11734 : 660 - 673
  • [44] A fuzzy association rule-based classifier for imbalanced classification problems
    Sanz, J.
    Sesma-Sara, M.
    Bustince, H.
    INFORMATION SCIENCES, 2021, 577 : 265 - 279
  • [45] Imbalanced Learning with Oversampling based on Classification Contribution Degree
    Jiang, Zhenhao
    Yang, Jie
    Liu, Yan
    ADVANCED THEORY AND SIMULATIONS, 2021, 4 (05)
  • [46] Counterfactual-based minority oversampling for imbalanced classification
    Wang, Shu
    Luo, Hao
    Huang, Shanshan
    Li, Qingsong
    Liu, Li
    Su, Guoxin
    Liu, Ming
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 122
  • [47] Constrained Oversampling: An Oversampling Approach to Reduce Noise Generation in Imbalanced Datasets With Class Overlapping
    Liu, Changhui
    Jin, Sun
    Wang, Donghong
    Luo, Zichao
    Yu, Jianbo
    Zhou, Binghai
    Yang, Changlin
    IEEE ACCESS, 2022, 10 : 91452 - 91465
  • [48] A new rule-based knowledge extraction approach for imbalanced datasets
    Mahani, Aouatef
    Baba-Ali, Ahmed Riadh
    KNOWLEDGE AND INFORMATION SYSTEMS, 2019, 61 (03) : 1303 - 1329
  • [49] A new rule-based knowledge extraction approach for imbalanced datasets
    Aouatef Mahani
    Ahmed Riadh Baba-Ali
    Knowledge and Information Systems, 2019, 61 : 1303 - 1329
  • [50] Global-local information based oversampling for multi-class imbalanced data
    Han, Mingming
    Guo, Husheng
    Li, Jinyan
    Wang, Wenjian
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2023, 14 (06) : 2071 - 2086