A NOVEL RULE-BASED OVERSAMPLING APPROACH FOR IMBALANCED DATA CLASSIFICATION

Cited by: 0
Authors
Zhang, Xiao [1]
Paz, Ivan [1]
Nebot, Angela [1]
Affiliations
[1] Univ Politecn Cataluna, Soft Comp Res Grp, Intelligent Data Sci & Artificial Intelligence Re, Barcelona, Spain
Keywords
Rule-based approach; Oversampling; Data synthesis; Imbalanced data; Classification; DATA-SETS; SMOTE;
DOI
Not available
Chinese Library Classification (CLC) number
TP39 [Computer applications];
Discipline classification code
081203 ; 0835 ;
Abstract
When confronted with imbalanced datasets, traditional classifiers frequently struggle to correctly categorize samples from the minority class, adversely impacting the overall predictive performance of machine learning models. Current oversampling techniques generally focus on data interpolation through neighbor selection, often neglecting to uncover underlying data structures and relationships. This study introduces a novel application for RuLer, an algorithm originally developed for identifying sound patterns in the artistic domain of live coding. When adapted for data oversampling (as Ad-RuLer), the algorithm shows significant promise in addressing the challenges associated with imbalanced class distribution. We undertake a thorough comparative evaluation of Ad-RuLer against established oversampling algorithms such as SMOTE, ADASYN, Tomek-links, Borderline-SMOTE, and KmeansSMOTE. The evaluation employs various classifiers including logistic regression, random forest, and XGBoost, and is conducted over six real-world biomedical datasets with varying degrees of imbalance.
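For readers unfamiliar with the "data interpolation through neighbor selection" that the abstract attributes to current oversampling techniques, the following is a minimal sketch of that general idea in the style of SMOTE. It is not Ad-RuLer and not the authors' code; the function name `smote_like_oversample`, its parameters, and the toy data are assumptions made purely for illustration.

```python
import math
import random


def smote_like_oversample(minority, n_new, k=3, seed=0):
    """Create n_new synthetic points by interpolating between a randomly
    chosen minority sample and one of its k nearest minority neighbors.

    Hypothetical helper for illustration only (not from the paper).
    """
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # Rank the other minority samples by Euclidean distance to the base point.
        others = [p for p in minority if p is not base]
        others.sort(key=lambda p: math.dist(base, p))
        neighbor = rng.choice(others[:k])
        # Pick a random position on the segment between base and neighbor.
        gap = rng.random()
        synthetic.append([b + gap * (n - b) for b, n in zip(base, neighbor)])
    return synthetic


# Toy minority class with two features (made-up values).
minority = [[1.0, 1.0], [1.2, 0.9], [0.8, 1.1], [1.1, 1.3]]
new_points = smote_like_oversample(minority, n_new=6)
print(len(new_points))
```

Every synthetic point lies on a line segment between two existing minority samples, so this kind of method cannot leave the region the minority class already spans. The abstract's criticism is that such interpolation does not try to uncover underlying structure in the data.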
Pages: 208-212
Page count: 5
Related papers
50 in total
  • [31] A Rule-Based Classification Algorithm for Uncertain Data
    Qin, Biao
    Xia, Yuni
    Prabhakar, Sunil
    Tu, Yicheng
    ICDE: 2009 IEEE 25TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2009, : 1633 - +
  • [32] A new rule-based video classification approach
    Yuan, Y
    Shen, JY
    Song, QB
    2003 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-5, PROCEEDINGS, 2003, : 225 - 230
  • [33] Classification of IUE spectra: A rule-based approach
    Rampazzo, R.
    Heck, A.
    Murtagh, F.
    ESA journal, 1988, 12 (03): : 385 - 394
  • [34] CLASSIFICATION OF IUE SPECTRA - A RULE-BASED APPROACH
    RAMPAZZO, R
    HECK, A
    MURTAGH, F
    ESA JOURNAL-EUROPEAN SPACE AGENCY, 1988, 12 (03): : 385 - 394
  • [35] A Genetic Learning of the Fuzzy Rule-Based Classification System Granularity for highly Imbalanced Data-Sets
    Villar, Pedro
    Fernandez, Alberto
    Herrera, Francisco
    2009 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-3, 2009, : 1689 - +
  • [36] Noise-robust oversampling for imbalanced data classification
    Liu, Yongxu
    Liu, Yan
    Yu, Bruce X. B.
    Zhong, Shenghua
    Hu, Zhejing
    PATTERN RECOGNITION, 2023, 133
  • [37] Oversampling for Imbalanced Data Classification Using Adversarial Network
    Lee, Sang-Kwang
    Hong, Seung-Jin
    Yang, Seong-Il
    2018 INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY CONVERGENCE (ICTC), 2018, : 1255 - 1257
  • [38] Oversampling boosting for classification of imbalanced software defect data
    Li, Guangling
    Wang, Shihai
    PROCEEDINGS OF THE 35TH CHINESE CONTROL CONFERENCE 2016, 2016, : 4149 - 4154
  • [39] Designing the rule classification with oversampling approach with high accuracy for imbalanced data in semiconductor production lines (vol 81, 36437, 2021)
    Wang, Hsiao-Yu
    Tsung, Chen-Kun
    Hung, Ching-Hua
    Chen, Chen-Huei
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (18) : 28671 - 28671
  • [40] Sample and rule centric approach for associative classification on imbalanced data
    Yang G.
    Cui X.
    Zhang X.
    Xitong Gongcheng Lilun yu Shijian/System Engineering Theory and Practice, 2017, 37 (04): : 1035 - 1045