A NOVEL RULE-BASED OVERSAMPLING APPROACH FOR IMBALANCED DATA CLASSIFICATION

被引：0

作者：

Zhang, Xiao ^{[1
]}

Paz, Ivan ^{[1
]}

Nebot, Angela ^{[1
]}

机构：

[1] Univ Politecn Cataluna, Soft Comp Res Grp, Intelligent Data Sci & Artificial Intelligence Re, Barcelona, Spain

来源：

37TH ANNUAL EUROPEAN SIMULATION AND MODELLING CONFERENCE 2023, ESM 2023 | 2023年

关键词：

Rule-based approach; Oversampling; Data synthesis; Imbalanced data; Classification; DATA-SETS; SMOTE;

D O I：

暂无

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

When confronted with imbalanced datasets, traditional classifiers frequently struggle to correctly categorize samples from the minority class, adversely impacting the overall predictive performance of machine learning models. Current oversampling techniques generally focus on data interpolation through neighbor selection, often neglecting to uncover underlying data structures and relationships. This study introduces a novel application for RuLer, an algorithm originally developed for identifying sound patterns in the artistic domain of live coding. When adapted for data oversampling (as Ad-RuLer), the algorithm shows significant promise in addressing the challenges associated with imbalanced class distribution. We undertake a thorough comparative evaluation of Ad-RuLer against established oversampling algorithms such as SMOTE, ADASYN, Tomek-links, Borderline-SMOTE, and KmeansSMOTE. The evaluation employs various classifiers including logistic regression, random forest, and XGBoost, and is conducted over six real-world biomedical datasets with varying degrees of imbalance.

引用

页码：208 / 212

页数：5

共 50 条

[21] Hyperspectral Image Classification with Imbalanced Data Based on Oversampling and Convolutional Neural Network
Cai, Lei
Zhang, Geng
AI IN OPTICS AND PHOTONICS (AOPC 2019), 2019, 11342
[22] A novel oversampling and feature selection hybrid algorithm for imbalanced data classification
Fang Feng
Kuan-Ching Li
Erfu Yang
Qingguo Zhou
Lihong Han
Amir Hussain
Mingjiang Cai
Multimedia Tools and Applications, 2023, 82 : 3231 - 3267
[23] A quantum-based oversampling method for classification of highly imbalanced and overlapped data
Yang, Bei
Tian, Guilan
Luttrell, Joseph
Gong, Ping
Zhang, Chaoyang
EXPERIMENTAL BIOLOGY AND MEDICINE, 2023, 248 (24) : 2500 - 2513
[24] Distance-based arranging oversampling technique for imbalanced data
Dai, Qi
Liu, Jian-wei
Zhao, Jia-Liang
NEURAL COMPUTING & APPLICATIONS, 2023, 35 (02) : 1323 - 1342
[25] An oversampling framework for imbalanced classification based on Laplacian eigenmaps
Ye, Xiucai
Li, Hongmin
Imakura, Akira
Sakurai, Tetsuya
NEUROCOMPUTING, 2020, 399 : 107 - 116
[26] Multi-oversampling with Evidence Fusion for Imbalanced Data Classification
Tian, Hongpeng
Zhang, Zuowei
Liu, Zhunga
Zuo, Jingwei
BELIEF FUNCTIONS: THEORY AND APPLICATIONS, BELIEF 2024, 2024, 14909 : 68 - 77
[27] Distance-based arranging oversampling technique for imbalanced data
Qi Dai
Jian-wei Liu
Jia-Liang Zhao
Neural Computing and Applications, 2023, 35 : 1323 - 1342
[28] A non-parameter oversampling approach for imbalanced data classification based on hybrid natural neighbors
Lin, Junyue
Liang, Lu
APPLIED INTELLIGENCE, 2025, 55 (05)
[29] Improving interpolation-based oversampling for imbalanced data learning
Zhu, Tuanfei
Lin, Yaping
Liu, Yonghe
KNOWLEDGE-BASED SYSTEMS, 2020, 187
[30] Grouping-based Oversampling in Kernel Space for Imbalanced Data Classification
Ren, Jinjun
Wang, Yuping
Cheung, Yiu-ming
Gao, Xiao-Zhi
Guo, Xiaofang
PATTERN RECOGNITION, 2023, 133

← 1 2 3 4 5 →