Imbalance: Oversampling algorithms for imbalanced classification in R

被引:59
|
作者
Cordon, Ignacio [1 ]
Garcia, Salvador [1 ]
Fernandez, Alberto [1 ]
Herrera, Francisco [1 ]
机构
[1] Univ Granada, DaSCI Andalusian Inst Data Sci & Computat Intelli, Granada, Spain
关键词
Oversampling; Imbalanced classification; Machine learning; Preprocessing; SMOTE; SOFTWARE; SMOTE;
D O I
10.1016/j.knosys.2018.07.035
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Addressing imbalanced datasets in classification tasks is a relevant topic in research studies. The main reason is that for standard classification algorithms, the success rate when identifying minority class instances may be adversely affected. Among different solutions to cope with this problem, data level techniques have shown a robust behavior. In this paper, the novel imbalance package is introduced. Written in R and C++, and available at CRAN repository, this library includes recent relevant oversampling algorithms to improve the quality of data in imbalanced datasets, prior to performing a learning task. The main features of the package, as well as some illustrative examples of its use are detailed throughout this manuscript. (C) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:329 / 341
页数:13
相关论文
共 50 条
  • [41] Weighted oversampling algorithms for imbalanced problems and application in prediction of streamflow
    Zhou, Hao
    Dong, Xianyong
    Xia, Shuyin
    Wang, Guoyin
    KNOWLEDGE-BASED SYSTEMS, 2021, 229
  • [42] A cross-validation framework to find a better state than the balanced one for oversampling in imbalanced classification
    Dai, Qizhu
    Li, Donggen
    Xia, Shuyin
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2023, 14 (08) : 2877 - 2886
  • [43] Hyperspectral Image Classification with Imbalanced Data Based on Oversampling and Convolutional Neural Network
    Cai, Lei
    Zhang, Geng
    AI IN OPTICS AND PHOTONICS (AOPC 2019), 2019, 11342
  • [44] A Novel Adaptive Minority Oversampling Technique for Improved Classification in Data Imbalanced Scenarios
    Tripathi, Ayush
    Chakraborty, Rupayan
    Kopparapu, Sunil Kumar
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 10650 - 10657
  • [45] A quantum-based oversampling method for classification of highly imbalanced and overlapped data
    Yang, Bei
    Tian, Guilan
    Luttrell, Joseph
    Gong, Ping
    Zhang, Chaoyang
    EXPERIMENTAL BIOLOGY AND MEDICINE, 2023, 248 (24) : 2500 - 2513
  • [46] Subspace-based minority oversampling for imbalance classification
    Li, Tianjun
    Wang, Yingxu
    Liu, Licheng
    Chen, Long
    Chen, C. L. Philip
    INFORMATION SCIENCES, 2023, 621 : 371 - 388
  • [47] IA-SUWO: An Improving Adaptive semi-unsupervised weighted oversampling for imbalanced classification problems
    Wei Jianan
    Huang Haisong
    Yao Liguo
    Hu Yao
    Fan Qingsong
    Huang Dong
    KNOWLEDGE-BASED SYSTEMS, 2020, 203
  • [48] Stop Oversampling for Class Imbalance Learning: A Review
    Tarawneh, Ahmad S.
    Hassanat, Ahmad B.
    Altarawneh, Ghada Awad
    Almuhaimeed, Abdullah
    IEEE ACCESS, 2022, 10 : 47643 - 47660
  • [49] Evidence-based adaptive oversampling algorithm for imbalanced classification
    Chen-ju Lin
    Florence Leony
    Knowledge and Information Systems, 2024, 66 : 2209 - 2233
  • [50] Radial-Based Approach to Imbalanced Data Oversampling
    Koziarski, Michal
    Krawczyk, Bartosz
    Wozniak, Michal
    HYBRID ARTIFICIAL INTELLIGENT SYSTEMS, HAIS 2017, 2017, 10334 : 318 - 327