共 50 条
Imbalance: Oversampling algorithms for imbalanced classification in R
被引:59
|作者:
Cordon, Ignacio
[1
]
Garcia, Salvador
[1
]
Fernandez, Alberto
[1
]
Herrera, Francisco
[1
]
机构:
[1] Univ Granada, DaSCI Andalusian Inst Data Sci & Computat Intelli, Granada, Spain
关键词:
Oversampling;
Imbalanced classification;
Machine learning;
Preprocessing;
SMOTE;
SOFTWARE;
SMOTE;
D O I:
10.1016/j.knosys.2018.07.035
中图分类号:
TP18 [人工智能理论];
学科分类号:
081104 ;
0812 ;
0835 ;
1405 ;
摘要:
Addressing imbalanced datasets in classification tasks is a relevant topic in research studies. The main reason is that for standard classification algorithms, the success rate when identifying minority class instances may be adversely affected. Among different solutions to cope with this problem, data level techniques have shown a robust behavior. In this paper, the novel imbalance package is introduced. Written in R and C++, and available at CRAN repository, this library includes recent relevant oversampling algorithms to improve the quality of data in imbalanced datasets, prior to performing a learning task. The main features of the package, as well as some illustrative examples of its use are detailed throughout this manuscript. (C) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:329 / 341
页数:13
相关论文