Combining One-vs-One Decomposition and Ensemble Learning for Multi-class Imbalanced Data

Cited by: 6
Authors
Krawczyk, Bartosz [1]
Affiliations
[1] Wroclaw Univ Technol, Dept Syst & Comp Networks, Wroclaw, Poland
Source
PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON COMPUTER RECOGNITION SYSTEMS, CORES 2015 | 2016 / Vol. 403
Keywords
Ensemble classifiers; Imbalanced data; Multi-class classification; Pairwise learning; Binary decomposition; CLASSIFICATION; CLASSIFIERS;
DOI
10.1007/978-3-319-26227-7_3
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Learning from imbalanced data poses significant challenges for machine learning algorithms, as they need to deal with an uneven distribution of examples in the training set. As standard classifiers will be biased toward the majority class, there exists a need for specific methods that can overcome this single-class dominance. Most works have concentrated on binary problems, where the majority and minority classes can be distinguished. A more challenging problem arises when imbalance is present within multi-class datasets, as relations between classes tend to become more complicated: one class can be a minority class relative to some classes while being a majority class relative to others. In this paper, we propose an efficient method for handling such scenarios that combines problem decomposition with ensemble learning. Following the divide-and-conquer rule, we decompose the multi-class data into a number of binary subproblems using the one-versus-one approach. To each simplified task we delegate an ensemble of classifiers dedicated to binary imbalanced problems. Then, using a dedicated classifier fusion approach, we reconstruct the original multi-class problem. Experimental analysis backed up with statistical testing clearly shows that such an approach is superior to the state-of-the-art ad hoc and decomposition methods used in the literature.
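The pipeline outlined in the abstract (one-versus-one decomposition, one imbalance-aware ensemble per class pair, then fusion of the pairwise decisions) can be illustrated with a minimal sketch. The abstract does not name the ensemble or the fusion rule the paper uses, so everything concrete here is an assumption made for illustration: each pairwise ensemble is under-sampling bagging over nearest-centroid models, fusion is plain majority voting, and all function names and the toy data are hypothetical.

```python
import random
from collections import Counter
from itertools import combinations


def centroid(rows):
    """Mean vector of a list of equal-length feature tuples."""
    n = len(rows)
    return tuple(sum(col) / n for col in zip(*rows))


def sq_dist(a, b):
    """Squared Euclidean distance between two points."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def train_pair_ensemble(rows_a, rows_b, label_a, label_b, n_members, rng):
    """Under-sampling bagging for one binary subproblem (assumed technique).

    Every member is fitted on a balanced sample: both classes are cut down
    to the size of the smaller one before the centroids are computed.
    """
    k = min(len(rows_a), len(rows_b))
    members = []
    for _ in range(n_members):
        sample_a = rng.sample(rows_a, k)
        sample_b = rng.sample(rows_b, k)
        members.append(((label_a, centroid(sample_a)),
                        (label_b, centroid(sample_b))))
    return members


def predict_pair(members, x):
    """Majority vote of the ensemble members for one class pair."""
    votes = Counter()
    for (label_a, c_a), (label_b, c_b) in members:
        votes[label_a if sq_dist(x, c_a) <= sq_dist(x, c_b) else label_b] += 1
    return votes.most_common(1)[0][0]


def fit_ovo(X, y, n_members=5, seed=0):
    """One-versus-one decomposition: one ensemble per unordered class pair."""
    rng = random.Random(seed)
    by_class = {}
    for row, label in zip(X, y):
        by_class.setdefault(label, []).append(tuple(row))
    return {
        (a, b): train_pair_ensemble(by_class[a], by_class[b], a, b,
                                    n_members, rng)
        for a, b in combinations(sorted(by_class), 2)
    }


def predict_ovo(model, x):
    """Fusion step (assumed): each pairwise ensemble casts one vote.

    Ties between classes are broken by the order in which votes arrive.
    """
    votes = Counter(predict_pair(members, x) for members in model.values())
    return votes.most_common(1)[0][0]


def make_toy_data(seed=1):
    """Hypothetical imbalanced 3-class data: 40, 8 and 3 examples."""
    rng = random.Random(seed)
    spec = {"a": (0.0, 0.0, 40), "b": (5.0, 5.0, 8), "c": (0.0, 6.0, 3)}
    X, y = [], []
    for label, (cx, cy, n) in spec.items():
        for _ in range(n):
            X.append((rng.gauss(cx, 0.3), rng.gauss(cy, 0.3)))
            y.append(label)
    return X, y


if __name__ == "__main__":
    X, y = make_toy_data()
    model = fit_ovo(X, y)
    for point in [(0.1, -0.2), (5.2, 4.9), (-0.1, 6.1)]:
        print(point, "->", predict_ovo(model, point))
```

With K classes the decomposition trains K(K-1)/2 pairwise ensembles, and each one only has to cope with the imbalance between its own two classes. That is why a class that is a minority in one pair and a majority in another needs no special treatment at the multi-class level.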
Pages: 27-36
Number of pages: 10