Binarization With Boosting and Oversampling for Multiclass Classification

被引:34
作者
Sen, Ayon [1 ]
Islam, Md. Monirul [2 ]
Murase, Kazuyuki [3 ]
Yao, Xin [4 ]
机构
[1] Univ Wisconsin, Dept Comp Sci, 1210 W Dayton St, Madison, WI 53706 USA
[2] Bangladesh Univ Engn & Technol, Dept Comp Sci & Engn, Dhaka 1000, Bangladesh
[3] Univ Fukui, Dept Human & Artificial Intelligence Syst, Fukui 9108507, Japan
[4] Univ Birmingham, Sch Comp Sci, Birmingham B15 2TT, W Midlands, England
关键词
Binarization; boosting; multiclass classification; oversampling; STRATEGY;
D O I
10.1109/TCYB.2015.2423295
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Using a set of binary classifiers to solve multiclass classification problems has been a popular approach over the years. The decision boundaries learnt by binary classifiers (also called base classifiers) are much simpler than those learnt by multiclass classifiers. This paper proposes a new classification framework, termed binarization with boosting and oversampling (BBO), for efficiently solving multiclass classification problems. The new framework is devised based on the one-versus-all (OVA) binarization technique. Unlike most previous work, BBO employs boosting for solving the hard-to-learn instances and oversampling for handling the class-imbalance problem arising due to OVA binarization. These two features make BBO different from other existing works. Our new framework has been tested extensively on several multiclass supervised and semi-supervised classification problems using five different base classifiers, including neural networks, C4.5, k-nearest neighbor, repeated incremental pruning to produce error reduction, support vector machine, random forest, and learning with local and global consistency. Experimental results show that BBO can exhibit better performance compared to its counterparts on supervised and semi-supervised classification problems.
引用
收藏
页码:1078 / 1091
页数:14
相关论文
共 48 条
[11]  
Chapelle O., 2006, SEMISUPERVISED LEARN
[12]   SMOTE: Synthetic minority over-sampling technique [J].
Chawla, Nitesh V. ;
Bowyer, Kevin W. ;
Hall, Lawrence O. ;
Kegelmeyer, W. Philip .
2002, American Association for Artificial Intelligence (16)
[13]  
Cohen W. W., 1995, Machine Learning. Proceedings of the Twelfth International Conference on Machine Learning, P115
[14]  
Cutzu F, 2003, LECT NOTES COMPUT SC, V2709, P115
[15]  
Delachaux B, 2013, LECT NOTES COMPUT SC, V7903, P216
[16]   Solving multi-class problems with linguistic fuzzy rule based classification systems based on pairwise learning and preference relations [J].
Fernandez, Alberto ;
Calderon, Maria ;
Barrenechea, Edurne ;
Bustince, Humberto ;
Herrera, Francisco .
FUZZY SETS AND SYSTEMS, 2010, 161 (23) :3064-3080
[17]  
Furnkranz J., 2003, Intelligent Data Analysis, V7, P385
[18]  
Fürnkranz J, 2003, LECT NOTES ARTIF INT, V2837, P145
[19]   Round robin classification [J].
Fürnkranz, J .
JOURNAL OF MACHINE LEARNING RESEARCH, 2002, 2 (04) :721-747
[20]   An overview of ensemble methods for binary classifiers in multi-class problems: Experimental study on one-vs-one and one-vs-all schemes [J].
Galar, Mikel ;
Fernandez, Alberto ;
Barrenechea, Edurne ;
Bustince, Humberto ;
Herrera, Francisco .
PATTERN RECOGNITION, 2011, 44 (08) :1761-1776