Empowering difficult classes with a similarity-based aggregation in multi-class classification problems

被引:35
作者
Galar, Mikel [1 ]
Fernandez, Alberto [2 ]
Barrenechea, Edurne [1 ]
Herrera, Francisco [3 ,4 ]
机构
[1] Univ Publ Navarra, Dept Automat & Computac, Pamplona 31006, Spain
[2] Univ Jaen, Dept Comp Sci, Jaen 23071, Spain
[3] Univ Granada, Dept Comp Sci & Artificial Intelligence, E-18071 Granada, Spain
[4] King Abdulaziz Univ, Fac Comp & Informat Technol North Jeddah, Jeddah 21589, Saudi Arabia
关键词
Multi-class classification; Pairwise learning; One-vs-One; Decomposition strategies; Tuning; Difficult classes; RESTRICTED EQUIVALENCE FUNCTIONS; STATISTICAL COMPARISONS; SOFTWARE TOOL; ALGORITHMS; CLASSIFIERS; STRATEGY; KEEL;
D O I
10.1016/j.ins.2013.12.053
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
One-vs-One strategy divides the original multi-class problem into as many binary classification problems as pairs of classes. Then, independent base classifiers are learned to face each problem, whose outputs are combined to predict a single class label. This way, the accuracy of the baseline classifiers without decomposition is usually enhanced, aside from enabling the usage of binary classifiers, i.e., Support Vector Machines, to solve multi-class problems. This paper analyzes the fact that existing aggregations favor easily recognizable classes; hence, the accuracy enhancement mainly comes from the higher correct classification rates over these classes. Using other evaluation criteria, the significant improvements of One-vs-One are diminished, showing a weakness due to the presence of difficult classes. Difficult classes can be defined as those obtaining a lower correct classification rate than that obtained by the other classes in the problem. After studying the problem of difficult classes in this framework and aiming to empower these classes, a novel similarity-based aggregation is presented, which generalizes the well-known weighted voting. The experimental analysis shows that the new methodology is able to increase the recognition of difficult classes, obtaining a more balanced performance over all classes, which is a desirable behavior. The methodology is tested within several Machine Learning paradigms and is compared with the state-of-the-art on aggregations for One-vs-One strategy. The results are contrasted by the proper statistical tests, as suggested in the literature. (C) 2014 Elsevier Inc. All rights reserved.
引用
收藏
页码:135 / 157
页数:23
相关论文
共 65 条
[1]  
AHA DW, 1991, MACH LEARN, V6, P37, DOI 10.1007/BF00153759
[2]   KEEL: a software tool to assess evolutionary algorithms for data mining problems [J].
Alcala-Fdez, J. ;
Sanchez, L. ;
Garcia, S. ;
del Jesus, M. J. ;
Ventura, S. ;
Garrell, J. M. ;
Otero, J. ;
Romero, C. ;
Bacardit, J. ;
Rivas, V. M. ;
Fernandez, J. C. ;
Herrera, F. .
SOFT COMPUTING, 2009, 13 (03) :307-318
[3]  
Alcalá-Fdez J, 2011, J MULT-VALUED LOG S, V17, P255
[4]   Reducing multiclass to binary: A unifying approach for margin classifiers [J].
Allwein, EL ;
Schapire, RE ;
Singer, Y .
JOURNAL OF MACHINE LEARNING RESEARCH, 2001, 1 (02) :113-141
[5]  
[Anonymous], FDN GENETIC ALGORITH
[6]  
[Anonymous], 2008, AGGREGATION FUNCTION
[7]  
[Anonymous], 2006, DATA COMPLEXITY PATT
[8]  
[Anonymous], 2014, Evaluating Learning Algorithms A Classification Perspective, DOI DOI 10.1017/CBO9780511921803
[9]  
[Anonymous], 2010, Advances in Neural Information Processing Systems
[10]  
[Anonymous], 2007, Uci machine learning repository