Dynamic selection and combination of one-class classifiers for multi-class classification

Cited by: 6
Authors
Fragoso, Rogerio C. P. [1]
Cavalcanti, George D. C. [1]
Pinheiro, Roberto H. W. [2]
Oliveira, Luiz S. [3]
Affiliations
[1] Univ Fed Pernambuco, Ctr Informat, Recife, PE, Brazil
[2] Univ Fed Cariri, Juazeiro Do Norte, CE, Brazil
[3] Univ Fed Parana, Dept Informat, Curitiba, PR, Brazil
Keywords
One-class classification; One-class decomposition; Multiple classifier system; Dynamic ensemble selection; DATA SET; NUMBER; CLUSTERS; CRITERION; ENSEMBLES; SUPPORT
DOI
10.1016/j.knosys.2021.107290
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
A natural way to tackle multi-class problems is to employ multi-class classifiers. However, in specific situations, such as imbalanced data or a large number of classes, it is more effective to decompose the multi-class problem into several easier-to-solve problems. One-class decomposition is one such alternative, in which one-class classifiers (OCCs) are trained for each class separately. However, fitting the data optimally is a challenge for OCCs, especially when the data present a complex intra-class distribution. The literature shows that multiple classifier systems are inherently robust in such cases. Thus, adopting multiple OCCs for each class can improve one-class decomposition. With that in mind, in this work we introduce One-class Classifier Dynamic Ensemble Selection for Multi-class problems (MODES, for short), a method that provides competent classifiers for each region of the feature space by decomposing the original multi-class problem into multiple one-class problems. Each class is segmented using a set of cluster validity indices, and an OCC is trained for each cluster. The rationale is to reduce the complexity of the classification task by defining a region of the feature space in which each classifier is supposed to be an expert. A test example is classified by dynamically selecting an ensemble of competent OCCs, and the final decision is given by reconstructing the original multi-class problem. Experiments carried out with 25 databases, 4 OCC models, and 3 aggregation methods showed that the proposed architecture outperforms methods from the literature. When compared with the state of the art, MODES obtained better results, especially for databases with complex decision regions. © 2021 Elsevier B.V. All rights reserved.
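The pipeline described in the abstract (segment each class into clusters, train one OCC per cluster, dynamically select competent OCCs for a test example, then aggregate back to a multi-class decision) can be illustrated with a minimal sketch. This is not the authors' implementation, and every name in it (`kmeans`, `CentroidOCC`, `fit`, `predict`, `n_select`) is hypothetical. It rests on several simplifying assumptions: a fixed number of clusters per class stands in for the selection by cluster validity indices, a toy centroid-distance scorer stands in for the OCC models evaluated in the paper, competence is approximated by distance to the cluster centroid, and aggregation uses the maximum score per class.

```python
import math
import random


def kmeans(points, k, iters=20, seed=0):
    """Plain Lloyd's k-means; returns the non-empty clusters as lists of points."""
    rng = random.Random(seed)
    centers = rng.sample(points, k)
    clusters = [list(points)]
    for _ in range(iters):
        clusters = [[] for _ in centers]
        for p in points:
            j = min(range(len(centers)), key=lambda i: math.dist(p, centers[i]))
            clusters[j].append(p)
        # Recompute each centroid; an empty cluster keeps its previous center.
        centers = [
            tuple(sum(c) / len(cl) for c in zip(*cl)) if cl else centers[i]
            for i, cl in enumerate(clusters)
        ]
    return [cl for cl in clusters if cl]


class CentroidOCC:
    """Toy one-class model (an assumption, not one of the paper's OCC models)."""

    def __init__(self, label, cluster):
        self.label = label
        self.center = tuple(sum(c) / len(cluster) for c in zip(*cluster))
        mean_dist = sum(math.dist(p, self.center) for p in cluster) / len(cluster)
        self.scale = max(mean_dist, 1e-9)  # floor avoids division by zero

    def score(self, x):
        # Score in (0, 1]; it decays as x moves away from the cluster centroid.
        return math.exp(-math.dist(x, self.center) / self.scale)


def fit(X, y, k=2):
    """Train one OCC per cluster of each class (fixed k is an assumption)."""
    pool = []
    for label in sorted(set(y)):
        pts = [x for x, t in zip(X, y) if t == label]
        for cluster in kmeans(pts, min(k, len(pts))):
            pool.append(CentroidOCC(label, cluster))
    return pool


def predict(pool, x, n_select=3):
    """Select the n_select OCCs nearest to x, then take the best score per class."""
    selected = sorted(pool, key=lambda m: math.dist(x, m.center))[:n_select]
    best = {}
    for m in selected:
        best[m.label] = max(best.get(m.label, 0.0), m.score(x))
    return max(best, key=best.get)


# Two classes, each made of two well-separated groups (an XOR-like layout).
X = [(0, 0), (0, 1), (1, 0), (1, 1), (10, 10), (10, 11), (11, 10), (11, 11),
     (0, 10), (0, 11), (1, 10), (1, 11), (10, 0), (10, 1), (11, 0), (11, 1)]
y = ["a"] * 8 + ["b"] * 8
pool = fit(X, y, k=2)
print(predict(pool, (0.5, 0.5)), predict(pool, (10.5, 0.5)))
```

The XOR-like toy data mirrors the motivation given in the abstract: a single model per class would have to cover two distant regions, whereas one expert per cluster only has to describe a compact region of the feature space.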
Pages: 14