An experimental study on evolutionary fuzzy classifiers designed for managing imbalanced datasets

被引:20
作者
Antonelli, Michela [1 ]
Ducange, Pietro [1 ]
Marcelloni, Francesco [1 ]
机构
[1] Univ Pisa, Dipartimento Ingn Informaz, I-56122 Pisa, Italy
关键词
Genetic and evolutionary fuzzy systems; Fuzzy rule-based classifiers; Imbalanced datasets; CLASSIFICATION SYSTEMS; FEATURE-SELECTION; ALGORITHM; ENSEMBLES; PROPOSAL; TRENDS; SMOTE;
D O I
10.1016/j.neucom.2014.04.070
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we show an experimental study on a set of evolutionary fuzzy classifiers (EFCs) purposely designed to manage imbalanced datasets. Three of these EFCs represent the state-of-the-art of the main approaches to the evolutionary generation of fuzzy rule-based systems for imbalanced dataset classification. The fourth EFC is an extension of a multi-objective evolutionary learning (MOEL) scheme we have recently proposed for managing imbalanced datasets: the rule base and the membership function parameters of a set of FRBCs are concurrently learned by optimizing the sensitivity, the specificity and the complexity. By using non-parametric tests, we first compare the results obtained by the four EFCs in terms of area under the ROC curve. We show that our MOEL scheme outperforms two of the comparison algorithms and results to be statistically equivalent to the third. Further, the classifiers generated by our MOEL scheme are characterized by a lower number of rules than the ones generated by the other approaches. To validate the effectiveness of our MOEL scheme in dealing with imbalanced datasets, we also compare our results with the ones achieved, after rebalancing the datasets, by two state-of-the-art algorithms, namely FURIA and FARC-HD, proposed for generating fuzzy rule-based classifiers for balanced datasets. We show that our MOEL scheme is statistically equivalent to FURIA, which is associated with the highest accuracy rank in the statistical tests. However, the rule bases generated by FURIA are characterized by a low interpretability. Finally, we show that the results achieved by our MOEL scheme are statistically equivalent to the ones achieved by four state-of-the-art approaches, based on ensembles of non-fuzzy classifiers, appropriately designed for dealing with imbalanced datasets. (C) 2014 Published by Elsevier B.V.
引用
收藏
页码:125 / 136
页数:12
相关论文
共 72 条
[1]   A proposal for the genetic lateral tuning of linguistic fuzzy systems and its interaction with rule selection [J].
Alcala, Rafael ;
Alcala-Fdez, Jesus ;
Herrera, Francisco .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2007, 15 (04) :616-635
[2]  
Alcalá-Fdez J, 2011, J MULT-VALUED LOG S, V17, P255
[3]   A Fuzzy Association Rule-Based Classification Model for High-Dimensional Problems With Genetic Rule Selection and Lateral Tuning [J].
Alcala-Fdez, Jesus ;
Alcala, Rafael ;
Herrera, Francisco .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2011, 19 (05) :857-872
[4]  
[Anonymous], 1996, SCI PSYCH S
[5]  
[Anonymous], ADV FUZZY SYSTEMS
[6]  
[Anonymous], 2012, IEEE T SYST MAN CY C, DOI DOI 10.1109/TSMCC.2011.2161285
[7]  
[Anonymous], PATTERN RECOGN LETT
[8]   Genetic Training Instance Selection in Multiobjective Evolutionary Fuzzy Systems: A Coevolutionary Approach [J].
Antonelli, Michela ;
Ducange, Pietro ;
Marcelloni, Francesco .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2012, 20 (02) :276-290
[9]   Learning concurrently data and rule bases of Mamdani fuzzy rule-based systems by exploiting a novel interpretability index [J].
Antonelli, Michela ;
Ducange, Pietro ;
Lazzerini, Beatrice ;
Marcelloni, Francesco .
SOFT COMPUTING, 2011, 15 (10) :1981-1998
[10]   Computer-aided detection of lung nodules based on decision fusion techniques [J].
Antonelli, Michela ;
Cococcioni, Marco ;
Lazzerini, Beatrice ;
Marcelloni, Francesco .
PATTERN ANALYSIS AND APPLICATIONS, 2011, 14 (03) :295-310