Multi-objective genetic fuzzy classifiers for imbalanced and cost-sensitive datasets

被引:0
作者
Pietro Ducange
Beatrice Lazzerini
Francesco Marcelloni
机构
[1] University of Pisa,Dipartimento di Ingegneria dell’Informazione, Elettronica, Informatica, Telecomunicazioni
来源
Soft Computing | 2010年 / 14卷
关键词
Genetic fuzzy rule-based classifiers; Multi-objective evolutionary algorithms; Imbalanced datasets; ROC curves; Convex hull method;
D O I
暂无
中图分类号
学科分类号
摘要
We exploit an evolutionary three-objective optimization algorithm to produce a Pareto front approximation composed of fuzzy rule-based classifiers (FRBCs) with different trade-offs between accuracy (expressed in terms of sensitivity and specificity) and complexity (computed as sum of the conditions in the antecedents of the classifier rules). Then, we use the ROC convex hull method to select the potentially optimal classifiers in the projection of the Pareto front approximation onto the ROC plane. Our method was tested on 13 highly imbalanced datasets and compared with 2 two-objective evolutionary approaches and one heuristic approach to FRBC generation, and with three well-known classifiers. We show by the Wilcoxon signed-rank test that our three-objective optimization approach outperforms all the other techniques, except for one classifier, in terms of the area under the ROC convex hull, an accuracy measure used to globally compare different classification approaches. Further, all the FRBCs in the ROC convex hull are characterized by a low value of complexity. Finally, we discuss how, the misclassification costs and the class distributions are fixed, we can select the most suitable classifier for the specific application. We show that the FRBC selected from the convex hull produced by our three-objective optimization approach achieves the lowest classification cost among the techniques used as comparison in two specific medical applications.
引用
收藏
页码:713 / 728
页数:15
相关论文
共 121 条
  • [1] Alcalá R(2007)A multi-objective genetic algorithm for tuning and rule selection to obtain accurate and compact linguistic fuzzy rule-based systems Int J Uncertain Fuzziness Knowl Based Syst 15 521-537
  • [2] Gacto MJ(2009)KEEL: a software tool to assess evolutionary algorithms to data mining problems Soft Comput 13 307-318
  • [3] Herrera F(1998)Optimization and FROC analysis of rule-based detection schemes using a multiobjective approach IEEE Trans Med Imaging 17 1089-1093
  • [4] Alcalá-Fdez J(2004)Pulmonary nodules at chest CT: effect of computer-aided diagnosis on radiologists’ detection performance Radiology 230 347-352
  • [5] Alcalá-Fdez J(2004)A study of the behaviour of several methods for balancing machine learning SIGKDD Explor 6 20-29
  • [6] Sánchez L(2001)Genetic feature selection in a fuzzy rule-based classification system learning process for high-dimensional problems Inf Sci 136 135-157
  • [7] García S(2005)Genetic tuning of fuzzy rule deep structures preserving interpretability and its interaction with fuzzy rule set reduction IEEE Trans Fuzzy Syst 13 13-29
  • [8] del Jesus MJ(2007)Special issue on genetic fuzzy systems and the Interpretability-Accuracy Trade-off Int J Approx Reason 44 1-3
  • [9] Ventura S(2004)Evolutionary design of a fuzzy classifier from data IEEE Trans Syst Man Cybern B 34 1894-1906
  • [10] Garrell JM(2002)Smote: synthetic minority over-sampling technique J Artif Intell Res 16 321-357