An interpretable classification rule mining algorithm

被引:55
作者
Cano, Alberto [1 ]
Zafra, Amelia [1 ]
Ventura, Sebastian [1 ]
机构
[1] Univ Cordoba, Dept Comp Sci & Numer Anal, E-14071 Cordoba, Spain
关键词
Classification; Evolutionary programming; Interpretability; Rule mining; COEVOLUTIONARY ALGORITHM; STATISTICAL TECHNIQUES; PROGRAMMING ALGORITHM; MULTIPLE COMPARISONS; SOFTWARE TOOL; CLASSIFIERS; SELECTION; MODELS; OPTIMIZATION; PERFORMANCE;
D O I
10.1016/j.ins.2013.03.038
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Obtaining comprehensible classifiers may be as important as achieving high accuracy in many real-life applications such as knowledge discovery tools and decision support systems. This paper introduces an efficient Evolutionary Programming algorithm for solving classification problems by means of very interpretable and comprehensible IF-THEN classification rules. This algorithm, called the Interpretable Classification Rule Mining (ICRM) algorithm, is designed to maximize the comprehensibility of the classifier by minimizing the number of rules and the number of conditions. The evolutionary process is conducted to construct classification rules using only relevant attributes, avoiding noisy and redundant data information. The algorithm is evaluated and compared to nine other well-known classification techniques in 35 varied application domains. Experimental results are validated using several non-parametric statistical tests applied on multiple classification and interpretability metrics. The experiments show that the proposal obtains good results, improving significantly the interpretability measures over the rest of the algorithms, while achieving competitive accuracy. This is a significant advantage over other algorithms as it allows to obtain an accurate and very comprehensible classifier quickly. (C) 2013 Elsevier Inc. All rights reserved.
引用
收藏
页码:1 / 20
页数:20
相关论文
共 66 条
[11]   Bloat control and generalization pressure using the minimum description length principle for a Pittsburgh approach learning classifier system [J].
Bacardit, Jaume ;
Garrell, Josep Maria .
LEARNING CLASSIFIER SYSTEMS, 2007, 4399 :59-79
[12]   Performance and Efficiency of Memetic Pittsburgh Learning Classifier Systems [J].
Bacardit, Jaume ;
Krasnogor, Natalio .
EVOLUTIONARY COMPUTATION, 2009, 17 (03) :307-342
[13]   GP-COACH: Genetic Programming-based learning of COmpact and ACcurate fuzzy rule-based classification systems for High-dimensional problems [J].
Berlanga, F. J. ;
Rivera, A. J. ;
del Jesus, M. J. ;
Herrera, F. .
INFORMATION SCIENCES, 2010, 180 (08) :1183-1200
[14]   Accuracy-based Learning Classifier Systems:: Models, analysis and applications to classification tasks [J].
Bernadó-Mansilla, E ;
Garrell-Guiu, JM .
EVOLUTIONARY COMPUTATION, 2003, 11 (03) :209-238
[15]   An evolutionary programming algorithm for survivable routing and wavelength assignment in transparent optical networks [J].
Bhanja, Urmila ;
Mahapatra, Sudipta ;
Roy, Rajarshi .
INFORMATION SCIENCES, 2013, 222 :634-647
[16]  
Bull L, 2008, STUD COMPUT INTELL, V125, P1, DOI 10.1007/978-3-540-78979-6
[17]   Toward a theory of generalization and learning in XCS [J].
Butz, MV ;
Kovacs, T ;
Lanzi, PL ;
Wilson, SW .
IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2004, 8 (01) :28-46
[18]   Kernel methods: a survey of current techniques [J].
Campbell, C .
NEUROCOMPUTING, 2002, 48 :63-84
[19]  
Cano A., 2011, Proceedings of the 2011 11th International Conference on Intelligent Systems Design and Applications (ISDA), P325, DOI 10.1109/ISDA.2011.6121676
[20]   Evolutionary stratified training set selection for extracting classification rules with trade off precision-interpretability [J].
Cano, Jose Ramon ;
Herrera, Francisco ;
Lozano, Manuel .
DATA & KNOWLEDGE ENGINEERING, 2007, 60 (01) :90-108