Classification with decision trees from a nonparametric predictive inference perspective

被引:24
作者
Abellan, Joaquin [1 ]
Baker, Rebecca M. [2 ]
Coolen, Frank P. A. [2 ]
Crossman, Richard J. [3 ]
Masegosa, Andres R. [1 ]
机构
[1] Univ Granada, Dept Comp Sci & Artificial Intelligence, Granada, Spain
[2] Univ Durham, Dept Math Sci, Durham, England
[3] Univ Warwick, Warwick Med Sch, Coventry CV4 7AL, W Midlands, England
关键词
Imprecise probabilities; Imprecise Dirichlet model; Nonparametric predictive inference model; Uncertainty measures; Supervised classification; Decision trees; IMPRECISE DIRICHLET MODEL; TOTAL UNCERTAINTY; STATISTICAL COMPARISONS; CLASSIFIERS; ENTROPY; SETS;
D O I
10.1016/j.csda.2013.02.009
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
An application of nonparametric predictive inference for multinomial data (NPI) to classification tasks is presented. This model is applied to an established procedure for building classification trees using imprecise probabilities and uncertainty measures, thus far used only with the imprecise Dirichlet model (IDM), that is defined through the use of a parameter expressing previous knowledge. The accuracy of that procedure of classification has a significant dependence on the value of the parameter used when the IDM is applied. A detailed study involving 40 data sets shows that the procedure using the NPI model (which has no parameter dependence) obtains a better trade-off between accuracy and size of tree than does the procedure when the IDM is used, whatever the choice of parameter. In a bias-variance study of the errors, it is proved that the procedure with the NPI model has a lower variance than the one with the IDM, implying a lower level of over-fitting. (C) 2013 Elsevier B.V. All rights reserved.
引用
收藏
页码:789 / 802
页数:14
相关论文
共 47 条
[2]   An algorithm to compute the upper entropy for order-2 capacities [J].
Abellán, J ;
Moral, S .
INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2006, 14 (02) :141-154
[3]   Disaggregated total uncertainty measure for credal sets [J].
Abellán, J ;
Klir, GJ ;
Moral, S .
INTERNATIONAL JOURNAL OF GENERAL SYSTEMS, 2006, 35 (01) :29-44
[4]   Upper entropy of credal sets.: Applications to credal classification [J].
Abellán, J ;
Moral, S .
INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2005, 39 (2-3) :235-255
[5]   Maximum of entropy for credal sets [J].
Abellan, J ;
Moral, S .
INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2003, 11 (05) :587-597
[6]   Building classification trees using the total uncertainty criterion [J].
Abellán, J ;
Moral, S .
INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2003, 18 (12) :1215-1225
[7]   IMPRECISE CLASSIFICATION WITH CREDAL DECISION TREES [J].
Abellan, Joaquin ;
Masegosa, Andres R. .
INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2012, 20 (05) :763-787
[8]   Bagging schemes on the presence of class noise in classification [J].
Abellan, Joaquin ;
Masegosa, Andres R. .
EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (08) :6827-6837
[9]   Maximising entropy on the nonparametric predictive inference model for multinomial data [J].
Abellan, Joaquin ;
Baker, Rebecca M. ;
Coolen, Frank P. A. .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2011, 212 (01) :112-122
[10]   An ensemble method using credal decision trees [J].
Abellan, Joaquin ;
Masegosa, Andres R. .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2010, 205 (01) :218-226