Evaluation of decision trees: a multi-criteria approach

被引:86
作者
Osei-Bryson, KM
机构
[1] Virginia Commonwealth Univ, Dept Informat Syst, Richmond, VA 23284 USA
[2] Virginia Commonwealth Univ, Informat Syst Res Inst, Sch Business, Richmond, VA 23284 USA
关键词
decision tree; evaluation; performance measures; multi-criteria decision analysis;
D O I
10.1016/S0305-0548(03)00156-4
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Data mining (DM) techniques are being increasingly used in many modem organizations to retrieve valuable knowledge structures from organizational databases, including data warehouses. An important knowledge structure that can result from data mining activities is the decision tree (DT) that is used for the classification of future events. The induction of the decision tree is done using a supervised knowledge discovery process in which prior knowledge regarding classes in the database is used to guide the discovery. The generation of a DT is a relatively easy task but in order to select the most appropriate DT it is necessary for the DM project team to generate and analyze a significant number of DTs based on multiple performance measures. We propose a multi-criteria decision analysis based process that would empower DM project teams to do thorough experimentation and analysis without being overwhelmed by the task of analyzing a significant number of DTs would offer a positive contribution to the DM process. We also offer some new approaches for measuring some of the performance criteria. (C) 2003 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1933 / 1945
页数:13
相关论文
共 15 条
[1]  
BERRY M, 2000, MASTERING MDATA MINI
[2]  
BOHANEC M, 1994, MACH LEARN, V15, P223, DOI 10.1023/A:1022685808937
[3]   The use of the area under the roc curve in the evaluation of machine learning algorithms [J].
Bradley, AP .
PATTERN RECOGNITION, 1997, 30 (07) :1145-1159
[4]   MODELING PAIRWISE COMPARISONS ON RATIO SCALES [J].
BRYSON, N ;
MOBOLURIN, A ;
NGWENYAMA, O .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 1995, 83 (03) :639-654
[5]   A GOAL PROGRAMMING METHOD FOR GENERATING PRIORITY VECTORS [J].
BRYSON, N .
JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 1995, 46 (05) :641-648
[6]  
BRYSON N, 2000, J MULTICRITERIA DECI, V9, P127
[7]   A comparative analysis of methods for pruning decision trees [J].
Esposito, F ;
Malerba, D ;
Semeraro, G .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1997, 19 (05) :476-491
[8]  
Garofalakis M., 2000, Proceedings. KDD-2000. Sixth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, P335, DOI 10.1145/347090.347163
[9]  
Gersten W., 2000, Proceedings. KDD-2000. Sixth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, P398, DOI 10.1145/347090.347174
[10]  
Han J., 2012, Data Mining, P393, DOI [DOI 10.1016/B978-0-12-381479-1.00009-5, 10.1016/B978-0-12-381479-1.00001-0]