Using artificial neural networks to enhance CART

被引:8
作者
Young, William A., II [1 ]
Weckman, Gary R. [2 ]
Hari, Vijaya [2 ]
Whiting, Harry S., II [2 ]
Snow, Andrew P. [3 ]
机构
[1] Ohio Univ, Coll Business, Dept Management Syst, Athens, OH 45701 USA
[2] Ohio Univ, Russ Coll Engn & Technol, Dept Ind & Syst Engn, Stocker Ctr, Athens, OH 45701 USA
[3] Ohio Univ, JW McClure Sch Informat & Telecommun Syst, Athens, OH 45701 USA
关键词
Decision trees; Classification and regression trees; CART; Artificial neural networks; Knowledge extraction; DECISION TREE; REGRESSION TREES; CLASSIFICATION; SYSTEM; PERFORMANCE; INDUCTION;
D O I
10.1007/s00521-012-0887-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Accuracy is a critical factor in predictive modeling. A predictive model such as a decision tree must be accurate to draw conclusions about the system being modeled. This research aims at analyzing and improving the performance of classification and regression trees (CART), a decision tree algorithm, by evaluating and deriving a new methodology based on the performance of real-world data sets that were studied. This paper introduces a new approach to tree induction to improve the efficiency of the CART algorithm by combining the existing functionality of CART with the addition of artificial neural networks (ANNs). Trained ANNs are utilized by the tree induction algorithm by generating new, synthetic data, which have been shown to improve the overall accuracy of the decision tree model when actual training samples are limited. In this paper, traditional decision trees developed by the standard CART methodology are compared with the enhanced decision trees that utilize the ANN's synthetic data generation, or CART+. This research demonstrates the improved accuracies that can be obtained with CART+, which can ultimately improve the knowledge that can be extracted by researchers about a system being modeled.
引用
收藏
页码:1477 / 1489
页数:13
相关论文
共 52 条
[1]   Neural networks as models of psychopathology [J].
Aakerlund, L ;
Hemmingsen, R .
BIOLOGICAL PSYCHIATRY, 1998, 43 (07) :471-482
[2]   Neural network input representations that produce accurate consensus sequences from DNA fragment assemblies [J].
Allex, CF ;
Shavlik, JW ;
Blattner, FR .
BIOINFORMATICS, 1999, 15 (09) :723-728
[3]  
Andryashin A, 2005, THESIS HUMBOLDT U BE
[4]  
[Anonymous], Artificial Neural Networks/Error-Correction Learning
[5]  
[Anonymous], 1984, OLSHEN STONE CLASSIF, DOI 10.2307/2530946
[6]  
[Anonymous], CART PRO 6 0
[7]  
[Anonymous], NEUROSOLUTIONS V5 07, V07
[8]  
[Anonymous], QUEST CLASSIFICATION
[9]   Data mining with decision trees and decision rules [J].
Apte, C ;
Weiss, S .
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 1997, 13 (2-3) :197-210
[10]  
Bhagat P.M., 2005, Pattern Recognition in Industry