Improving Decision Tree Performance by Exception Handling

被引:8
作者
Subramanian A.A.B. [1 ]
Pramala S. [1 ]
Rajalakshmi B. [1 ]
Rajaram R. [2 ]
机构
[1] Department of Information Technology, Thiagarajar College of Engineering, Madurai
[2] Department of Computer Science, Thiagarajar College of Engineering, Madurai
关键词
association rule mining (ARM); classification; Data mining; decision tree; k-nearest-neighbour (k-NN); majority voting; naive Bayes (NB);
D O I
10.1007/s11633-010-0517-5
中图分类号
学科分类号
摘要
This paper focuses on improving decision tree induction algorithms when a kind of tie appears during the rule generation procedure for specific training datasets. The tie occurs when there are equal proportions of the target class outcome in the leaf node's records that leads to a situation where majority voting cannot be applied. To solve the above mentioned exception, we propose to base the prediction of the result on the naive Bayes (NB) estimate, k-nearest neighbour (k-NN) and association rule mining (ARM). The other features used for splitting the parent nodes are also taken into consideration. © 2010 Institute of Automation, Chinese Academy of Sciences and Springer-Verlag Berlin Heidelberg.
引用
收藏
页码:372 / 380
页数:8
相关论文
共 24 条
[1]  
Fayyad U.M., Piatetsky-Shapiro G., Smyth P., From data mining to knowledge discovery: An overview, Advances in Knowledge Discovery and Data Mining, pp. 1-34, (1996)
[2]  
Quinlan J.R., Induction of decision trees, Journal of Machine Learning, 1, 1, pp. 81-106, (1986)
[3]  
Quinlan J.R., Simplifying decision trees, International Journal of Man-Machine Studies, 27, 3, pp. 221-234, (1987)
[4]  
Utgoff P.E., Improved training via incremental learning, Proceedings of the 6th International Workshop on Machine Learning, pp. 362-365, (1989)
[5]  
Utgoff P.E., ID5: An incremental ID3, Proceedings of the 5th International Conference on Machine Learning, pp. 107-120, (1988)
[6]  
Utgoff P.E., Incremental induction of decision trees, Machine Learning, 4, 2, pp. 161-186, (1989)
[7]  
Balamurugan S.A., Rajaram R., Effective and efficient feature selection for large scale data using Bayes' theorem, International Journal of Automation and Computing, 6, 1, pp. 62-71, (2009)
[8]  
Buntine W., Learning classification trees, Statistics and Computing, 2, 2, pp. 63-73, (1992)
[9]  
Hartmann C.R.P., Varshney P.K., Mehrotra K.G., Gerberich C.L., Application of information theory to the construction of efficient decision trees, IEEE Transactions on Information Theory, 28, 4, pp. 565-577, (1982)
[10]  
Mickens J., Szummer M., Narayanan D., Snitch interactive decision trees for troubleshooting misconfigurations, Proceedings of the 2nd USENIX Workshop on Tackling Computer Systems Problems with Machine Learning Techniques, (2007)