Rough Set Approach to Multivariate Decision Trees Inducing

被引:3
作者
Wang, Dianhong [1 ]
Liu, Xingwen [1 ]
Jiang, Liangxiao [1 ]
Zhang, Xiaoting [1 ]
Zhao, Yongguang [1 ]
机构
[1] China Univ Geosci, Wuhan 430074, Hubei, Peoples R China
关键词
decision tree; classification; multivariate decision trees (MDT); rough set; positive region; generalization;
D O I
10.4304/jcp.7.4.870-879
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Aimed at the problem of huge computation, large tree size and over-fitting of the testing data for multivariate decision tree (MDT) algorithms, we proposed a novel rough-set-based multivariate decision trees (RSMDT) method. In this paper, the positive region degree of condition attributes with respect to decision attributes in rough set theory is used for selecting attributes in multivariate tests. And a new concept of extended generalization of one equivalence relation corresponding to another one is introduced and used for construction of multivariate tests. We experimentally test RSMDT algorithm in terms of classification accuracy, tree size and computing time, using the whole 36 UCI Machine Learning Repository data sets selected by Weka platform, and compare it with C4.5, classification and regression trees (CART), classification and regression trees with linear combinations (CART-LC), Oblique Classifier 1 (OC1), Quick Unbiased Efficient Statistical Trees (QUEST). The experimental results indicate that RSMDT algorithm significantly outperforms the comparison classification algorithms with improved classification accuracy, relatively small tree size, and shorter computing time.
引用
收藏
页码:870 / 879
页数:10
相关论文
共 33 条
[1]  
Bennett K. P., 1992, TECHNICAL REPORT
[2]  
Blake C.L., 1998, UCI REPOSITORY MACHI
[3]  
Breiman L., 2017, CLASSIFICATION REGRE
[4]  
BRODLEY CE, 1995, MACH LEARN, V19, P45, DOI 10.1007/BF00994660
[5]  
BRODLEY CE, 1992, MULTIVARIATE VERSUS
[6]  
BUNTINE W, 1992, INTRO IND VERSION 2
[7]   A rough set approach to attribute generalization in data mining [J].
Chan, CC .
INFORMATION SCIENCES, 1998, 107 (1-4) :169-176
[8]  
Demsar J, 2006, J MACH LEARN RES, V7, P1
[9]   Technical note: Using model trees for classification [J].
Frank, E ;
Wang, Y ;
Inglis, S ;
Holmes, G ;
Witten, IH .
MACHINE LEARNING, 1998, 32 (01) :63-76
[10]   RainForest - A framework for fast decision tree construction of large datasets [J].
Gehrke, J ;
Ramakrishnan, R ;
Ganti, V .
DATA MINING AND KNOWLEDGE DISCOVERY, 2000, 4 (2-3) :127-162