Construction of decision trees based entropy and rough sets under tolerance relation

被引:4
作者
Yang, Ning [1 ]
Li, Tianrui [1 ]
Song, Jing [1 ]
机构
[1] SW Jiaotong Univ, Dept Math, Chengdu 610031, Peoples R China
来源
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND KNOWLEDGE ENGINEERING (ISKE 2007) | 2007年
关键词
data mining; decision tree; rough set; tolerance relation;
D O I
10.2991/iske.2007.258
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Decision tree induction is one of the most popular data mining techniques with applications in various fields. Present algorithms for construction decision trees can not deal with missing value in information systems properly. A new concept, rough gain ratio, is first introduced by the aid of tolerance relations in the extended rough sets theory. Then, an approach for inducing decision trees under the rough gain ratio is presented. Examples show that the decision trees generated by the proposed method tend to have simpler structure and more understandable rules than C4.5.
引用
收藏
页数:1
相关论文
共 15 条
[1]  
Acuña E, 2004, ST CLASS DAT ANAL, P639
[2]  
Batista GEAPA, 2003, APPL ARTIF INTELL, V17, P519, DOI [10.1080/713827181, 10.1080/08839510390219309]
[3]  
Berry M.J., 2000, MASTERING DATA MININ
[4]   A DISTANCE-BASED ATTRIBUTE SELECTION MEASURE FOR DECISION TREE INDUCTION [J].
DEMANTARAS, RL .
MACHINE LEARNING, 1991, 6 (01) :81-92
[5]  
[蒋芸 Jiang Yun], 2004, [计算机应用, Computer Applications], V24, P21
[6]   Rough set approach to incomplete information systems [J].
Kryszkiewicz, M .
INFORMATION SCIENCES, 1998, 112 (1-4) :39-49
[7]   A rough sets based characteristic relation approach for dynamic attribute generalization in data mining [J].
Li, Tianrui ;
Ruan, Da ;
Geert, Wets ;
Song, Jing ;
Xu, Yang .
KNOWLEDGE-BASED SYSTEMS, 2007, 20 (05) :485-494
[8]  
Miao Duoqian, 1997, Journal of Software, V8, P425
[9]   Post-pruning in decision tree induction using multiple performance measures [J].
Osei-Bryson, Kweku-Muata .
COMPUTERS & OPERATIONS RESEARCH, 2007, 34 (11) :3331-3345
[10]  
Pawlak Z., 1991, Rough sets: Theoretical aspects of reasoning about data, DOI DOI 10.1007/978-94-011-3534-4