Data mining based on the generalization distribution table and rough sets

Cited by: 0
Authors
Zhong, N [1]
Dong, JZ
Ohsuga, S
Affiliations
[1] Yamaguchi Univ, Dept Comp Sci & Sys Eng, Yamaguchi, Japan
[2] Waseda Univ, Dept Informat & Comp Sci, Tokyo, Japan
Source
RESEARCH AND DEVELOPMENT IN KNOWLEDGE DISCOVERY AND DATA MINING | 1998 / Vol. 1394
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial Intelligence Theory];
Discipline classification code
081104; 0812; 0835; 1405;
Abstract
This paper introduces a new approach for mining if-then rules in databases with uncertainty and incompleteness. The approach combines the Generalization Distribution Table (GDT) with the rough set methodology. The GDT provides a probabilistic basis for evaluating the strength of a rule and is used to select the rules with larger strengths from the set of possible rules. The rough set methodology is then used to find minimal relative reducts from the set of rules with larger strengths. The strength of a rule represents the uncertainty of the rule, which is influenced by both unseen instances and noise. With this approach, a minimal set of rules with larger strengths can be acquired from databases containing noisy, incomplete data. We have applied this approach to discover rules from some real databases.
Pages: 360-373
Page count: 14