Mining fuzzy association rules from heterogeneous probabilistic datasets

被引:3
作者
Pei, Bin [1 ]
Zhao, Tingting [1 ]
Zhao, Suyun [1 ]
Chen, Hong [1 ]
机构
[1] MOE, Key Lab Data Engn & Knowledge Engn, Beijing, Peoples R China
来源
2012 IEEE 24TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2012), VOL 1 | 2012年
关键词
fuzzy association rules; heterogeneous probabilistic dataset; data mining; FREQUENT ITEMSETS; UNCERTAIN;
D O I
10.1109/ICTAI.2012.116
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Association rule mining (ARM), as a useful method to discover relations between attributes of objects, has been widely studied. The previous methods focused on ARM either from a certain dataset with different type attributes, or from a probabilistic dataset with only Boolean attributes. However, little work on ARM from a probabilistic dataset with coexistence of different type attributes has been mentioned. Such dataset is named Heterogeneous Probabilistic Dataset (HPD), which is prevalent in the real-world applications. This paper develops a generic framework to discover association rules from a HPD. Considering the different type data in the dataset, we first convert a HPD to a probabilistic dataset with fuzzy sets by fuzzification. A novel Shannon-like Entropy is then introduced to measure the information of an item with coexistence of fuzzy uncertainty hidden in different type data and random uncertainty in the transformed dataset. Based on this Shannon-like Entropy, Support and Confidence degrees for such multi-uncertain dataset are defined. Finally, we design an Apriori-like algorithm to mine association rules from a HPD using the above measures. Experimental results show that the proposed algorithm for HPD is feasible and effective.
引用
收藏
页码:828 / 835
页数:8
相关论文
共 21 条
[1]  
Abiteboul S., 1987, SIGMOD Record, V16, P34, DOI 10.1145/38714.38724
[2]  
Aggarwal CC, 2009, ADV DATABASE SYST, V35, P427
[3]  
Aggarwal CC, 2009, KDD-09: 15TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, P29
[4]  
Agrawal R., P 20 INT C VERY LARG
[5]  
[Anonymous], 2006, IEEE Date Eng. Bull.
[6]  
[Anonymous], 1988, Possibility Theory
[7]  
Antova L., 2007, Proceedings of the 2007 IEEE 23rd International Conference on Data Engineering (ICDE'07), P606
[8]  
Bernecker T, 2009, KDD-09: 15TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, P119
[9]  
BOSC P, 2001, P IFSA NAFIPS 2001 V
[10]  
Calders T, 2010, LECT NOTES ARTIF INT, V6118, P480