Mining fuzzy association rules from uncertain data

被引:0
作者
Cheng-Hsiung Weng
Yen-Liang Chen
机构
[1] Central Taiwan University of Science and Technology,Department of Management Information Systems
[2] National Central University,Department of Information Management
来源
Knowledge and Information Systems | 2010年 / 23卷
关键词
Learning; Fuzzy statistics and data analysis; Uncertain data; Data mining; Fuzzy association rules;
D O I
暂无
中图分类号
学科分类号
摘要
Association rule mining is an important data analysis method that can discover associations within data. There are numerous previous studies that focus on finding fuzzy association rules from precise and certain data. Unfortunately, real-world data tends to be uncertain due to human errors, instrument errors, recording errors, and so on. Therefore, a question arising immediately is how we can mine fuzzy association rules from uncertain data. To this end, this paper proposes a representation scheme to represent uncertain data. This representation is based on possibility distributions because the possibility theory establishes a close connection between the concepts of similarity and uncertainty, providing an excellent framework for handling uncertain data. Then, we develop an algorithm to mine fuzzy association rules from uncertain data represented by possibility distributions. Experimental results from the survey data show that the proposed approach can discover interesting and valuable patterns with high certainty.
引用
收藏
页码:129 / 152
页数:23
相关论文
共 111 条
[1]  
Alcala-Fdez J(2009)Learning the membership function contexts for mining fuzzy association rules by using genetic algorithms Fuzzy Set Syst 160 905-921
[2]  
Alcala R(2001)Determination of fuzzy logic membership functions using genetic algorithms Fuzzy Set Syst 118 297-306
[3]  
Gacto MJ(2007)Data quality awareness: a case study for cost-optimal association rule mining Knowl Inf Syst 11 191-215
[4]  
Herrera F(2005)A sampling-based method for mining frequent patterns from databases Lect Notes Artif Int 3614 536-545
[5]  
Arslan A(2005)Discovering fuzzy time-interval sequential patterns in sequence databases IEEE Trans Syst Man Cybern B 35 959-972
[6]  
Kaya M(2006)A new approach for discovering fuzzy quantitative sequential patterns in sequence databases Fuzzy Set Syst 157 1641-1661
[7]  
Berti-Equille L(2005)Mining generalized knowledge from ordered data through attribute-oriented induction techniques Eur J Oper Res 166 221-245
[8]  
Chen YL(2005)Market basket analysis in a multiple store environment Decis Support Syst 40 339-354
[9]  
Ho CY(2008)Mining association rules from imprecise ordinal data Fuzzy Set Syst 159 60-474
[10]  
Chen YL(2008)A survey on algorithms for mining frequent itemsets over data streams Knowl Inf Syst 16 1-27