Mining fuzzy association rules from uncertain data

被引:38
作者
Weng, Cheng-Hsiung [1 ]
Chen, Yen-Liang [2 ]
机构
[1] Cent Taiwan Univ Sci & Technol, Dept Management Informat Syst, Taichung 406, Taiwan
[2] Natl Cent Univ, Dept Informat Management, Chungli 320, Taiwan
关键词
Learning; Fuzzy statistics and data analysis; Uncertain data; Data mining; Fuzzy association rules; MEMBERSHIP FUNCTIONS; SEQUENTIAL PATTERNS; FREQUENT ITEMSETS; CLASSIFICATION; INDUCTION; KNOWLEDGE; SUPPORT;
D O I
10.1007/s10115-009-0223-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Association rule mining is an important data analysis method that can discover associations within data. There are numerous previous studies that focus on finding fuzzy association rules from precise and certain data. Unfortunately, real-world data tends to be uncertain due to human errors, instrument errors, recording errors, and so on. Therefore, a question arising immediately is how we can mine fuzzy association rules from uncertain data. To this end, this paper proposes a representation scheme to represent uncertain data. This representation is based on possibility distributions because the possibility theory establishes a close connection between the concepts of similarity and uncertainty, providing an excellent framework for handling uncertain data. Then, we develop an algorithm to mine fuzzy association rules from uncertain data represented by possibility distributions. Experimental results from the survey data show that the proposed approach can discover interesting and valuable patterns with high certainty.
引用
收藏
页码:129 / 152
页数:24
相关论文
共 54 条
[1]  
Agrawal R., 1993, SIGMOD Record, V22, P207, DOI 10.1145/170036.170072
[2]  
Agrawal R., 1994, P 20 INT C VER LARG, P487, DOI DOI 10.5555/645920.672836
[3]   Learning the membership function contexts for mining fuzzy association rules by using genetic algorithms [J].
Alcala-Fdez, Jesus ;
Alcala, Rafael ;
Jose Gacto, Maria ;
Herrera, Francisco .
FUZZY SETS AND SYSTEMS, 2009, 160 (07) :905-921
[4]   Determination of fuzzy logic membership functions using genetic algorithms [J].
Arslan, A ;
Kaya, M .
FUZZY SETS AND SYSTEMS, 2001, 118 (02) :297-306
[5]  
Berry MichaelJ., 1997, DATA MINING TECHNIQU
[6]   Data quality awareness: a case study for cost optimal association rule mining [J].
Berti-Equille, Laure .
KNOWLEDGE AND INFORMATION SYSTEMS, 2007, 11 (02) :191-215
[7]   Mining association rules from imprecise ordinal data [J].
Chen, Yen-Liang ;
Weng, Cheng-Hsiung .
FUZZY SETS AND SYSTEMS, 2008, 159 (04) :460-474
[8]   A new approach for discovering fuzzy quantitative sequential patterns in sequence databases [J].
Chen, Yen-Liang ;
Huang, Tony Cheng-Kui .
FUZZY SETS AND SYSTEMS, 2006, 157 (12) :1641-1661
[9]   Discovering fuzzy time-interval sequential patterns in sequence databases [J].
Chen, YL ;
Huang, TCK .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2005, 35 (05) :959-972
[10]   Mining generalized knowledge from ordered data through attribute-oriented induction techniques [J].
Chen, YL ;
Shen, CC .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2005, 166 (01) :221-245