Multi-objective genetic algorithms based automated clustering for fuzzy association rules mining

被引:33
作者
Alhajj, Reda [1 ,3 ]
Kaya, Mehmet [2 ]
机构
[1] Univ Calgary, Dept Comp Sci, Calgary, AB T2N 1N4, Canada
[2] Firat Univ, Dept Comp Engn, TR-23169 Elazig, Turkey
[3] Global Univ, Dept Comp Sci, Beirut, Lebanon
关键词
Automated clustering; CURE; Data mining; Fuzziness; Fuzzy association rules; Multi-objective genetic algorithms;
D O I
10.1007/s10844-007-0044-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Researchers realized the importance of integrating fuzziness into association rules mining in databases with binary and quantitative attributes. However, most of the earlier algorithms proposed for fuzzy association rules mining either assume that fuzzy sets are given or employ a clustering algorithm, like CURE, to decide on fuzzy sets; for both cases the number of fuzzy sets is pre-specified. In this paper, we propose an automated method to decide on the number of fuzzy sets and for the autonomous mining of both fuzzy sets and fuzzy association rules. We achieve this by developing an automated clustering method based on multi-objective Genetic Algorithms (GA); the aim of the proposed approach is to automatically cluster values of a quantitative attribute in order to obtain large number of large itemsets in less time. We compare the proposed multi-objective GA based approach with two other approaches, namely: 1) CURE-based approach, which is known as one of the most efficient clustering algorithms; 2) Chien et al. clustering approach, which is an automatic interval partition method based on variation of density. Experimental results on 100 K transactions extracted from the adult data of USA census in year 2000 showed that the proposed automated clustering method exhibits good performance over both CURE-based approach and Chien et al.'s work in terms of runtime, number of large itemsets and number of association rules.
引用
收藏
页码:243 / 264
页数:22
相关论文
共 30 条
[1]  
Ada Wai-chee Fu, 1998, Intelligent Data Engineering and Learning. Perspectives on Financial Engineering and Data Mining. 1st International Symposium. IDEAL'98, P263
[2]  
[Anonymous], 1997, P ACM SIGMOD INT C M
[3]   Determination of fuzzy logic membership functions using genetic algorithms [J].
Arslan, A ;
Kaya, M .
FUZZY SETS AND SYSTEMS, 2001, 118 (02) :297-306
[4]  
Au WH, 1998, 1998 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AT THE IEEE WORLD CONGRESS ON COMPUTATIONAL INTELLIGENCE - PROCEEDINGS, VOL 1-2, P1314, DOI 10.1109/FUZZY.1998.686309
[5]  
Chan K. C. C., 1997, Proceedings of the Sixth International Conference on Information and Knowledge Management. CIKM'97, P209, DOI 10.1145/266714.266898
[6]  
Chan Man Kuok, 1998, SIGMOD Record, V27, P41, DOI 10.1145/273244.273257
[7]  
Chien BC, 2001, JOINT 9TH IFSA WORLD CONGRESS AND 20TH NAFIPS INTERNATIONAL CONFERENCE, PROCEEDINGS, VOLS. 1-5, P1306, DOI 10.1109/NAFIPS.2001.943736
[8]  
FONSECA CM, 1993, PROCEEDINGS OF THE FIFTH INTERNATIONAL CONFERENCE ON GENETIC ALGORITHMS, P416
[9]  
Goldberg D.E, 1989, GENETIC ALGORITHMS S
[10]   Cure: An efficient clustering algorithm for large databases [J].
Guha, S ;
Rastogi, R ;
Shim, K .
INFORMATION SYSTEMS, 2001, 26 (01) :35-58