An improved association rules mining method

被引:31
作者
Liu, Xiaobing [2 ]
Zhai, Kun [1 ,2 ]
Pedrycz, Witold [1 ,3 ]
机构
[1] Univ Alberta, Dept Elect & Comp Engn, Edmonton, AB T6G 2G7, Canada
[2] Dalian Univ Technol, Fac Management, Dalian 116023, Peoples R China
[3] Polish Acad Sci, Syst Res Inst, PL-01447 Warsaw, Poland
关键词
Association rule; Maximal frequent itemsets; Directed itemsets graph; Trifurcate linked list storage structure; Mining algorithm; FREQUENT ITEMSETS; EFFICIENT ALGORITHM; DATA STREAMS; DATABASES; SUPPORT; SEARCH; TREE; SET;
D O I
10.1016/j.eswa.2011.08.018
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Mining maximal frequent itemsets is of paramount relevance in many of data mining applications. The "traditional" algorithms address this problem through scanning databases many times. The latest research has already focused on reducing the number of scanning times of databases and then decreasing the number of accessing times of I/O resources in order to improve the overall mining efficiency of maximal frequent itemsets of association rules. In this paper, we present a form of the directed itemsets graph to store the information of frequent itemsets of transaction databases, and give the trifurcate linked list storage structure of directed itemsets graph. Furthermore, we develop the mining algorithm of maximal frequent itemsets based on this structure. As a result, one realizes scanning a database only once, and improves storage efficiency of data structure and time efficiency of mining algorithm. (C) 2011 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1362 / 1374
页数:13
相关论文
共 48 条
[1]  
Agarwal R. C., 2000, Proceedings. KDD-2000. Sixth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, P108, DOI 10.1145/347090.347114
[2]  
AGRAWAL R, 1996, P ACM SIGMOD INT C M, P207
[3]  
Agrawal R., 1994, P 20 INT C VER LARG, P487, DOI DOI 10.5555/645920.672836
[4]  
[Anonymous], UCI MACH LEARN REP
[5]  
[Anonymous], P 1998 ACM SIGMOD IN
[6]   IMine: Index Support for Item Set Mining [J].
Baralis, Elena ;
Cerquitelli, Tania ;
Chiusano, Silvia .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2009, 21 (04) :493-506
[7]   Cohesion: A concept and framework for confident association discovery with potential application in microarray mining [J].
Bhattacharyya, Ramkishore .
APPLIED SOFT COMPUTING, 2011, 11 (01) :592-604
[8]   MAFIA: A maximal frequent itemset algorithm for transactional databases [J].
Burdick, D ;
Calimlim, M ;
Gehrke, J .
17TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2001, :443-452
[9]   Efficient strategies for tough aggregate constraint-based sequential pattern mining [J].
Chen, Enhong ;
Cao, Huanhuan ;
Li, Qing ;
Qian, Tieyun .
INFORMATION SCIENCES, 2008, 178 (06) :1498-1518
[10]   Continuous Subgraph Pattern Search over Certain and Uncertain Graph Streams [J].
Chen, Lei ;
Wang, Changliang .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2010, 22 (08) :1093-1109