A sliding window based algorithm for frequent closed itemset mining over data streams

被引:37
作者
Nori, Fatemeh [1 ]
Deypir, Mahmood [2 ]
Sadreddini, Mohamad Hadi [1 ]
机构
[1] Shiraz Univ, Sch Engn, Dept Comp Sci & Engn, Shiraz, Iran
[2] Univ Aeronaut Sci & Technol, Tehran, Iran
关键词
Closed frequent itemsets; Data stream; Sliding window; Concept change; Data mining; PATTERNS;
D O I
10.1016/j.jss.2012.10.011
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Frequent pattern mining over data streams is an important problem in the context of data mining and knowledge discovery. Mining frequent closed itemsets within sliding window instead of complete set of frequent itemset is very interesting since it needs a limited amount of memory and processing power. Moreover, handling concept change within a compact set of closed patterns is faster. However, it requires flexible and efficient data structures as well as intuitive algorithms. In this paper, we have introduced an effective and efficient algorithm for closed frequent itemset mining over data streams operating in the sliding window model. This algorithm uses a novel data structure for storing transactions of the window and corresponding frequent closed itemsets. Moreover, the support of a new frequent closed itemset is efficiently computed and an old pattern is removed from the monitoring set when it is no longer frequent closed itemset. Extensive experiments on both real and synthetic data streams show that the proposed algorithm is superior to previously devised algorithms in terms of runtime and memory usage. Published by Elsevier Inc.
引用
收藏
页码:615 / 623
页数:9
相关论文
共 26 条
[1]  
Agrawal R., P 20 INT C VERY LARG
[2]   estWin:: Online data stream mining of recent frequent itemsets by sliding window method [J].
Chang, JH ;
Lee, WS .
JOURNAL OF INFORMATION SCIENCE, 2005, 31 (02) :76-90
[3]   Finding recently frequent itemsets adaptively over online transactional data streams [J].
Chang, Joong Hyuk ;
Lee, Won Suk .
INFORMATION SYSTEMS, 2006, 31 (08) :849-869
[4]   Catch the moment: maintaining closed frequent itemsets over a data stream sliding window [J].
Chi, Yun ;
Wang, Haixun ;
Yu, Philip S. ;
Muntz, Richard R. .
KNOWLEDGE AND INFORMATION SYSTEMS, 2006, 10 (03) :265-294
[5]   EclatDS: An efficient sliding window based frequent pattern mining method for data streams [J].
Deypir, Mahmood ;
Sadreddini, Mohammad Hadi .
INTELLIGENT DATA ANALYSIS, 2011, 15 (04) :571-587
[6]  
Gopalan R. P., 2004, P SIAM INT WORKSH HI
[7]   Mining frequent patterns without candidate generation: A frequent-pattern tree approach [J].
Han, JW ;
Pei, J ;
Yin, YW ;
Mao, RY .
DATA MINING AND KNOWLEDGE DISCOVERY, 2004, 8 (01) :53-87
[8]  
Jiang N., 2006, P SIGKDD 2006
[9]  
Koh JL, 2009, LECT NOTES COMPUT SC, V5667, P334
[10]  
Leung CKS, 2006, IEEE DATA MINING, P928