An Algorithm for Mining Frequent Closed Itemsets in Data Stream

被引:0
作者
Dai, Caiyan [1 ]
Chen, Ling [1 ]
机构
[1] Yangzhou Univ, Dept Comp Sci, Yangzhou 225009, Peoples R China
来源
INTERNATIONAL CONFERENCE ON APPLIED PHYSICS AND INDUSTRIAL ENGINEERING 2012, PT C | 2012年 / 24卷
关键词
stream data; closed frequent data itemsets; sliding window;
D O I
10.1016/j.phpro.2012.02.254
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Mining frequent itemsets from data streams by the model of sliding window has been extensively studied. This paper presents an algorithm AFPCFI-DS for mining the frequent itemsets from data streams. The algorithm detects the frequent items using a FP-tree in each sliding window. In processing each new window the algorithm first changes the head table and then modifies the FP-tree according to the changed items in the head table. The algorithm also adopts local updating strategy to avoid the time-consuming operations of searching in the whole tree to add or delete transactions. Our experimental results show that the algorithm is more efficient and has lower time and memory complexity than the algorithms Moment and FPCFI-DS. (C) 2011 Published by Elsevier B.V. Selection and/or peer-review under responsibility of ICAPIE Organization Committee.
引用
收藏
页码:1722 / 1728
页数:7
相关论文
共 10 条
[1]  
Agrawal R., 1994, P 20 INT C VER LARG, P487, DOI DOI 10.5555/645920.672836
[2]   An Efficient Algorithm for Mining Closed Frequent Itemsets in Data Streams [J].
Ao, Fujiang ;
Du, Jing ;
Yan, Yuejin ;
Liu, Baohong ;
Huang, Kedi .
8TH IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY WORKSHOPS: CIT WORKSHOPS 2008, PROCEEDINGS, 2008, :37-+
[3]  
Chang J.H., 2003, P 9 ACM SIGKDD INT C, P487, DOI DOI 10.1145/956750.956807
[4]   Moment: Maintaining closed frequent itemsets over a stream sliding window [J].
Chi, Y ;
Wang, HX ;
Yu, PS ;
Muntz, RR .
FOURTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2004, :59-66
[5]  
Gannell C., 2003, MINING FREQUENT ITEM
[6]  
Han, P ACM SIGMOD 2000
[7]  
Pasquier N, 1999, LECT NOTES COMPUT SC, V1540, P398
[8]  
Singh Manku G., 2002, Proceedings of the Twenty-eighth International Conference on Very Large Data Bases, P346
[9]  
Teng WG, 2003, P 29 INT C VER LARG, P607
[10]  
Zhu Y., 2002, VLDB, P358