Mining maximal frequent itemsets in a sliding window over data streams

被引:0
作者
Mao Y. [1 ,2 ]
Li H. [1 ]
Yang L. [1 ]
Liu L. [1 ]
机构
[1] School of Information Science and Engineering, Central South University
[2] Applied Science Institute of Jiangxi University of Science and Technology
来源
Gaojishu Tongxin/Chinese High Technology Letters | 2010年 / 20卷 / 11期
关键词
Data mining; Data stream; Frequent itemsets; Maximal frequent itemsets; Sliding window;
D O I
10.3772/j.issn.1002-0470.2010.11.008
中图分类号
学科分类号
摘要
In consideration of the problem of data and pattern redundancy in frequent itemsets mining and the close attention to the study of mining maximal frequent itemsets from data streams, this paper presents the MMFI-SW algorithm for mining maximal frequent itemsets in a sliding window over data streams. Firstly, it uses a data structure based on FP-tree to record the current information in streams, at the same time, the obsolete items and a lot of infrequent items are deleted by pruning the tree. Then, it designs a novel method to mine the set of all maximal frequent itemsets in a sliding window over data streams. The theoretical analysis and the experimental results show that the proposed method is efficient.
引用
收藏
页码:1142 / 1148
页数:6
相关论文
共 12 条
[1]  
Li H.F., Lee S.Y., Online mining (recently) maximal frequent itemsets over data streams, Proceedings of the 15th International Workshop on Research Issues in Data Engineering: Stream Data Mining and Application, pp. 11-18, (2005)
[2]  
Agrawal R., Srikant R., Fast algorithms for mining association rules, Proceedings of the 20th International Conference on Very Large Data Bases, pp. 487-499, (1994)
[3]  
Yu J., Chong Z., Zhang H., A false negative approach to mining frequent itemsets from high speed transactional data streams, Information Sciences, 176, 14, pp. 1986-2015, (2006)
[4]  
Mao G.J., Wu X.D., Liu C.N., Onling mining of maximal frequent itemsequences from data streams, Journal of Information Science, 33, 3, pp. 251-262, (2007)
[5]  
Han J., Pei J., Yin Y., Mining frequent patterns without candidate generation, Proceedings of the ACM International Conference on Management of Data, pp. 1-12, (2000)
[6]  
Giannella G., Han J., Yu P., Mining frequent patterns in data streams at multiple time granularities, Data Mining: Next Generation Challenges and Future Directions, pp. 191-212, (2004)
[7]  
Ming Y.L., Chen H.S., Interactive mining of frequent patterns in a data stream of time-fading models, Proceedings of the 8th International Conference on Intelligent Systems Design and Applications, pp. 513-518, (2008)
[8]  
Chi Y., Wang H., Yu P., MOMENT: maintaining closed frequent itemsets over a data stream sliding window, Proceedings of the 2004 IEEE International Conference on Data Mining, pp. 59-66, (2004)
[9]  
Gruenwald J.N., CFI-stream: mining closed frequent itemsets in data streams, Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 592-597, (2006)
[10]  
Ranganath B.N., Murty M.N., Stream-close: fast mining of closed frequent itemsets in high speed data streams, Proceeding of 2008 IEEE International Conference on Data Mining Workshops, pp. 516-525, (2008)