Decaying obsolete information in finding recent frequent itemsets over data streams

Cited by: 0
Authors
Chang, JH [1]
Lee, WS [1]
Affiliations
[1] Yonsei Univ, Dept Comp Sci, Seoul 120749, South Korea
Keywords
decaying obsolete information; recent frequent itemsets; data streams
DOI
Not available
CLC classification number
TP [Automation technology, computer technology]
Subject classification code
0812
Abstract
A data stream is a massive unbounded sequence of data elements continuously generated at a rapid rate. Consequently, the knowledge embedded in a data stream is likely to change over time. However, most mining algorithms and frequency approximation algorithms for a data stream cannot adaptively extract recent changes in the information it carries. This is because the obsolete information of old transactions, which may no longer be useful or may even be invalid at present, is treated as being as important as that of recent transactions. This paper proposes an information decay method for finding recent frequent itemsets in a data stream. The effect of old transactions on the mining result of a data stream is gradually diminished as time goes by. Furthermore, the decay rate of information can be flexibly adjusted, which enables a user to define the desired lifetime of the information of a transaction in a data stream.
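The abstract does not give the decay formula, so the following is only a minimal sketch of the general idea, assuming exponential decay: every count is multiplied by a constant factor each time a new transaction arrives, so older transactions contribute less and less. The class name `DecayedItemsetCounter`, the `half_life` parameter (the number of transactions after which a transaction's weight is halved), and the `max_size` cap on itemset length are all hypothetical names chosen for illustration, not taken from the paper.

```python
from itertools import combinations


class DecayedItemsetCounter:
    """Illustrative exponentially decayed itemset counter (not the paper's algorithm)."""

    def __init__(self, half_life, max_size=2):
        # Assumption: a transaction's weight halves every `half_life` transactions.
        self.decay = 2.0 ** (-1.0 / half_life)
        self.max_size = max_size  # largest itemset size that is tracked
        self.total = 0.0          # decayed number of transactions seen
        self.counts = {}          # itemset -> (decayed count, tick of last update)
        self.tick = 0

    def add(self, transaction):
        """Process one transaction (an iterable of hashable, sortable items)."""
        self.tick += 1
        self.total = self.total * self.decay + 1.0
        items = sorted(set(transaction))
        for k in range(1, self.max_size + 1):
            for subset in combinations(items, k):
                count, last = self.counts.get(subset, (0.0, self.tick))
                # Decay lazily for the ticks skipped since the last update.
                count = count * self.decay ** (self.tick - last) + 1.0
                self.counts[subset] = (count, self.tick)

    def support(self, itemset):
        """Decayed support of `itemset` relative to the decayed transaction total."""
        if self.total == 0.0:
            return 0.0
        key = tuple(sorted(set(itemset)))
        count, last = self.counts.get(key, (0.0, self.tick))
        return count * self.decay ** (self.tick - last) / self.total

    def frequent(self, min_support):
        """Tracked itemsets whose decayed support is at least `min_support`."""
        return {k for k in self.counts if self.support(k) >= min_support}


counter = DecayedItemsetCounter(half_life=1)
for t in (["a"], ["b"], ["b"]):
    counter.add(t)
recent = counter.frequent(0.5)
```

In this sketch a smaller `half_life` makes the counter forget old transactions faster, which is one way to read the adjustable "lifetime" the abstract mentions. Enumerating every subset up to `max_size` is a simplification that would not scale to long transactions.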
Pages: 1588-1592
Page count: 5