Mining frequent closed patterns with item constraints in data streams

被引:0
作者
Hu, Wei-Cheng [1 ]
Wang, Ben-Nian [1 ]
Cheng, Zhuan-Liu [1 ]
机构
[1] Tongling Coll, Dept Comp Sci, Tongling 244000, Peoples R China
来源
PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7 | 2008年
关键词
data mining; data streams; association rule; frequent closed itemsets;
D O I
10.1109/ICMLC.2008.4620417
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In order to efficiently filter the useful association rules through a large number of mined rules, some item constraints that are boolean expresssions are integrated into the associations discovery algorithm. The set of frequent closed patterns uniquely determines the complete set of all frequent patterns, and it can be orders of magnitude smaller than the latter. According to the features of data streams, a new algorithm, call DSCFCI, is proposed for mining frequent closed patterns with item constraints in data streams. The data stream is divided into a set of segments, and a new data structure called DSCFCI-tree is used to store the potential frequent closed patterns with item constraints dynamically. With the arrival of each batch of data, the algorithm builds a corresponding local DSCFCI-tree firstly, then updates and prunes the global DSCFCI-tree effectively to mine the frequent closed patterns with item constraints in the entire data stream. The experiments and analysis show that the algorithm has good performance.
引用
收藏
页码:274 / 280
页数:7
相关论文
共 6 条
[1]  
ARASU A, 2004, P 23 ACM SIGMOD SIGA
[2]  
DATAR M, 2002, P 13 ANN ACM SIAM S
[3]  
Giannella C., 2003, NEXT GENERATION DATA
[4]  
MANKU G, 2002, 28 INT C VER LARG DA
[5]  
PASQUIER N, 1999, P 17 INT C DAT THEOR
[6]  
Wang J., 2003, P 9 ACM SIGKDD INT C