Incremental mining of closed inter-transaction itemsets over data stream sliding windows

被引:7
作者
Chiu, Shih-Chuan [1 ]
Li, Hua-Fu [2 ]
Huang, Jiun-Long [1 ]
You, Hsin-Han [1 ]
机构
[1] Natl Chiao Tung Univ, Dept Comp Sci, Hsinchu 300, Taiwan
[2] Kainan Univ, Dept Informat Management, Tao Yuan 338, Taiwan
关键词
data mining; data streams; incremental mining; stream sliding window mining; frequent inter-transaction itemsets; PATTERNS;
D O I
10.1177/0165551511401539
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Mining inter-transaction association rules is one of the most interesting issues in data mining research. However, in a data stream environment the previous approaches are unable to find the result of the new-incoming data and the original database without re-computing the whole database. In this paper, we propose an incremental mining algorithm, called DSM-CITI (Data Stream Mining for Closed Inter-Transaction Itemsets), for discovering the set of all frequent inter-transaction itemsets from data streams. In the framework of DSM-CITI, a new in-memory summary data structure, ITP-tree, is developed to maintain frequent inter-transaction itemsets. Moreover, algorithm DSM-CITI is able to construct ITP-tree incrementally and uses the property to avoid unnecessary updates. Experimental studies show that the proposed algorithm is efficient and scalable for mining frequent inter-transaction itemsets over stream sliding windows.
引用
收藏
页码:208 / 220
页数:13
相关论文
共 25 条
[1]  
Babcock B., 2002, PODS, P1, DOI [DOI 10.1145/543613.543615, 10.1145/543613.543615]
[2]   Efficient mining method for retrieving sequential patterns over online data streams [J].
Chang, JH ;
Lee, WS .
JOURNAL OF INFORMATION SCIENCE, 2005, 31 (05) :420-432
[3]  
CHEN J, 2005, P 3 INT C NETW MOB C, P519
[4]  
CHEN Y, 2002, P 28 INT C VER LARG, P323
[5]   InterTARM: FP-tree based Framework for Mining Inter-transaction Association Rules from Stock Market Data [J].
Chhinkaniwala, Hitesh ;
Thilagam, P. Santhi .
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY, 2008, :513-517
[6]  
Chiu SC, 2009, IEEE INT CON MULTI, P1170, DOI 10.1109/ICME.2009.5202708
[7]   IFCIA: An efficient algorithm for mining intertransaction frequent closed itemsets [J].
Dong, Jie ;
Han, Min .
FOURTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 2, PROCEEDINGS, 2007, :678-+
[8]   A template model for multidimensional inter-transactional association rules [J].
Feng, L ;
Yu, JX ;
Lu, HJ ;
Han, JW .
VLDB JOURNAL, 2002, 11 (02) :153-175
[9]   Efficiently mining maximal frequent itemsets [J].
Gouda, K ;
Zaki, MJ .
2001 IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2001, :163-170
[10]   Clustering data streams [J].
Guha, S ;
Mishra, N ;
Motwani, R ;
O'Callaghan, L .
41ST ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE, PROCEEDINGS, 2000, :359-366