Sliding Window- based Frequent Itemsets Mining over Data Streams using Tail Pointer Table

被引:4
作者
Wang, Le [1 ,2 ,3 ]
Feng, Lin [1 ,2 ]
Jin, Bo [1 ,2 ]
机构
[1] Dalian Univ Technol, Sch Comp Sci & Technol, Dalian 116024, Peoples R China
[2] Dalian Univ Technol, Sch Innovat Expt, Dalian 116024, Peoples R China
[3] Ningbo Dahongying Univ, Sch Informat Engn, Ningbo 315175, Zhejiang, Peoples R China
关键词
data mining; data streams; frequent itemsets; sliding window; tail pointer table;
D O I
10.1080/18756891.2013.859860
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Mining frequent itemsets over transaction data streams is critical for many applications, such as wireless sensor networks, analysis of retail market data, and stock market predication. The sliding window method is an important way of mining frequent itemsets over data streams. The speed of the sliding window is affected not only by the efficiency of the mining algorithm, but also by the efficiency of updating data. In this paper, we propose a new data structure with a Tail Pointer Table and a corresponding mining algorithm; we also propose a algorithm COFI2, a revised version of the frequent itemsets mining algorithm COFI (Co-Occurrence Frequent-Item), to reduce the temporal and memory requirements. Further, theoretical analysis and experiments are carried out to prove their effectiveness.
引用
收藏
页码:25 / 36
页数:12
相关论文
共 13 条
[1]   Efficient Tree Structures for High Utility Pattern Mining in Incremental Databases [J].
Ahmed, Chowdhury Farhan ;
Tanbeer, Syed Khairuzzaman ;
Jeong, Byeong-Soo ;
Lee, Young-Koo .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2009, 21 (12) :1708-1721
[2]   DBV-Miner: A Dynamic Bit-Vector approach for fast mining frequent closed itemsets [J].
Bay Vo ;
Hong, Tzung-Pei ;
Bac Le .
EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (08) :7196-7206
[3]   Catch the moment: maintaining closed frequent itemsets over a data stream sliding window [J].
Chi, Yun ;
Wang, Haixun ;
Yu, Philip S. ;
Muntz, Richard R. .
KNOWLEDGE AND INFORMATION SYSTEMS, 2006, 10 (03) :265-294
[4]   A dynamic layout of sliding window for frequent itemset mining over data streams [J].
Deypir, Mahmood ;
Sadreddini, Mohammad Hadi .
JOURNAL OF SYSTEMS AND SOFTWARE, 2012, 85 (03) :746-759
[5]   Max-FISM: Mining (recently) maximal frequent itemsets over data streams using the sliding window model [J].
Farzanyar, Zahra ;
Kangavari, Mohammadreza ;
Cercone, Nick .
COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2012, 64 (06) :1706-1718
[6]   Discovery of maximum length frequent itemsets [J].
Hu, Tianming ;
Sung, Sam Yuan ;
Xiong, Hui ;
Fu, Qian .
INFORMATION SCIENCES, 2008, 178 (01) :69-87
[7]  
Leung CKS, 2008, LECT NOTES COMPUT SC, V5071, P2, DOI 10.1007/978-3-540-70504-8_2
[8]   Isolated items discarding strategy for discovering high utility itemsets [J].
Li, Yu-Chiang ;
Yeh, Jieh-Shan ;
Chang, Chin-Chen .
DATA & KNOWLEDGE ENGINEERING, 2008, 64 (01) :198-217
[9]   A new mining approach for uncertain databases using CUFP trees [J].
Lin, Chun-Wei ;
Hong, Tzung-Pei .
EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (04) :4084-4093
[10]   An effective tree structure for mining high utility itemsets [J].
Lin, Chun-Wei ;
Hong, Tzung-Pei ;
Lu, Wen-Hsiang .
EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (06) :7419-7424