Approximate mining of maximal frequent itemsets in data streams with different window models

被引：10

作者：

Li, Hua-Fu ^{[1
]}

Lee, Suh-Yin ^{[2
]}

机构：

[1] Kainan Univ, Dept Comp Sci, Tao Yuan 338, Taiwan

[2] Natl Tsing Hua Univ, Dept Comp Sci, Hsinchu 300, Taiwan

来源：

EXPERT SYSTEMS WITH APPLICATIONS | 2008年 / 35卷 / 03期

关键词：

data mining; data streams; maximal frequent itemsets; one-pass mining; approximate mining;

D O I：

10.1016/j.eswa.2007.07.046

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

A data stream is a massive, open-ended sequence of data elements continuously generated at a rapid rate. Mining data streams is more difficult than mining static databases because the huge, high-speed and continuous characteristics of streaming data. In this paper, we propose a new one-pass algorithm called DSM-MFI (stands for Data Stream Mining for Maximal Frequent Itemsets), which mines the set of all maximal frequent itemsets in landmark windows over data streams. A new summary data structure called summary frequent itemset forest (abbreviated as SFI-forest) is developed for incremental maintaining the essential information about maximal frequent itemsets embedded in the stream so far. Theoretical analysis and experimental studies show that the proposed algorithm is efficient and scalable for mining the set of all maximal frequent itemsets over the entire history of the data streams. (c) 2007 Elsevier Ltd. All rights reserved.

引用

页码：781 / 789

页数：9

共 50 条

[31] DSM-FI: an efficient algorithm for mining frequent itemsets in data streams
Hua-Fu Li
Man-Kwan Shan
Suh-Yin Lee
Knowledge and Information Systems, 2008, 17 : 79 - 97
[32] On the design of hardware-software architectures for frequent itemsets mining on data streams
Lázaro Bustio-Martínez
René Cumplido
Raudel Hernández-León
José M. Bande-Serrano
Claudia Feregrino-Uribe
Journal of Intelligent Information Systems, 2018, 50 : 415 - 440
[33] On the design of hardware-software architectures for frequent itemsets mining on data streams
Bustio-Martinez, Lazaro
Cumplido, Rene
Hernandez-Leon, Raudel
Bande-Serrano, Jose M.
Feregrino-Uribe, Claudia
JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2018, 50 (03) : 415 - 440
[34] DSM-FI: an efficient algorithm for mining frequent itemsets in data streams
Li, Hua-Fu
Shan, Man-Kwan
Lee, Suh-Yin
KNOWLEDGE AND INFORMATION SYSTEMS, 2008, 17 (01) : 79 - 97
[35] Mining maximal frequent itemsets by a boolean based approach
Salleb, A
Maazouzi, Z
Vrain, C
ECAI 2002: 15TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2002, 77 : 385 - 389
[36] Mining Frequent Itemsets from Online Data Streams: Comparative Study
Nabil, HebaTallah Mohamed
Eldin, Ahmed Sharaf
Belal, Mohamed Abd El-Fattah
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2013, 4 (07) : 117 - 125
[37] An Efficient Frequent Closed Itemsets Mining Algorithm Over Data Streams
Tan, Jun
Bu, Yingyong
Yang, Bo
2009 INTERNATIONAL CONFERENCE ON INFORMATION MANAGEMENT, INNOVATION MANAGEMENT AND INDUSTRIAL ENGINEERING, VOL 3, PROCEEDINGS, 2009, : 65 - +
[38] An Efficient Frequent Closed Itemsets Mining Algorithm Over Data Streams
Tan, Jun
Yu, Shao-jun
2011 SECOND INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATION AND EDUCATION APPLICATION (ICEA 2011), 2011, : 197 - 201
[39] Mining recent frequent itemsets in sliding windows over data streams
Congying Han
Lijun Xu
Guoping He
COMPUTING AND INFORMATICS, 2008, 27 (03) : 315 - 339
[40] Efficient Data Streams Based Closed Frequent Itemsets Mining Algorithm
Tan, Jun
ADVANCES IN CIVIL ENGINEERING II, PTS 1-4, 2013, 256-259 : 2910 - 2913

← 1 2 3 4 5 →