Approximate mining of maximal frequent itemsets in data streams with different window models

Cited by: 10
Authors
Li, Hua-Fu [1]
Lee, Suh-Yin [2]
Affiliations
[1] Kainan Univ, Dept Comp Sci, Tao Yuan 338, Taiwan
[2] Natl Tsing Hua Univ, Dept Comp Sci, Hsinchu 300, Taiwan
Keywords
data mining; data streams; maximal frequent itemsets; one-pass mining; approximate mining
DOI
10.1016/j.eswa.2007.07.046
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
A data stream is a massive, open-ended sequence of data elements continuously generated at a rapid rate. Mining data streams is more difficult than mining static databases because of the huge, high-speed and continuous nature of streaming data. In this paper, we propose a new one-pass algorithm called DSM-MFI (Data Stream Mining for Maximal Frequent Itemsets), which mines the set of all maximal frequent itemsets in landmark windows over data streams. A new summary data structure called the summary frequent itemset forest (abbreviated as SFI-forest) is developed for incrementally maintaining the essential information about maximal frequent itemsets embedded in the stream so far. Theoretical analysis and experimental studies show that the proposed algorithm is efficient and scalable for mining the set of all maximal frequent itemsets over the entire history of the data streams. (c) 2007 Elsevier Ltd. All rights reserved.
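To make the terms in the abstract concrete, the following is a minimal sketch of what "maximal frequent itemsets over a landmark window" means: an itemset is frequent if it appears in at least a given fraction of all transactions seen since the landmark, and maximal if no proper superset is also frequent. This is an exact, brute-force illustration written for this record, not the DSM-MFI algorithm or the SFI-forest structure, neither of which is specified here. The function name, the `min_support` parameter, and the sample stream are all assumptions.

```python
from collections import Counter
from itertools import combinations


def maximal_frequent_itemsets(transactions, min_support):
    """Exact, brute-force illustration (hypothetical helper, not DSM-MFI).

    Counts every non-empty subset of every transaction in a single pass,
    so it is exponential in transaction length and only suits tiny inputs.
    """
    counts = Counter()
    n = 0
    # Single pass over the stream, starting from the landmark (the first element).
    for transaction in transactions:
        n += 1
        items = sorted(set(transaction))
        for k in range(1, len(items) + 1):
            for subset in combinations(items, k):
                counts[frozenset(subset)] += 1

    # An itemset is frequent if its count reaches min_support * n.
    threshold = min_support * n
    frequent = {s for s, c in counts.items() if c >= threshold}

    # An itemset is maximal if no proper superset of it is also frequent.
    maximal = [s for s in frequent if not any(s < other for other in frequent)]
    return sorted(sorted(s) for s in maximal)


# Assumed toy stream of four transactions.
stream = [["a", "b", "c"], ["a", "b"], ["a", "c"], ["b", "d"]]
print(maximal_frequent_itemsets(stream, min_support=0.5))
# prints [['a', 'b'], ['a', 'c']]
```

In this toy stream the itemsets {a}, {b}, {c}, {a, b} and {a, c} each appear in at least two of the four transactions, and only {a, b} and {a, c} have no frequent proper superset. A streaming method such as the one the abstract describes would avoid enumerating all subsets by maintaining a compact summary instead.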
Pages: 781-789
Page count: 9
Related papers
50 in total
  • [21] Mining frequent itemsets in data streams within a time horizon
    Troiano, Luigi
    Scibelli, Giacomo
    DATA & KNOWLEDGE ENGINEERING, 2014, 89 : 21 - 37
  • [22] Sliding window based weighted maximal frequent pattern mining over data streams
    Lee, Gangin
    Yun, Unil
    Ryu, Keun Ho
    EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (02) : 694 - 708
  • [23] Mining Frequent Itemsets in Data Streams Based on Genetic Algorithm
    Han, Chong
    Sun, Lijuan
    Guo, Jian
    Chen, Xiaodong
    2013 15TH IEEE INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY (ICCT), 2013, : 748 - 753
  • [24] New Policy of Maximal Frequent Itemsets in Data Stream Mining
    Xu, ChongHuan
    Ju, ChunHua
    ADVANCED MECHANICAL ENGINEERING, PTS 1 AND 2, 2010, 26-28 : 118 - +
  • [25] A novel approach for data stream maximal frequent itemsets mining
    Xu, Chong-Huan
    Inderscience Enterprises Ltd., Geneva, Switzerland, (10) : 224 - 231
  • [26] Distributed mining of maximal frequent itemsets on a Data Grid system
    Luo, Congnan
    Pereira, Anil L.
    Chung, Soon M.
    JOURNAL OF SUPERCOMPUTING, 2006, 37 (01) : 71 - 90
  • [27] A sliding window algorithm for mining frequent itemsets on data stream
    Liu, Junqiang
    Li, Xiurong
    DCABES 2006 PROCEEDINGS, VOLS 1 AND 2, 2006, : 637 - 639
  • [28] Uncertain Frequent Itemsets Mining Algorithm on Data Streams with Constraints
    Yu, Qun
    Tang, Ke-Ming
    Tang, Shi-Xi
    Lv, Xin
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2016, 2016, 9937 : 192 - 201
  • [29] Distributed Mining of Maximal Frequent Itemsets on a Data Grid System
    Congnan Luo
    Anil L. Pereira
    Soon M. Chung
    The Journal of Supercomputing, 2006, 37 : 71 - 90
  • [30] Efficient strategies for incremental mining of frequent closed itemsets over data streams
    Liu, Junqiang
    Ye, Zhousheng
    Yang, Xiangcai
    Wang, Xueling
    Shen, Linjie
    Jiang, Xiaoning
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 191