Sliding window-based frequent pattern mining over data streams

Cited by: 114
Authors
Tanbeer, Syed Khairuzzaman [1]
Ahmed, Chowdhury Farhan [1]
Jeong, Byeong-Soo [1]
Lee, Young-Koo [1]
Affiliations
[1] Kyung Hee Univ, Dept Comp Engn, Youngin Si 446701, Gyeonggi Do, South Korea
Keywords
Frequent pattern; Data stream; Sliding window; Tree restructuring; ASSOCIATION RULES; EFFICIENT ALGORITHM; ITEMSETS; DISCOVERY
DOI
10.1016/j.ins.2009.07.012
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Finding frequent patterns in a continuous stream of transactions is critical for many applications such as retail market data analysis, network monitoring, web usage mining, and stock market prediction. Even though numerous frequent pattern mining algorithms have been developed over the past decade, new solutions for handling stream data are still required due to the continuous, unbounded, and ordered sequence of data elements generated at a rapid rate in a data stream. Therefore, extracting frequent patterns from more recent data can enhance the analysis of stream data. In this paper, we propose an efficient technique to discover the complete set of recent frequent patterns from a high-speed data stream over a sliding window. We develop a Compact Pattern Stream tree (CPS-tree) to capture the recent stream data content and efficiently remove the obsolete, old stream data content. We also introduce the concept of dynamic tree restructuring in our CPS-tree to produce a highly compact frequency-descending tree structure at runtime. The complete set of recent frequent patterns is obtained from the CPS-tree of the current window using an FP-growth mining technique. Extensive experimental analyses show that our CPS-tree is highly efficient in terms of memory and time complexity when finding recent frequent patterns from a high-speed data stream. (c) 2009 Elsevier Inc. All rights reserved.
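The sliding-window semantics described in the abstract can be illustrated with a small sketch. This is not the CPS-tree: it is a naive baseline that keeps the most recent transactions in a queue, evicts the oldest one when the window is full, and enumerates itemsets by brute force instead of running FP-growth on a tree. The class name `SlidingWindowMiner`, its parameters `window_size` and `min_support`, and the sample transactions are all assumptions made for illustration.

```python
from collections import Counter, deque
from itertools import combinations


class SlidingWindowMiner:
    """Naive sliding-window frequent itemset miner (illustrative, not the CPS-tree)."""

    def __init__(self, window_size, min_support):
        self.window_size = window_size  # number of transactions kept in the window
        self.min_support = min_support  # absolute support threshold
        self.window = deque()           # transactions in the current window, oldest first
        self.item_counts = Counter()    # per-item support within the current window

    def add(self, transaction):
        """Insert a new transaction; evict the oldest one if the window overflows."""
        items = frozenset(transaction)
        self.window.append(items)
        self.item_counts.update(items)
        if len(self.window) > self.window_size:
            old = self.window.popleft()
            self.item_counts.subtract(old)
            for item in old:
                if self.item_counts[item] == 0:
                    del self.item_counts[item]  # forget items no longer in the window

    def frequency_descending_order(self):
        """Items sorted by descending support (ties broken by item name)."""
        return sorted(self.item_counts, key=lambda i: (-self.item_counts[i], i))

    def frequent_patterns(self):
        """Brute-force enumeration of all itemsets meeting min_support in the window."""
        frequent_items = {i for i, c in self.item_counts.items() if c >= self.min_support}
        counts = Counter()
        for transaction in self.window:
            kept = sorted(transaction & frequent_items)  # prune infrequent items first
            for k in range(1, len(kept) + 1):
                for combo in combinations(kept, k):
                    counts[combo] += 1
        return {p: c for p, c in counts.items() if c >= self.min_support}


# Hypothetical stream of four transactions with a window of three.
miner = SlidingWindowMiner(window_size=3, min_support=2)
for t in [["a", "b"], ["a", "c"], ["a", "b", "c"], ["b", "c"]]:
    miner.add(t)

print(miner.frequency_descending_order())  # ['c', 'a', 'b']
print(miner.frequent_patterns())
```

After the fourth transaction arrives, the first one (`a, b`) has left the window, so the pattern `(a, b)` is no longer frequent. The brute-force enumeration is exponential in transaction length; the paper's contribution is precisely a compact tree structure that avoids this cost.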
Pages: 3843-3865
Page count: 23
Related papers
50 in total
  • [1] An Efficient Algorithm for Sliding Window-Based Weighted Frequent Pattern Mining over Data Streams
    Ahmed, Chowdhury Farhan
    Tanbeer, Syed Khairuzzaman
    Jeong, Byeong-Soo
    Lee, Young-Koo
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2009, E92D (07): 1369-1381
  • [2] A Sliding Window-Based Approach for Mining Frequent Weighted Patterns Over Data Streams
    Bui, Huong
    Nguyen-Hoang, Tu-Anh
    Vo, Bay
    Nguyen, Ham
    Le, Tuong
    IEEE ACCESS, 2021, 9: 56318-56329
  • [3] Sliding window based weighted maximal frequent pattern mining over data streams
    Lee, Gangin
    Yun, Unil
    Ryu, Keun Ho
    EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (02): 694-708
  • [4] Sliding Window-based Frequent Itemsets Mining over Data Streams using Tail Pointer Table
    Le Wang
    Lin Feng
    Bo Jin
    International Journal of Computational Intelligence Systems, 2014, 7: 25-36
  • [5] Frequent pattern mining algorithm for uncertain data streams based on sliding window
    Yang, Junrui
    Yang, Cai
    Wei, Yanjun
    2016 8TH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS (IHMSC), VOL. 2, 2016: 265-268
  • [6] A Variable Sliding Window Algorithm Based on Concept Drift for Frequent Pattern Mining Over Data Streams
    Yin, Yue
    Li, Peng
    Chen, Jing
    2022 IEEE 28TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS, ICPADS, 2022: 818-825
  • [7] EclatDS: An efficient sliding window based frequent pattern mining method for data streams
    Deypir, Mahmood
    Sadreddini, Mohammad Hadi
    INTELLIGENT DATA ANALYSIS, 2011, 15 (04): 571-587
  • [8] Mining frequent patterns in an arbitrary sliding window over data streams
    Li, Guohui
    Chen, Hui
    Yang, Bing
    Chen, Gang
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, 2008, 4947: 496-503
  • [9] Mining maximal frequent itemsets in a sliding window over data streams
    Mao Y.
    Li H.
    Yang L.
    Liu L.
    Gaojishu Tongxin/Chinese High Technology Letters, 2010, 20 (11): 1142-1148
  • [10] A sliding window based algorithm for frequent closed itemset mining over data streams
    Nori, Fatemeh
    Deypir, Mahmood
    Sadreddini, Mohamad Hadi
    JOURNAL OF SYSTEMS AND SOFTWARE, 2013, 86 (03): 615-623