Sliding Window- based Frequent Itemsets Mining over Data Streams using Tail Pointer Table

被引:4
|
作者
Wang, Le [1 ,2 ,3 ]
Feng, Lin [1 ,2 ]
Jin, Bo [1 ,2 ]
机构
[1] Dalian Univ Technol, Sch Comp Sci & Technol, Dalian 116024, Peoples R China
[2] Dalian Univ Technol, Sch Innovat Expt, Dalian 116024, Peoples R China
[3] Ningbo Dahongying Univ, Sch Informat Engn, Ningbo 315175, Zhejiang, Peoples R China
关键词
data mining; data streams; frequent itemsets; sliding window; tail pointer table;
D O I
10.1080/18756891.2013.859860
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Mining frequent itemsets over transaction data streams is critical for many applications, such as wireless sensor networks, analysis of retail market data, and stock market predication. The sliding window method is an important way of mining frequent itemsets over data streams. The speed of the sliding window is affected not only by the efficiency of the mining algorithm, but also by the efficiency of updating data. In this paper, we propose a new data structure with a Tail Pointer Table and a corresponding mining algorithm; we also propose a algorithm COFI2, a revised version of the frequent itemsets mining algorithm COFI (Co-Occurrence Frequent-Item), to reduce the temporal and memory requirements. Further, theoretical analysis and experiments are carried out to prove their effectiveness.
引用
收藏
页码:25 / 36
页数:12
相关论文
共 50 条
  • [21] A sliding window method for finding recently frequent itemsets over Online data streams
    Chang, JH
    Lee, WS
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2004, 20 (04) : 753 - 762
  • [22] Research on Data stream Mining Algorithm for Frequent Itemsets Based on Sliding Window Model
    Wang, Hongmei
    Li, Fentian
    Tang, Dongkai
    Wang, Zeru
    2017 IEEE 2ND INTERNATIONAL CONFERENCE ON BIG DATA ANALYSIS (ICBDA), 2017, : 264 - 268
  • [23] An adaptive approximation method to discover frequent itemsets over sliding-window-based data streams
    Li, Chao-Wei
    Jea, Kuen-Fang
    EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (10) : 13386 - 13404
  • [24] Efficient strategies for incremental mining of frequent closed itemsets over data streams
    Liu, Junqiang
    Ye, Zhousheng
    Yang, Xiangcai
    Wang, Xueling
    Shen, Linjie
    Jiang, Xiaoning
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 191
  • [25] A Fast Algorithm for Mining Frequent Closed Itemsets over Stream Sliding Window
    Yen, Show-Jane
    Wu, Cheng-Wei
    Lee, Yue-Shi
    Tseng, Vincent S.
    Hsieh, Chaur-Heh
    IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ 2011), 2011, : 996 - 1002
  • [26] Approximate mining of maximal frequent itemsets in data streams with different window models
    Li, Hua-Fu
    Lee, Suh-Yin
    EXPERT SYSTEMS WITH APPLICATIONS, 2008, 35 (03) : 781 - 789
  • [27] Fast Mining of Closed Frequent Itemsets in Data Streams
    Mao Yimin
    Chen Zhigang
    Liu Lixin
    INFORMATION TECHNOLOGY APPLICATIONS IN INDUSTRY, PTS 1-4, 2013, 263-266 : 231 - +
  • [28] An Efficient Algorithm for Sliding Window-Based Weighted Frequent Pattern Mining over Data Streams
    Ahmed, Chowdhury Farhan
    Tanbeer, Syed Khairuzzaman
    Jeong, Byeong-Soo
    Lee, Young-Koo
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2009, E92D (07): : 1369 - 1381
  • [29] An efficient algorithm for mining maximal frequent itemsets over data streams
    Mao Y.
    Yang L.
    Li H.
    Chen Z.
    Liu L.
    Gaojishu Tongxin/Chinese High Technology Letters, 2010, 20 (03): : 246 - 252
  • [30] A Mining Maximal Frequent Itemsets over the Entire History of Data Streams
    Mao, Yinmin
    Li, Hong
    Yang, Lumin
    Chen, Zhigang
    Liu, Lixin
    FIRST INTERNATIONAL WORKSHOP ON DATABASE TECHNOLOGY AND APPLICATIONS, PROCEEDINGS, 2009, : 413 - 417