A time- and memory-efficient frequent itemset discovering algorithm for association rule mining

被引:1
|
作者
Ivancsy, Renata [1 ,2 ]
Vajk, Istvan [1 ,2 ]
机构
[1] Budapest Univ Technol & Econ, Dept Automat & Appl, 3 Goldmann Gy Ter, H-1111 Budapest, Hungary
[2] HAS BUTE Control Res Grp, H-1111 Budapest, Hungary
关键词
association rule mining; frequent itemset; Apriori algorithm; FP-growth algorithm;
D O I
10.1504/IJCAT.2006.011998
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Frequent itemset discovering is a highly researched area in the field of data mining. The algorithms dealing with this problem have several advantages and disadvantages regarding their time complexity, I/O cost and memory requirement. There are algorithms that have moderate memory usage but high I/O cost, thus the execution time of them is high; such methods are for example the level-wise algorithms. Other methods have advantageous time behaviour; however, they are memory intensive, like the two-phase algorithms. In this paper, a novel algorithm, which is efficient both in time and memory, is proposed. The new algorithm discovers the small frequent itemsets quickly by taking advantage of the easy indexing opportunity of the suggested candidate storage structure. The main benefit of the novel algorithm is its advantageous time behaviour when using different types of datasets as well as its low I/O activity and moderate memory requirement.
引用
收藏
页码:270 / 280
页数:11
相关论文
共 50 条
  • [1] A Heuristic Rule based Approximate Frequent Itemset Mining Algorithm
    Li, Haifeng
    Zhang, Yuejin
    Zhang, Ning
    Jia, Hengyue
    PROMOTING BUSINESS ANALYTICS AND QUANTITATIVE MANAGEMENT OF TECHNOLOGY: 4TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND QUANTITATIVE MANAGEMENT (ITQM 2016), 2016, 91 : 324 - 333
  • [2] New Spark solutions for distributed frequent itemset and association rule mining algorithms
    Fernandez-Basso, Carlos
    Ruiz, M. Dolores
    Martin-Bautista, Maria J.
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2024, 27 (02): : 1217 - 1234
  • [3] New Spark solutions for distributed frequent itemset and association rule mining algorithms
    Carlos Fernandez-Basso
    M. Dolores Ruiz
    Maria J. Martin-Bautista
    Cluster Computing, 2024, 27 : 1217 - 1234
  • [4] AT-Mine: An Efficient Algorithm of Frequent Itemset Mining on Uncertain Dataset
    Wang, Le
    Feng, Lin
    Wu, Mingfei
    JOURNAL OF COMPUTERS, 2013, 8 (06) : 1417 - 1426
  • [5] HashEclat: an efficient frequent itemset algorithm
    Zhang, Chunkai
    Tian, Panbo
    Zhang, Xudong
    Liao, Qing
    Jiang, Zoe L.
    Wang, Xuan
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2019, 10 (11) : 3003 - 3016
  • [6] HashEclat: an efficient frequent itemset algorithm
    Chunkai Zhang
    Panbo Tian
    Xudong Zhang
    Qing Liao
    Zoe L. Jiang
    Xuan Wang
    International Journal of Machine Learning and Cybernetics, 2019, 10 : 3003 - 3016
  • [7] MapReduce Based Frequent Itemset Mining Algorithm on Stream Data
    Chaudhary, Hemant
    Yadav, Deepak Kumar
    Bhatnagar, Rajat
    Chandrasekhar, Uddagiri
    2015 GLOBAL CONFERENCE ON COMMUNICATION TECHNOLOGIES (GCCT), 2015, : 586 - 591
  • [8] A data mining proxy approach for efficient frequent itemset mining
    Jeffrey Xu Yu
    Zhiheng Li
    Guimei Liu
    The VLDB Journal, 2008, 17 : 947 - 970
  • [9] A data mining proxy approach for efficient frequent itemset mining
    Yu, Jeffrey Xu
    Li, Zhiheng
    Liu, Guimei
    VLDB JOURNAL, 2008, 17 (04): : 947 - 970
  • [10] Efficient frequent itemset mining methods over time-sensitive streams
    Li, Haifeng
    Zhang, Ning
    Zhu, Jianming
    Cao, Huaihu
    Wang, Yue
    KNOWLEDGE-BASED SYSTEMS, 2014, 56 : 281 - 298