A time- and memory-efficient frequent itemset discovering algorithm for association rule mining

被引:1
作者
Ivancsy, Renata [1 ,2 ]
Vajk, Istvan [1 ,2 ]
机构
[1] Budapest Univ Technol & Econ, Dept Automat & Appl, 3 Goldmann Gy Ter, H-1111 Budapest, Hungary
[2] HAS BUTE Control Res Grp, H-1111 Budapest, Hungary
关键词
association rule mining; frequent itemset; Apriori algorithm; FP-growth algorithm;
D O I
10.1504/IJCAT.2006.011998
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Frequent itemset discovering is a highly researched area in the field of data mining. The algorithms dealing with this problem have several advantages and disadvantages regarding their time complexity, I/O cost and memory requirement. There are algorithms that have moderate memory usage but high I/O cost, thus the execution time of them is high; such methods are for example the level-wise algorithms. Other methods have advantageous time behaviour; however, they are memory intensive, like the two-phase algorithms. In this paper, a novel algorithm, which is efficient both in time and memory, is proposed. The new algorithm discovers the small frequent itemsets quickly by taking advantage of the easy indexing opportunity of the suggested candidate storage structure. The main benefit of the novel algorithm is its advantageous time behaviour when using different types of datasets as well as its low I/O activity and moderate memory requirement.
引用
收藏
页码:270 / 280
页数:11
相关论文
共 50 条
[1]   A Heuristic Rule based Approximate Frequent Itemset Mining Algorithm [J].
Li, Haifeng ;
Zhang, Yuejin ;
Zhang, Ning ;
Jia, Hengyue .
PROMOTING BUSINESS ANALYTICS AND QUANTITATIVE MANAGEMENT OF TECHNOLOGY: 4TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND QUANTITATIVE MANAGEMENT (ITQM 2016), 2016, 91 :324-333
[2]   New Spark solutions for distributed frequent itemset and association rule mining algorithms [J].
Fernandez-Basso, Carlos ;
Ruiz, M. Dolores ;
Martin-Bautista, Maria J. .
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2024, 27 (02) :1217-1234
[3]   New Spark solutions for distributed frequent itemset and association rule mining algorithms [J].
Carlos Fernandez-Basso ;
M. Dolores Ruiz ;
Maria J. Martin-Bautista .
Cluster Computing, 2024, 27 :1217-1234
[4]   AT-Mine: An Efficient Algorithm of Frequent Itemset Mining on Uncertain Dataset [J].
Wang, Le ;
Feng, Lin ;
Wu, Mingfei .
JOURNAL OF COMPUTERS, 2013, 8 (06) :1417-1426
[5]   HashEclat: an efficient frequent itemset algorithm [J].
Chunkai Zhang ;
Panbo Tian ;
Xudong Zhang ;
Qing Liao ;
Zoe L. Jiang ;
Xuan Wang .
International Journal of Machine Learning and Cybernetics, 2019, 10 :3003-3016
[6]   HashEclat: an efficient frequent itemset algorithm [J].
Zhang, Chunkai ;
Tian, Panbo ;
Zhang, Xudong ;
Liao, Qing ;
Jiang, Zoe L. ;
Wang, Xuan .
INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2019, 10 (11) :3003-3016
[7]   MapReduce Based Frequent Itemset Mining Algorithm on Stream Data [J].
Chaudhary, Hemant ;
Yadav, Deepak Kumar ;
Bhatnagar, Rajat ;
Chandrasekhar, Uddagiri .
2015 GLOBAL CONFERENCE ON COMMUNICATION TECHNOLOGIES (GCCT), 2015, :586-591
[8]   A data mining proxy approach for efficient frequent itemset mining [J].
Yu, Jeffrey Xu ;
Li, Zhiheng ;
Liu, Guimei .
VLDB JOURNAL, 2008, 17 (04) :947-970
[9]   A data mining proxy approach for efficient frequent itemset mining [J].
Jeffrey Xu Yu ;
Zhiheng Li ;
Guimei Liu .
The VLDB Journal, 2008, 17 :947-970
[10]   Efficient frequent itemset mining methods over time-sensitive streams [J].
Li, Haifeng ;
Zhang, Ning ;
Zhu, Jianming ;
Cao, Huaihu ;
Wang, Yue .
KNOWLEDGE-BASED SYSTEMS, 2014, 56 :281-298