A time- and memory-efficient frequent itemset discovering algorithm for association rule mining

Cited by: 1

Authors:

Ivancsy, Renata [1,2]

Vajk, Istvan [1,2]

Affiliations:

[1] Budapest Univ Technol & Econ, Dept Automat & Appl, 3 Goldmann Gy Ter, H-1111 Budapest, Hungary

[2] HAS BUTE Control Res Grp, H-1111 Budapest, Hungary

Source:

INTERNATIONAL JOURNAL OF COMPUTER APPLICATIONS IN TECHNOLOGY | 2006 / Vol. 27 / No. 04

Keywords:

association rule mining; frequent itemset; Apriori algorithm; FP-growth algorithm

DOI:

10.1504/IJCAT.2006.011998

Chinese Library Classification (CLC) number:

TP39 [Computer applications]

Discipline classification code:

081203; 0835

Abstract:

Frequent itemset discovering is a highly researched area in the field of data mining. The algorithms dealing with this problem have several advantages and disadvantages regarding their time complexity, I/O cost and memory requirement. There are algorithms that have moderate memory usage but high I/O cost, thus the execution time of them is high; such methods are for example the level-wise algorithms. Other methods have advantageous time behaviour; however, they are memory intensive, like the two-phase algorithms. In this paper, a novel algorithm, which is efficient both in time and memory, is proposed. The new algorithm discovers the small frequent itemsets quickly by taking advantage of the easy indexing opportunity of the suggested candidate storage structure. The main benefit of the novel algorithm is its advantageous time behaviour when using different types of datasets as well as its low I/O activity and moderate memory requirement.

Pages: 270-280

Page count: 11
