A Fast Approach for Up-Scaling Frequent Itemsets

被引:6
作者
Chen, Runzi [1 ]
Zhao, Shuliang [2 ,3 ,4 ]
Liu, Mengmeng [5 ]
机构
[1] Hebei Normal Univ, Coll Math & Informat Sci, Shijiazhuang 050024, Hebei, Peoples R China
[2] Hebei Normal Univ, Coll Comp & Cyber Secur, Shijiazhuang 050024, Hebei, Peoples R China
[3] Hebei Prov Key Lab Network & Informat Secur, Shijiazhuang 050024, Hebei, Peoples R China
[4] Hebei Prov Engn Res Ctr Supply Chain Big Data Ana, Shijiazhuang 050024, Hebei, Peoples R China
[5] Zhangjiakou Univ, Lib, Zhangjiakou 075000, Peoples R China
关键词
Itemsets; Data mining; Partitioning algorithms; Memory management; Data structures; Task analysis; Up-scaling; up-scaling frequent itemsets; frequent itemset mining; data mining; POWER SETS; GENERATION; ALGORITHMS;
D O I
10.1109/ACCESS.2020.2995719
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the rapid growth of data scale and diversification of demand, people have an urgent desire to extract useful frequent itemset from datasets of different scales. It is no doubt that the traditional method can solve the problem. However, the relationships among datasets of different scales are not fully utilized. A fast approach proposed in this paper is as follows: the frequent itemsets on the large-scale data are directly inferred based on the frequent itemsets that are belonged small-scale datasets, instead of mined from the large-scale dataset again on condition that the frequent itemsets on the small-scale datasets have been mined. We conduct extensive experiments on one synthetic data and four UCI data sets. The experimental results show that our algorithm is significantly faster and consumes less memory than these leading algorithms.
引用
收藏
页码:97141 / 97151
页数:11
相关论文
共 27 条
[1]  
Agrawal R., 1993, SIGMOD Record, V22, P207, DOI 10.1145/170036.170072
[2]  
[Anonymous], P 20 VLDB C SANT CHI
[3]   negFIN: An efficient algorithm for fast mining frequent itemsets [J].
Aryabarzan, Nader ;
Minaei-Bidgoli, Behrouz ;
Teshnehlab, Mohammad .
EXPERT SYSTEMS WITH APPLICATIONS, 2018, 105 :129-143
[4]   An efficient pattern growth approach for mining fault tolerant frequent itemsets [J].
Bashir, Shariq .
EXPERT SYSTEMS WITH APPLICATIONS, 2020, 143
[5]   MAFIA: A maximal frequent itemset algorithm [J].
Burdick, D ;
Calimlim, M ;
Flannick, J ;
Gehrke, J ;
Yiu, TM .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2005, 17 (11) :1490-1504
[6]   DiffNodesets: An efficient structure for fast mining frequent itemsets [J].
Deng, Zhi-Hong .
APPLIED SOFT COMPUTING, 2016, 41 :214-223
[7]   Frequent Itemset Mining in Big Data With Effective Single Scan Algorithms [J].
Djenouri, Youcef ;
Djenouri, Djamel ;
Lin, Jerry Chun-Wei ;
Belhadi, Asma .
IEEE ACCESS, 2018, 6 :68013-68026
[8]   An Efficient Framework for Mining Association Rules in the Distributed Databases [J].
Goyal, Lalit Mohan ;
Beg, M. M. Sufyan ;
Ahmad, Tanvir .
COMPUTER JOURNAL, 2018, 61 (05) :645-657
[9]   Fast algorithms for frequent itemset mining using FP-trees [J].
Grahne, G ;
Zhu, JF .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2005, 17 (10) :1347-1362
[10]   Mining frequent patterns without candidate generation: A frequent-pattern tree approach [J].
Han, JW ;
Pei, J ;
Yin, YW ;
Mao, RY .
DATA MINING AND KNOWLEDGE DISCOVERY, 2004, 8 (01) :53-87