Incremental Association Rule Mining With a Fast Incremental Updating Frequent Pattern Growth Algorithm

被引:25
作者
Thurachon, Wannasiri [1 ]
Kreesuradej, Worapoj [1 ]
机构
[1] King Mongkuts Inst Technol Ladkrabang, Fac Informat Technol, Bangkok 10520, Thailand
关键词
Itemsets; Databases; Data mining; Heuristic algorithms; Partitioning algorithms; Maintenance engineering; Clustering algorithms; Association rule mining; data mining; FP-tree; FP-growth; FPISC-tree; frequent itemset mining; incremental association rule mining; MAINTENANCE; ITEMSETS; GENERATION; DISCOVERY; DATABASES; TREES;
D O I
10.1109/ACCESS.2021.3071777
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
One of the most challenging tasks in association rule mining is that when a new incremental database is added to an original database, some existing frequent itemsets may become infrequent itemsets and vice versa. As a result, some previous association rules may become invalid and some new association rules may emerge. We designed a new, more efficient approach for incremental association rule mining using a Fast Incremental Updating Frequent Pattern growth algorithm (FIUFP-Growth), a new Incremental Conditional Pattern tree (ICP-tree), and a compact sub-tree suitable for incremental mining of frequent itemsets. This algorithm retrieves previous frequent itemsets that have already been mined from the original database and their support counts then use them to efficiently mine frequent itemsets from the updated database and ICP-tree, reducing the number of rescans of the original database. Our algorithm reduced usages of resource and time for unnecessary sub-tree construction compared to individual FP- Growth, FUFP-tree maintenance, Pre-FUFP, and FCFPIM algorithms. From the results, at 3% minimum support threshold, the average execution time for pattern growth mining of our algorithm performs 46% faster than FP- Growth, FUFP-tree, Pre-FUFP, and FCFPIM. This approach to incremental association rule mining and our experimental findings may directly benefit designers and developers of computer business intelligence methods.
引用
收藏
页码:55726 / 55741
页数:16
相关论文
共 40 条
[1]   A tree projection algorithm for generation of frequent item sets [J].
Agarwal, RC ;
Aggarwal, CC ;
Prasad, VVV .
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2001, 61 (03) :350-371
[2]  
Agrawal R., 1993, SIGMOD Record, V22, P207, DOI 10.1145/170036.170072
[3]  
Agrawal R., 1994, VLDB, P487, DOI DOI 10.1016/J.INS.2006.09.018
[4]   Applications of association rule mining in health informatics: a survey [J].
Altaf, Wasif ;
Shahbaz, Muhammad ;
Guergachi, Aziz .
ARTIFICIAL INTELLIGENCE REVIEW, 2017, 47 (03) :313-340
[5]  
Amornchewin R, 2009, J UNIVERS COMPUT SCI, V15, P2409
[6]   Probability-based incremental association rule discovery algorithm [J].
Amornchewin, Ratchadaporn ;
Kreesuradej, Worapoj .
CSA 2008: INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE AND ITS APPLICATIONS, PROCEEDINGS, 2008, :212-215
[7]  
[Anonymous], 2018, INTRO DATA MINING
[8]  
Ariya Araya, 2013, 2013 IEEE 14th International Conference on Information Reuse & Integration (IRI), P432, DOI 10.1109/IRI.2013.6642503
[9]   An enhanced incremental association rule discovery with a lower minimum support [J].
Ariya, Araya ;
Kreesuradej, Worapoj .
ARTIFICIAL LIFE AND ROBOTICS, 2016, 21 (04) :466-477
[10]   Borders: An efficient algorithm for association generation in dynamic databases [J].
Aumann, Y ;
Feldman, R ;
Lipshtat, O ;
Manilla, H .
JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 1999, 12 (01) :61-73