An Algorithm for Fast Mining Top-rank-k Frequent Patterns based on Node-list Data Structure

被引:1
作者
Wang, Qian [1 ,2 ,3 ]
Ren, Jiadong [1 ,2 ]
Davis, Darryl N. [3 ]
Cheng, Yongqiang [3 ]
机构
[1] Yanshan Univ, Coll Informat Sci & Engn, Qinhuangdao, Hebei, Peoples R China
[2] Comp Virtual Technol & Syst Integrat Lab Hebei Pr, Qinhuangdao, Hebei, Peoples R China
[3] Univ Hull, Dept Comp Sci, Kingston Upon Hull, N Humberside, England
基金
中国国家自然科学基金;
关键词
Data mining; frequent pattern; top-rank-k frequent pattern; FTPP-tree; Node-list; EFFICIENT; ITEMSETS; TREE;
D O I
10.1080/10798587.2017.1340135
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Frequent pattern mining usually requires much run time and memory usage. In some applications, only the patterns with top frequency rank are needed. Because of the limited pattern numbers, quality of the results is even more important than time and memory consumption. A Frequent Pattern algorithm for mining Top-rank-K patterns, FP_TopK, is proposed. It is based on a Node-list data structure extracted from FTPP-tree. Each node is with one or more triple sets, which contain supports, preorder and postorder transversal orders for candidate pattern generation and top-rank-k frequent pattern mining. FP_TopK uses the minimal support threshold for pruning strategy to guarantee that each pattern in the top-rank-k table is really frequent and this further improves the efficiency. Experiments are conducted to compare FP_TopK with iNTK and BTK on four datasets. The results show that FP_TopK achieves better performance.
引用
收藏
页码:399 / 404
页数:6
相关论文
共 23 条
[1]  
Agrawal R., P 20 INT C VERY LARG
[2]  
Ahmed T. S., 2008, INT J SOFT COMPUTING, V3, P195
[3]   INTELLIGENT LOAD CONTROL FOR FREQUENCY REGULATION IN MICROGRIDS [J].
Awad, Bieshoy ;
Ekanayake, Janaka ;
Jenkins, Nick .
INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2010, 16 (02) :303-318
[4]   A novel approach for mining maximal frequent patterns [J].
Bay Vo ;
Sang Pham ;
Tuong Le ;
Deng, Zhi-Hong .
EXPERT SYSTEMS WITH APPLICATIONS, 2017, 73 :178-186
[5]   An effective approach for maintenance of pre-large-based frequent-itemset lattice in incremental mining [J].
Bay Vo ;
Tuong Le ;
Hong, Tzung-Pei ;
Bac Le .
APPLIED INTELLIGENCE, 2014, 41 (03) :759-775
[6]   A lattice-based approach for mining most generalization association rules [J].
Bay Vo ;
Hong, Tzung-Pei ;
Le, Bac .
KNOWLEDGE-BASED SYSTEMS, 2013, 45 :20-30
[7]   Analysis of healthcare quality indicator using data mining and decision support system [J].
Chae, YM ;
Kim, HS ;
Tark, KC ;
Park, HJ ;
Ho, SH .
EXPERT SYSTEMS WITH APPLICATIONS, 2003, 24 (02) :167-172
[8]  
Deng ZH, 2007, PROCEEDINGS OF 2007 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, P851
[9]   DiffNodesets: An efficient structure for fast mining frequent itemsets [J].
Deng, Zhi-Hong .
APPLIED SOFT COMPUTING, 2016, 41 :214-223
[10]   Fast mining Top-Rank-k frequent patterns by using Node-lists [J].
Deng, Zhi-Hong .
EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (04) :1763-1768