A Novel Incremental Data Mining Algorithm based on FP-Growth for Big Data

被引:14
作者
Chang, Hong-Yi [1 ]
Lin, Jia-Chi [1 ]
Cheng, Mei-Li [1 ]
Huang, Shih-Chang [2 ]
机构
[1] Natl Chiayi Univ, Dept Management Informat Syst, Chiayi, Taiwan
[2] Natl Formosa Univ, Dept Comp Sci & Informat, Huwei Township, Yunlin, Taiwan
来源
PROCEEDINGS 2016 INTERNATIONAL CONFERENCE ON NETWORKING AND NETWORK APPLICATIONS NANA 2016 | 2016年
关键词
Incremental Data Mining Algorithm; Big Data FP-Growth; Cloud Computing;
D O I
10.1109/NaNA.2016.77
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Association rule mining is an important data analysis and data mining technique. With the advent of big data, new transaction data increase steadily, and thus the analysis results of association rule mining called frequent itemsets, should be updated over time. Mining the frequent itemsets has become an important issue in dynamic massive databases. To address the issue of incremental updating of frequent itemsets, an efficient algorithm is required for incremental data mining. In this paper, we propose a novel incremental data mining algorithm based on FP-Growth, using the concept of heap tree to address the issue of incremental updating of frequent itemsets. The proposed method retains the advantages of FP-Growth, and significantly reduces the complexity associated with re-mining frequent itemsets during incremental updating. The experimental results show that the proposed algorithm not only reduces the execution time substantially but also outperforms other algorithms.
引用
收藏
页码:375 / 378
页数:4
相关论文
共 12 条
  • [1] Agrawal R., 1993, SIGMOD Record, V22, P207, DOI 10.1145/170036.170072
  • [2] [Anonymous], 1994, P INT C VERY LARGE D
  • [3] [Anonymous], MINING GEN ASS RULES
  • [4] Generalized association rule mining with constraints
    Baralis, Elena
    Cagliero, Luca
    Cerquitelli, Tania
    Garza, Paolo
    [J]. INFORMATION SCIENCES, 2012, 194 : 68 - 84
  • [5] Maintenance of discovered association rules in large databases: Art incremental updating technique
    Cheung, DW
    Han, JW
    Ng, VT
    Wong, CY
    [J]. PROCEEDINGS OF THE TWELFTH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, 1996, : 106 - 114
  • [6] Fayyad U, 1996, AI MAG, V17, P37
  • [7] Research on Improved Apriori Algorithm Based on Coding and MapReduce
    Guo, Jian
    Ren, Yong-gong
    [J]. 2013 10TH WEB INFORMATION SYSTEM AND APPLICATION CONFERENCE (WISA 2013), 2013, : 294 - 299
  • [8] Mining frequent patterns without candidate generation: A frequent-pattern tree approach
    Han, JW
    Pei, J
    Yin, YW
    Mao, RY
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2004, 8 (01) : 53 - 87
  • [9] Listening to music enhances spatial-temporal reasoning: Evidence for the Mozart Effect
    Hetland, L
    [J]. JOURNAL OF AESTHETIC EDUCATION, 2000, 34 (3-4) : 105 - 148
  • [10] Park JS, 1997, IEEE T KNOWL DATA EN, V9, P813, DOI 10.1109/69.634757