Incremental frequent itemsets mining based on frequent pattern tree and multi-scale

被引:24
作者
Xun, Yaling [1 ]
Cui, Xiaohui [1 ]
Zhang, Jifu [1 ]
Yin, Qingxia [1 ]
机构
[1] Taiyuan Univ Sci & Technol, Sch Comp Sci & Technol, Taiyuan 030024, Peoples R China
关键词
Frequent itemsets mining; Multi-scale; Incremental mining; Frequent pattern tree; Association rules; ASSOCIATION RULES; ALGORITHM; MAINTENANCE;
D O I
10.1016/j.eswa.2020.113805
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-scale can reveal the structure and hierarchical characteristics of the data objects to reflect their essence from different perspectives and levels. An incremental frequent itemsets mining algorithm based on frequent pattern tree is proposed by incorporating multi-scale theory(simplified to FP-tree and Multi-Scale based Incremental Mining, FPMSIM). FPMSIM uses the classic FP-Growth to construct a pattern tree and generate frequent itemsets for more fine-grained dataset which is called benchmark scale dataset. The newly added dataset is also independently mined as a benchmark scale dataset. The ultimate frequent itemsets for the target scales are derived by means of the scale-up process. In which, some unknown itemsets counts need to be estimated by comparing the similarity among benchmark scale datasets. In this way, severe dataset rescanning and tree structure adjustment overhead are avoided during the maintenance process. The experimental results show that although the support estimation error will lead to incomplete frequent itemsets mining, it can be offset by the performance gains in the mining efficiency and I/O cost, especially in the field of big data.
引用
收藏
页数:13
相关论文
共 31 条
[1]  
Agrawal R., P 20 INT C VERY LARG
[2]   A Novel Clustering Algorithm Based on DPC and PSO [J].
Cai, Jianghui ;
Wei, Huiling ;
Yang, Haifeng ;
Zhao, Xujun .
IEEE ACCESS, 2020, 8 :88200-88214
[3]   Maintenance of discovered association rules in large databases: Art incremental updating technique [J].
Cheung, DW ;
Han, JW ;
Ng, VT ;
Wong, CY .
PROCEEDINGS OF THE TWELFTH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, 1996, :106-114
[4]  
Chia-Han Yang, 2009, 2009 International Conference on Computational Science and Engineering (CSE), P324, DOI 10.1109/CSE.2009.360
[5]   Highly Efficient Pattern Mining Based on Transaction Decomposition [J].
Djenouri, Youcef ;
Lin, Jerry Chun-Wei ;
Norvag, Kjetil ;
Ramampiaro, Heri .
2019 IEEE 35TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2019), 2019, :1646-1649
[6]   A new framework for metaheuristic-based frequent itemset mining [J].
Djenouri, Youcef ;
Djenouri, Djamel ;
Belhadi, Asma ;
Fournier-Viger, Philippe ;
Lin, Jerry Chun-Wei .
APPLIED INTELLIGENCE, 2018, 48 (12) :4775-4791
[7]   MOON-BASED EARTH OBSERVATION FOR LARGE SCALE GEOSCIENCE PHENOMENA [J].
Guo, Huadong ;
Liu, Guang ;
Ding, Yixing ;
Zou, Yongliao ;
Huang, Shaopeng ;
Jiang, Liming ;
Jia Gensuo ;
Lv, Mingyang ;
Ren, Yuanzhen ;
Ruan, Zhixing ;
Ye, Hanlin .
2016 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2016, :3705-3707
[8]   Mining frequent patterns without candidate generation: A frequent-pattern tree approach [J].
Han, JW ;
Pei, J ;
Yin, YW ;
Mao, RY .
DATA MINING AND KNOWLEDGE DISCOVERY, 2004, 8 (01) :53-87
[9]   Incrementally fast updated frequent pattern trees [J].
Hong, Tzung-Pei ;
Lin, Chun-Wei ;
Wu, Yu-Lung .
EXPERT SYSTEMS WITH APPLICATIONS, 2008, 34 (04) :2424-2435
[10]   A novel approach for mining frequent patterns from incremental data [J].
Jindal, Rajni ;
Borah, Malaya Dutta .
INTERNATIONAL JOURNAL OF DATA MINING MODELLING AND MANAGEMENT, 2016, 8 (03) :244-264