An efficient and resilience linear prefix approach for mining maximal frequent itemset using clustering

被引:0
作者
Sinthuja, M. [1 ,2 ]
Pravinthraja, S. [3 ]
Dhanalakshmi, B. K. [2 ,4 ]
Gururaj, H. L. [5 ]
Ravi, Vinayakumar [6 ]
Lal, G. Jyothish [7 ]
机构
[1] MS Ramaiah Inst Technol, Dept ISE, Bengaluru, India
[2] Visvesvaraya Technol Univ, Belagavi 590018, Karnataka, India
[3] Presidency Univ, Dept CSE, Bengaluru, India
[4] BMS Inst Technol & Management, Dept Comp Sci & Engn, Bengaluru, India
[5] Manipal Acad Higher Educ, Manipal Inst Technol Bengaluru, Dept Informat Technol, Manipal, India
[6] Prince Mohammad Bin Fahd Univ, Ctr Artificial Intelligence, Khobar, Saudi Arabia
[7] Amrita Vishwa Vidyapeetham, Ctr Computat Engn & Networking CEN, Amrita Sch Engn, Coimbatore, India
来源
JOURNAL OF SAFETY SCIENCE AND RESILIENCE | 2025年 / 6卷 / 01期
关键词
Clustering; Data mining; Frequent itemset mining; Linear prefix tree; Maximal frequent itemset mining;
D O I
10.1016/j.jnlssr.2024.08.001
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
The numerous volumes of data generated every day necessitate the deployment of new technologies capable of dealing with massive amounts of data efficiently. This is the case with Association Rules, a tool for unsupervised data mining that extracts information in the form of IF-THEN patterns. Although various approaches for extracting frequent itemset (prior step before mining association rules) in extremely large databases have been presented, the high computational cost and shortage of memory remain key issues to be addressed while processing enormous data. The objective of this research is to discover frequent itemset by using clustering for preprocessing and adopting the linear prefix tree algorithm for mining the maximal frequent itemset. The performance of the proposed CL-LP-MAX-tree was evaluated by comparing it with the existing FP-max algorithm. Experimentation was performed with the three different standard datasets to record evidence to prove that the proposed CL-LP-MAX-tree algorithm outperform the existing FP-max algorithm in terms of runtime and memory consumption.
引用
收藏
页码:93 / 104
页数:12
相关论文
共 19 条
[1]  
Agarwal R. C., 2000, Proceedings. KDD-2000. Sixth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, P108, DOI 10.1145/347090.347114
[2]  
Agrawal R., 1993, SIGMOD Record, V22, P207, DOI 10.1145/170036.170072
[3]  
Dr K.V., 2020, Paideuma J, VXIII, P1
[4]   Efficiently mining maximal frequent itemsets [J].
Gouda, K ;
Zaki, MJ .
2001 IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2001, :163-170
[5]   A Feature Selection Algorithm Based on Equal Interval Division and Minimal-Redundancy-Maximal-Relevance [J].
Gu, Xiangyuan ;
Guo, Jichang ;
Xiao, Lijun ;
Ming, Tao ;
Li, Chongyi .
NEURAL PROCESSING LETTERS, 2020, 51 (02) :1237-1263
[6]  
Han JW, 2000, SIGMOD RECORD, V29, P1
[7]  
Khedr A.M., 2020, IEEe Access
[8]  
Khedr A.M., 2021, IEEe Access, V9
[9]  
Liu Z., 2021, Fut. Gen. Comput. Syst., V126
[10]   CC-IFIM: an efficient approach for incremental frequent itemset mining based on closed candidates [J].
Magdy, Maged ;
Ghaleb, Fayed F. M. ;
Mohamed, Dawlat A. El A. ;
Zakaria, Wael .
JOURNAL OF SUPERCOMPUTING, 2023, 79 (07) :7877-7899