An intelligent feature selection approach with systolic tree structures for efficient association rules in big data environment

被引:3
作者
Abushark, Yoosef B. [1 ]
机构
[1] King Abdulaziz Univ, Fac Comp & Informat Technol, Comp Sci Dept, Jeddah, Saudi Arabia
关键词
Big data; Systolic tree structure; Frequent pattern mining; Frequent pattern growth; Association rule mining; Kernel possibilistic fuzzy local information C; means clustering; Chaotic butterfly optimization algorithm;
D O I
10.1016/j.compeleceng.2022.108080
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Frequent Pattern Mining (FPM) is regularly used in data mining applications for identifying objects of interest that frequent transactional databases. Traditional ARM algorithms work on minimum levels of support and confidence metrics which have to be defined and are subjective. This work attempts to solve this issue by proposing techniques for the automatic determination of these metrics. The Kernel Possibilistic Fuzzy Local Information C-Means (KPFLICM) Clustering and K-means Clustering-based Feature Selection algorithm for Association Rules (KCFSAR) is used where FPMs are based on Systolic Tree Structures (STS). These structures aim to achieve better accuracy by mimicking internal memory structures of the Frequent Pattern Growths (FPG) algorithm. Moreover, Chaotic Butterfly Optimization Method (CBOAs) is used to obtain the optimal support value and CBOA subsequently discovers global optimal solutions. This work's suggested scheme demonstrates its efficacy and superiority in experimental findings when compared to other approaches in terms of feature subset sizes, accuracies, execution times, and memory consumptions.
引用
收藏
页数:15
相关论文
共 25 条
[1]  
Alhroob A, 2020, INT J ADV COMPUT SC, V11, P703
[2]  
Ammar SM, 2018, J ADV MATH COMPUT SC, P1
[3]   Butterfly optimization algorithm: a novel approach for global optimization [J].
Arora, Sankalap ;
Singh, Satvir .
SOFT COMPUTING, 2019, 23 (03) :715-734
[4]   Generalized Possibilistic Fuzzy C-Means with novel cluster validity indices for clustering noisy data [J].
Askari, S. ;
Montazerin, N. ;
Zarandi, M. H. Fazel .
APPLIED SOFT COMPUTING, 2017, 53 :262-283
[5]   On the performance improvement of Butterfly Optimization approaches for global optimization and Feature Selection [J].
Assiri, Adel Saad .
PLOS ONE, 2021, 16 (01)
[6]   Positive and negative association rule mining in Hadoop's MapReduce environment [J].
Bagui, Sikha ;
Dhar, Probal Chandra .
JOURNAL OF BIG DATA, 2019, 6 (01)
[7]  
Ban T., 2015, P IEEE INT JOINT C N, P1, DOI DOI 10.1109/IJCNN.2015.7280818
[8]   Kavosh: an effective Map-Reduce-based association rule mining method [J].
Barkhordari, Mohammadhossein ;
Niamanesh, Mahdi .
JOURNAL OF BIG DATA, 2018, 5 (01)
[9]  
Bouraoui M, 2017, I C SCI TECH AUTO CO, P185, DOI 10.1109/STA.2017.8314975
[10]   On the design of hardware-software architectures for frequent itemsets mining on data streams [J].
Bustio-Martinez, Lazaro ;
Cumplido, Rene ;
Hernandez-Leon, Raudel ;
Bande-Serrano, Jose M. ;
Feregrino-Uribe, Claudia .
JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2018, 50 (03) :415-440