Mining association rules for classification using frequent generator itemsets in arules package

被引:2
作者
Ledmi, Makhlouf [1 ]
Souidi, Mohammed El Habib [1 ]
Hahsler, Michael [2 ]
Ledmi, Abdeldjalil [1 ]
Kara-Mohamed, Chafia [3 ]
机构
[1] Abbas Laghrour Univ Khenchela, Dept Comp Sci, ICOSI Lab, Khenchela 40000, Algeria
[2] SMU, Bobby B Lyle Sch Engn, Dept Comp Sci, POB 750122, Dallas, TX 75275 USA
[3] Ferhat Abbas Univ Set 1, Dept Comp Sci, LRSD Lab, Setif 19000, Algeria
关键词
frequent generator itemsets; FGIs; classification; association rules; data mining; R language; EFFICIENT; PATTERNS; REPRESENTATION; TREE;
D O I
10.1504/IJDMMM.2023.131399
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Mining frequent itemsets is an attractive research activity in data mining whose main aim is to provide useful relationships among data. Consequently, several open-source development platforms are continuously developed to facilitate the users' exploitation of new data mining tasks. Among these platforms, the R language is one of the most popular tools. In this paper, we propose an extension of arules package by adding the option of mining frequent generator itemsets. We discuss in detail how generators can be used for a classification task through an application example in relation with COVID-19.
引用
收藏
页码:203 / 221
页数:20
相关论文
共 33 条
[1]  
Abbasi S., 2021, Journal of Algorithms and Computation, V53, P197, DOI DOI 10.22059/JAC.2021.81890
[2]   A novel feature selection method for data mining tasks using hybrid Sine Cosine Algorithm and Genetic Algorithm [J].
Abualigah, Laith ;
Dulaimi, Akram Jamal .
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2021, 24 (03) :2161-2176
[3]  
Agrawal R., 1993, SIGMOD Record, V22, P207, DOI 10.1145/170036.170072
[4]  
Al-Maolegi M, 2014, Arxiv, DOI arXiv:1403.3948
[5]  
Alasadi S.A., 2017, Journal of Engineering and Applied Sciences, V12, P4102, DOI DOI 10.3923/JEASCI.2017.4102.4107
[6]  
Bastide Y., 2000, COMPUTATIONAL
[7]  
Borgelt C., 2003, FIMI 03, V90
[8]   Free-sets: A condensed representation of Boolean data for the approximation of frequency queries [J].
Boulicaut, JF ;
Bykowski, A ;
Rigotti, C .
DATA MINING AND KNOWLEDGE DISCOVERY, 2003, 7 (01) :5-22
[9]  
Chang R., 2011, P 2011 INT C ELECT O, V1, pV1
[10]  
Davashi R, 2019, INT J DATA MIN MODEL, V11, P144