A novel concurrent relational association rule mining approach

被引:29
作者
Czibula, Gabriela [1 ]
Czibula, Istvan Gergely [1 ]
Miholca, Diana-Lucia [1 ]
Crivei, Liana Maria [1 ]
机构
[1] Babes Bolyai Univ, Dept Comp Sci, 1 M Kogalniceanu St, Cluj Napoca 400084, Romania
关键词
Data mining; Relational association rules; Concurrency; SOFTWARE DEFECT PREDICTION;
D O I
10.1016/j.eswa.2019.01.082
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data mining techniques are intensively used to uncover relevant patterns in large volumes of complex data which are continuously extended with newly arrived data instances. Relational association rules (RARs), a data analysis and mining concept, have been introduced as an extension of classical association rules (ARs) for capturing various relationships between the attributes characterizing the data. Due to its NP-completeness, the problem of mining all the interesting RARs within a data set is computationally difficult. As the dimensionality of the data set to be mined increases, the classical algorithm Discovery of Relational Association Rules (DRAR) for RARs mining fails in providing the set of rules in reasonable time. This paper introduces a new approach named CRAR (Concurrent Relational Association Rule mining) which uses concurrency for the RARs discovery process and thus significantly reduces the mining time. The effectiveness of CRAR is empirically validated on nine open source data sets. The reduction in mining time when using CRAR against DRAR emphasizes that it can be successfully applied in various practical data mining scenarios. (C) 2019 Elsevier Ltd. All rights reserved.
引用
收藏
页码:142 / 156
页数:15
相关论文
共 47 条
[41]  
Tan P.-N., 2005, Introduction to Data Mining
[42]  
Termier A., 2011, RRLIG009
[43]   Data mining techniques on satellite images for discovery of risk areas [J].
Traore, Boukaye Boubacar ;
Kamsu-Foguem, Bernard ;
Tangara, Fana .
EXPERT SYSTEMS WITH APPLICATIONS, 2017, 72 :443-456
[44]   PGLCM: efficient parallel mining of closed frequent gradual itemsets [J].
Trong Dinh Thac Do ;
Termier, Alexandre ;
Laurent, Anne ;
Negrevergne, Benjamin ;
Omidvar-Tehrani, Behrooz ;
Amer-Yahia, Sihem .
KNOWLEDGE AND INFORMATION SYSTEMS, 2015, 43 (03) :497-527
[45]  
Uno T., 2004, LCM VER 2 EFFICIENT
[46]  
Wojciechowski M, 2003, LECT NOTES COMPUT SC, V2798, P76
[47]   Applying mutual information for discretization to support the discovery of rare-unusual association rule in cerebrovascular examination dataset [J].
Wulandari, Chandrawati Putri ;
Ou-Yang, Chao ;
Wang, Han-Cheng .
EXPERT SYSTEMS WITH APPLICATIONS, 2019, 118 :52-64