A novel concurrent relational association rule mining approach

被引:29
作者
Czibula, Gabriela [1 ]
Czibula, Istvan Gergely [1 ]
Miholca, Diana-Lucia [1 ]
Crivei, Liana Maria [1 ]
机构
[1] Babes Bolyai Univ, Dept Comp Sci, 1 M Kogalniceanu St, Cluj Napoca 400084, Romania
关键词
Data mining; Relational association rules; Concurrency; SOFTWARE DEFECT PREDICTION;
D O I
10.1016/j.eswa.2019.01.082
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data mining techniques are intensively used to uncover relevant patterns in large volumes of complex data which are continuously extended with newly arrived data instances. Relational association rules (RARs), a data analysis and mining concept, have been introduced as an extension of classical association rules (ARs) for capturing various relationships between the attributes characterizing the data. Due to its NP-completeness, the problem of mining all the interesting RARs within a data set is computationally difficult. As the dimensionality of the data set to be mined increases, the classical algorithm Discovery of Relational Association Rules (DRAR) for RARs mining fails in providing the set of rules in reasonable time. This paper introduces a new approach named CRAR (Concurrent Relational Association Rule mining) which uses concurrency for the RARs discovery process and thus significantly reduces the mining time. The effectiveness of CRAR is empirically validated on nine open source data sets. The reduction in mining time when using CRAR against DRAR emphasizes that it can be successfully applied in various practical data mining scenarios. (C) 2019 Elsevier Ltd. All rights reserved.
引用
收藏
页码:142 / 156
页数:15
相关论文
共 47 条
[1]  
Agrawal R., P 20 INT C VERY LARG, DOI DOI 10.1055/S-2007-996789
[2]  
[Anonymous], 2005, Data Mining: Concepts and Techniques
[3]  
[Anonymous], 2012, The promise repository of empirical software engineering data, Book The promise repository of empirical software engineering data, Series The promise repository of empirical software engineering data
[4]  
Anping Song, 2015, ICIC Express Letters, V9, P2387
[5]  
Bhujade M, 2009, PARALLEL COMPUTING
[6]   Identifying risk factors for adverse diseases using dynamic rare association rule mining [J].
Borah, Anindita ;
Nath, Bhabesh .
EXPERT SYSTEMS WITH APPLICATIONS, 2018, 113 :233-263
[7]   Mining frequent itemsets in a stream [J].
Calders, Toon ;
Dexters, Nele ;
Gillis, Joris J. M. ;
Goethals, Bart .
INFORMATION SYSTEMS, 2014, 39 :233-255
[8]  
Campan A., 2006, STUD U BABES BOLYAI, VLI, P31
[9]  
Campan A., 2006, DMIN, V6, P107
[10]  
Chia-Chu Chiang, 2010, 2010 International Conference on System Science and Engineering (ICSSE 2010), P593, DOI 10.1109/ICSSE.2010.5551704