A New Approach to Improve Association Rules for Big Data in Cloud Environment

被引:0
作者
Dahmani, Djilali [1 ]
Rahal, Sidi Ahmed [1 ]
Belalem, Ghalem [1 ]
机构
[1] Univ Sci & Technol, Dept Comp Sci, Bab Ezzouar, Algeria
关键词
Big data; association rules; rule patterns; ontology; cloud computing; NoSQL;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The technique of association rules is very useful in Data Mining, but it generates a huge number of rules. So, a manual post-processing is required to target only the interesting rules. Several researchers suggest integrating users' knowledge by using ontology and rule patterns, and then select automatically the interesting rules after generating all possible rules. However, nowadays the business data are extremely increasing, and many companies have already opted for Big Data systems deployed in cloud environments, then the process of generating association rules becomes very hard. To deal with this issue, we propose an approach using ontology with rule patterns to integrate users' knowledge early in the preprocessing step before searching or generating any rule. So, only the interesting rules which respect the rule patterns will be generated. This approach allows reducing execution time and minimizing the cost of the post-processing especially for Big Data. To confirm the performance results, experiments are carried out on Not Only Strutured Query Language (NoSQL) databases which are distributed in a cloud environment.
引用
收藏
页码:1013 / 1020
页数:8
相关论文
共 22 条
[1]  
[Anonymous], 1999, Proceedings of the Fifth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD'99, DOI [DOI 10.1145/312129.312219, 10.1145/3121312219, DOI 10.1145/3121312219, 10.1145/312129.312219]
[2]  
[Anonymous], P ACM SIGMOD DMKD 00
[3]  
Bing Liu, 1999, Methodologies for Knowledge Discovery and Data Mining. Third Pacific-Asia Conference, PAKDD-99. Proceedings, P380
[4]   Improving the Performance of Data Mining by Using Big Data in Cloud Environment [J].
Dahmani, Djilali ;
Rahal, Sid Ahmed ;
Belalem, Ghalem .
JOURNAL OF INFORMATION & KNOWLEDGE MANAGEMENT, 2016, 15 (04)
[5]  
Guillet F., 2007, Quality Measures in Data Mining
[6]  
Han J, 2012, MOR KAUF D, P1
[7]  
Han Jiawi, 2000, DATA MIN KNOWL DISC, V8, P53, DOI DOI 10.1023/B:DAMI.0000005258.31418.83
[8]  
Hastie T., 2016, The Elements of Statistical Learning
[9]  
Hilderman RJ., 2001, ADV KNOWLEDGE DISCOV, P247
[10]  
JeyaKumar K, 2018, INT ARAB J INF TECHN, V15, P157