Mining and prioritization of association rules for big data: multi-criteria decision analysis approach

Cited by: 14
Authors
Ait-Mlouk A. [1]
Agouti T. [1]
Gharnati F. [1]
Affiliation
[1] Laboratory of Intelligent Energy Management and Information Systems, Faculty of Sciences Semlalia, Cadi Ayyad University, Marrakech
Keywords
Apache Spark; Association rules; Big data; Data mining; Multi-criteria decision analysis; PFP-growth; Road accident
DOI
10.1186/s40537-017-0105-4
Abstract
Data mining techniques for extracting patterns from large datasets play a vital role in knowledge discovery. Most decision makers face a large number of decision rules resulting from association rule mining. Moreover, the volume of the datasets brings new challenges to pattern extraction, such as the computational cost and the difficulty of identifying the relevant rules. To overcome these challenges, this paper aims to build a learning model based on FP-growth and the Apache Spark framework to process the data and extract relevant association rules. We also integrate multi-criteria decision analysis to prioritize the extracted rules by taking into account the decision makers' subjective judgment. We believe that this approach would be a useful model to follow, particularly for decision makers who struggle with conflicts between extracted rules and with the difficulty of retaining only the most interesting rules. Experimental results on road accident analysis show that the proposed approach can efficiently extract more association rules with a higher accuracy rate and improve the response time of the algorithm. The results make clear that the proposed approach performs well and can provide useful information that could help decision makers improve road safety. © 2017, The Author(s).
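The two stages named in the abstract, rule extraction followed by multi-criteria prioritization, can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the toy transactions are hypothetical, a brute-force enumeration stands in for FP-growth on Apache Spark, and a weighted sum over support, confidence, and lift stands in for the multi-criteria decision analysis step, since the abstract does not say which criteria or which method the authors use.

```python
from itertools import combinations

# Hypothetical toy records; not data from the paper.
TRANSACTIONS = [
    {"night", "rain", "injury"},
    {"night", "rain", "injury"},
    {"night", "dry", "no_injury"},
    {"day", "rain", "injury"},
    {"day", "dry", "no_injury"},
    {"day", "dry", "no_injury"},
]


def support(itemset, transactions):
    """Fraction of transactions that contain every item in itemset."""
    return sum(itemset <= t for t in transactions) / len(transactions)


def mine_rules(transactions, min_support=0.3, min_confidence=0.6):
    """Enumerate rules X -> y (single-item consequent) by brute force.

    This is a stand-in for FP-growth, which finds the same frequent
    itemsets without enumerating every candidate.
    """
    items = sorted(set().union(*transactions))
    rules = []
    for size in (2, 3):
        for combo in combinations(items, size):
            itemset = frozenset(combo)
            sup = support(itemset, transactions)
            if sup == 0 or sup < min_support:
                continue
            for consequent in combo:
                antecedent = itemset - {consequent}
                conf = sup / support(antecedent, transactions)
                if conf < min_confidence:
                    continue
                lift = conf / support(frozenset([consequent]), transactions)
                rules.append({
                    "antecedent": antecedent,
                    "consequent": consequent,
                    "support": sup,
                    "confidence": conf,
                    "lift": lift,
                })
    return rules


def prioritize(rules, weights):
    """Rank rules by a weighted sum of min-max normalized criteria.

    The weights play the role of the decision maker's subjective judgment.
    """
    if not rules:
        return []

    def normalized(key):
        values = [r[key] for r in rules]
        lo, hi = min(values), max(values)
        return [(v - lo) / (hi - lo) if hi > lo else 1.0 for v in values]

    columns = {key: normalized(key) for key in weights}
    scored = [
        (sum(weights[k] * columns[k][i] for k in weights), rule)
        for i, rule in enumerate(rules)
    ]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored


if __name__ == "__main__":
    mined = mine_rules(TRANSACTIONS)
    ranking = prioritize(mined, {"support": 0.3, "confidence": 0.4, "lift": 0.3})
    for score, rule in ranking[:5]:
        lhs = ", ".join(sorted(rule["antecedent"]))
        print(f"{score:.2f}  {{{lhs}}} -> {rule['consequent']}")
```

Changing the weights reorders the rules without re-running the mining step, which is the practical appeal of separating extraction from prioritization.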