Bayesian spam filtering mechanism based on decision tree of attribute set dependence in the Mapreduce framework

被引:0
作者
机构
[1] School of Information Science and Engineering, Changzhou University, Jiangsu, Changzhou
[2] Department of Computer, Henan Institute of Engineering, Henan, Zhengzhou
来源
Sun, Yuqiang | 1600年 / Bentham Science Publishers B.V., P.O. Box 294, Bussum, 1400 AG, Netherlands卷 / 08期
关键词
Bayesian spam filtering; Decision tree of attribute sets dependence; MapReduce;
D O I
10.2174/1874110X01408010435
中图分类号
学科分类号
摘要
Bayesian spam filtering is a classification method based on the theory of probability and statistics, and the Bayesian spam filtering based on Mapreduce can solve the defect of the traditional Bayesian spam filtering that consumes large amounts of system resources and network resources when the mail set is pre-training. It needs to classify mails manually in the pre-training phase of mail set, which consumes a lot of human and financial resources and affects the efficiency of the system. Bayesian spam filtering mechanism based on decision tree of the attribute sets dependence in the MapReduce framework which is presented in this paper. And the decision tree of attribute sets dependence is used in the training stage of the mail set, which improves execution efficiency of the system by lowering the time complexity. © Guo et al.; Licensee Bentham Open.
引用
收藏
页码:435 / 441
页数:6
相关论文
共 8 条
[1]  
Sahami M., Dumais S., Heckerman D., A Bayesian Approach to Filtering Junk E-Mail, pp. 55-62, (1998)
[2]  
Dean J., Ghemawat S., MapReduce: simplified data processing on large clusters, Comm. ACM, 51, 1, pp. 107-113, (2008)
[3]  
Isard M., Budiu M., Yu Y., Birrell A., Fetterly D., Dryad: Distributed data-parallel programs from sequential building blocks, In: Proc. 2Nd Eur. Conf. Com. Syst., (Eurosys), pp. 59-72, (2007)
[4]  
Pawlak Z., Skowron A., Rudiments of rough sets, Inform. Sci, 177, 1, pp. 3-27, (2007)
[5]  
Pawlak Z., Rough set theory and its applications to data analysis, Cybern. Syst, 29, pp. 661-688, (1988)
[6]  
Rish I., An empirical study of the Naïve Bayesian classifier, In: Proc. IJCAI Workshop Empir, pp. 41-46, (2001)
[7]  
China Education and Research Network
[8]  
Deng W.B., Hong Z., Two-stage spam filtering method based on rough sets, Comput. Appl, 30, 8, pp. 2006-2009, (2010)