Visual and textual features based email spam classification using S-Cuckoo search and hybrid kernel support vector machine

Cited by: 0
Authors
T. Kumaresan
S. Saravanakumar
R. Balamurugan
Affiliations
[1] Bannari Amman Institute of Technology
[2] Adithya Institute of Technology
[3] Bharat Institute of Engineering and Technology
[4] Ibrahimpatanam
Source
Cluster Computing | 2019 / Volume 22
Keywords
Support vector machine; Cuckoo search; Spam; Correlogram; S-Cuckoo search
DOI
Not available
Abstract
Spam mail classification has become increasingly important with the rapid, largely uncontrolled growth of electronic media. The literature presents several algorithms for email spam classification. In this paper, we propose a spam classification framework using S-Cuckoo search and a hybrid kernel based support vector machine (HKSVM). First, features are extracted from both the text and the images of the e-mails. For the textual features, term frequency (TF) is used. For the image-dependent features, the correlogram and wavelet moments are used. Because the combined (hybrid) feature set is high-dimensional, the optimal features are selected with a hybrid algorithm called S-Cuckoo search. Classification is then performed with the proposed HKSVM model, an SVM classifier whose hybrid kernel blends three different kernel functions. The additional image-based features and the modified SVM classifier provide a significant improvement over existing algorithms. Spam classification performance is measured on db1 (combining the bare Ling-Spam and Spam Archive corpora) and db2 (combining the lemm Ling-Spam and Spam Archive corpora). Experimental results show that the proposed framework achieves an accuracy of 97.235%, compared with 94.117% for the existing approach.
Pages: 33-46
Page count: 13
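
The abstract describes the HKSVM kernel only as a blend of three kernel functions and does not name them or say how they are combined. As a minimal sketch, assuming the three kernels are linear, polynomial, and RBF and that they are combined as a weighted sum with non-negative weights (both are assumptions for illustration, not the authors' stated design), a hybrid kernel could look like this. All function names, weights, and hyperparameters below are hypothetical.

```python
import math


def linear_kernel(x, y):
    # Plain dot product of two equal-length feature vectors.
    return sum(a * b for a, b in zip(x, y))


def polynomial_kernel(x, y, degree=2, coef0=1.0):
    # (x . y + coef0) ** degree; degree and coef0 are illustrative defaults.
    return (linear_kernel(x, y) + coef0) ** degree


def rbf_kernel(x, y, gamma=0.5):
    # exp(-gamma * ||x - y||^2); gamma is an illustrative default.
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-gamma * sq_dist)


def hybrid_kernel(x, y, weights=(0.2, 0.3, 0.5)):
    # Assumed blending rule: a weighted sum of the three kernels.
    # A sum of valid kernels with non-negative weights is itself a valid kernel.
    w_lin, w_poly, w_rbf = weights
    return (
        w_lin * linear_kernel(x, y)
        + w_poly * polynomial_kernel(x, y)
        + w_rbf * rbf_kernel(x, y)
    )


def gram_matrix(samples, kernel=hybrid_kernel):
    # Pairwise kernel values for a list of feature vectors.
    return [[kernel(a, b) for b in samples] for a in samples]
```

A Gram matrix built this way could be handed to any SVM implementation that accepts a precomputed kernel, for example scikit-learn's `SVC(kernel="precomputed")`. In practice the weights would need to be tuned rather than fixed as above.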