Whale optimization algorithm-based email spam feature selection method using rotation forest algorithm for classification

被引:0
作者
Maryam Shuaib
Shafi’i Muhammad Abdulhamid
Olawale Surajudeen Adebayo
Oluwafemi Osho
Ismaila Idris
John K. Alhassan
Nadim Rana
机构
[1] Federal University of Technology,Department of Cyber Security Science
[2] Jazan University,College of Computer Science and Information Systems
来源
SN Applied Sciences | 2019年 / 1卷
关键词
Whale optimization algorithm; Metaheuristic algorithm; Email spam; Classification algorithms; Rotation forest; Feature selection;
D O I
暂无
中图分类号
学科分类号
摘要
Email has continued to be an integral part of our lives and as a means for successful communication on the internet. The problem of spam mails occupying a huge amount of space and bandwidth, and the weaknesses of spam filtering techniques which includes misclassification of genuine emails as spam (false positives) are a growing challenge to the internet world. This research work proposed the use of a metaheuristic optimization algorithm, the whale optimization algorithm (WOA), for the selection of salient features in the email corpus and rotation forest algorithm for classifying emails as spam and non-spam. The entire datasets were used, and the evaluation of the rotation forest algorithm was done before and after feature selection with WOA. The results obtained showed that the rotation forest algorithm after feature selection with WOA was able to classify the emails into spam and non-spam with a performance accuracy of 99.9% and a low FP rate of 0.0019. This shows that the proposed method had produced a remarkable improvement as compared with some previous methods.
引用
收藏
相关论文
共 75 条
[1]  
Alguliev Rasim M.(2011)Classification of Textual E-Mail Spam Using Data Mining Techniques Applied Computational Intelligence and Soft Computing 2011 1-8
[2]  
Aliguliyev Ramiz M.(2016)Optimizing connection weights in neural networks using the whale optimization algorithm Soft Comput 46 81-94
[3]  
Nazirova Saadat A.(2013)A comparative study on feature selection and adaptive strategies for email foldering using the ABC-DynF framework Knowl-Based Syst 25 51-65
[4]  
Aljarah I(2014)Gene tree correction for reconciliation and species tree inference: complexity and algorithms J Discrete Algorithms 15 319-326
[5]  
Faris H(2016)A whale optimization algorithm with inertia weight WSEAS Trans Comput 63 513-623
[6]  
Mirjalili S(1996)Metaheuristics: a bibliography Ann Oper Res 22 11-27
[7]  
Carmona-cejudo JM(2014)Improved email spam detection model with negative selection algorithm and particle swarm optimization Appl Soft Comput J 28 97-110
[8]  
Castillo G(2014)Hybrid email spam detection model with negative selection algorithm and differential evolution Eng Appl Artif Intell 4 342-344
[9]  
Baena-garcía M(2015)Spam detection using data mining tool in Matlab Int J Adv Res Comput Commun Eng 29 279-293
[10]  
Morales-bueno R(2016)Effective spam filtering using random forest Int J Innov Res Comput Commun Eng 4 4237-4242