Hybrid momentum accelerated bat algorithm with GWO based optimization approach for spam classification

Cited by: 11
Authors
Dhal, Pradip [1 ,2 ]
Azad, Chandrashekhar [1 ]
Affiliations
[1] Natl Inst Technol, ITER, Dept Comp Sci & Engn, Jamshedpur 831014, Jharkhand, India
[2] Siksha O Anusandhan Deemed Be Univ, Dept Comp Sci & Engn, Bhubaneswar 751030, Odisha, India
Funding
UK Research and Innovation;
Keywords
Spam detection; Feature selection; Bat algorithm; Grey wolf optimization; NEGATIVE SELECTION ALGORITHM; PARTICLE SWARM OPTIMIZATION; GREY WOLF OPTIMIZER; DETECTION MODEL; LEVY FLIGHT; NEURAL-NETWORKS; EMAIL; SYSTEM;
DOI
10.1007/s11042-023-16448-w
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract
Spam emails have become more prevalent, necessitating the development of more effective and reliable anti-spam filters. Spam exposes Internet users to security threats and young people to inappropriate content. The enormous data flow between billions of people and the very large number of features (attributes) make the filtering task more tedious and complex. A Feature Selection (FS) technique is essential for overcoming accuracy, time, and space complexity problems when the data is high dimensional (i.e., the number of features is very large). Various researchers have successfully used Machine Learning (ML) methods to filter and detect spam emails. This work proposes a hybrid binary Metaheuristic Algorithm (MA) based FS approach for classifying email spam. The proposed FS approach combines two MAs: the Bat Algorithm (BA) and Grey Wolf Optimization (GWO). A novel concept of bat momentum is introduced, replacing the bat velocity of the original BA. The two quantities, velocity and momentum, affect the particles (i.e., bats) in entirely different ways, although both always point in the same direction. To provide the best possible set of features for the FS process, the proposed approach uses an amalgamation technique to reach both the global and local optimum solutions. To obtain the global optimum solution, a new momentum-based equation is added to the BA in place of the velocity equation of the original BA. The GWO property is added to this momentum-based equation to improve the search capability of the FS process. A novel convergence timer is also introduced, which can eliminate the convergence issue in the iterative algorithm if it arises. Finally, a novel GWO-based Lévy flight update is introduced to produce the local optimum solution.
We evaluated the proposed method on two benchmark spam corpora (Spambase and SpamAssassin) with significantly different properties. The proposed FS approach was tested with various classification and clustering algorithms to check its robustness and how the model behaves on unseen data. Compared with multiple state-of-the-art and existing approaches, the proposed method is superior in boosting classification accuracy while reducing both the size of the feature set and the misclassification of legitimate emails as spam.
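The abstract does not give the momentum equation, the convergence timer, or the GWO-based Lévy flight update, so none of them is reproduced here. As a rough illustration of the generic building blocks that binary metaheuristic feature selection commonly relies on, the following Python sketch shows a Lévy step drawn with Mantegna's algorithm, a sigmoid transfer function that turns a real-valued position into a 0/1 feature mask, and a weighted fitness that trades classification error against the fraction of features kept. The function names, the sigmoid transfer, the value beta = 1.5, and the weight alpha = 0.99 are all assumptions made for illustration, not details taken from the paper.

```python
import math
import random


def levy_step(beta=1.5, rng=random):
    """Draw one heavy-tailed step with Mantegna's algorithm (beta is an assumed value)."""
    sigma_u = (
        math.gamma(1 + beta) * math.sin(math.pi * beta / 2)
        / (math.gamma((1 + beta) / 2) * beta * 2 ** ((beta - 1) / 2))
    ) ** (1 / beta)
    u = rng.gauss(0.0, sigma_u)
    v = rng.gauss(0.0, 1.0)
    while v == 0.0:  # avoid division by zero in the (very unlikely) degenerate draw
        v = rng.gauss(0.0, 1.0)
    return u / abs(v) ** (1 / beta)


def binarize(position, rng=random):
    """Map a real-valued position to a 0/1 feature mask via a sigmoid transfer (assumed)."""
    mask = []
    for x in position:
        x = max(-50.0, min(50.0, x))  # clamp so math.exp cannot overflow
        prob = 1.0 / (1.0 + math.exp(-x))
        mask.append(1 if rng.random() < prob else 0)
    return mask


def fitness(mask, error_rate, alpha=0.99):
    """Lower is better: weighted error plus fraction of features kept (alpha is assumed)."""
    if not any(mask):
        return float("inf")  # an empty feature subset cannot be evaluated
    return alpha * error_rate + (1 - alpha) * sum(mask) / len(mask)
```

In a wrapper setup, `error_rate` would come from training a classifier on only the columns where the mask is 1; that step is omitted here because the abstract does not say which classifier drives the search.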
Pages: 26929-26969
Page count: 41