Deep convolutional forest: a dynamic deep ensemble approach for spam detection in text

被引:0
作者
Mai A. Shaaban
Yasser F. Hassan
Shawkat K. Guirguis
机构
[1] Alexandria University,Department of Mathematics and Computer Science, Faculty of Science
[2] Alexandria University,Faculty of Computers and Data Science
[3] Alexandria University,Institute of Graduate Studies and Research
来源
Complex & Intelligent Systems | 2022年 / 8卷
关键词
Ensemble methods; Deep learning; Machine learning; Spam classification; Text messages;
D O I
暂无
中图分类号
学科分类号
摘要
The increase in people’s use of mobile messaging services has led to the spread of social engineering attacks like phishing, considering that spam text is one of the main factors in the dissemination of phishing attacks to steal sensitive data such as credit cards and passwords. In addition, rumors and incorrect medical information regarding the COVID-19 pandemic are widely shared on social media leading to people’s fear and confusion. Thus, filtering spam content is vital to reduce risks and threats. Previous studies relied on machine learning and deep learning approaches for spam classification, but these approaches have two limitations. Machine learning models require manual feature engineering, whereas deep neural networks require a high computational cost. This paper introduces a dynamic deep ensemble model for spam detection that adjusts its complexity and extracts features automatically. The proposed model utilizes convolutional and pooling layers for feature extraction along with base classifiers such as random forests and extremely randomized trees for classifying texts into spam or legitimate ones. Moreover, the model employs ensemble learning procedures like boosting and bagging. As a result, the model achieved high precision, recall, f1-score and accuracy of 98.38%.
引用
收藏
页码:4897 / 4909
页数:12
相关论文
共 70 条
[1]  
Goel D(2018)Mobile phishing attacks and defence mechanisms: state of art and open research challenges Comput Secur 73 519-544
[2]  
Jain A(2020)A novel approach to detect spam and smishing SMS using machine learning techniques Int J EServ Mob Appl 12 21-38
[3]  
Jain A(2021)Online social networks security and privacy: comprehensive review and analysis Complex Intell Syst 29 473-529
[4]  
Yadav S(2021)A review on social spam detection: challenges, open issues, and future directions Expert Syst Appl 102 524-533
[5]  
Choudhary N(2021)Advances in spam detection for email spam, web spam, social network spam, and review spam: ML-based and nature-inspired-based techniques J Comput Secur 45 5-32
[6]  
Jain AK(2020)Deep learning to filter SMS spam Future Gener Comput Syst 155 167-176
[7]  
Sahoo SR(2001)Random forests Mach Learn 254 33-41
[8]  
Kaubiyal J(2015)Spam filtering for short messages in adversarial environment Neurocomputing 13 315-331
[9]  
Rao S(2017)Document representation and feature combination for deceptive spam review detection Neurocomputing 24 9625-9638
[10]  
Verma A(2018)Ham and spam E-mails classification using machine learning techniques J Appl Secur Res 48 3538-3556