Deep learning-based spam image filtering

被引:5
|
作者
Salama, Wessam M. [1 ]
Aly, Moustafa H. [2 ]
Abouelseoud, Yasmine [3 ]
机构
[1] Pharos Univ, Fac Engn, Dept Basic Sci, Alexandria, Egypt
[2] Arab Acad Sci Technol & Maritime Transport, Coll Engn Technol, Dept Elect & Commun Engn, Alexandria, Egypt
[3] Alexandria Univ, Fac Engn, Dept Engn Math & Phys, Alexandria, Egypt
关键词
Deep learning; Transfer learning; Data augmentation; Image spam;
D O I
10.1016/j.aej.2023.01.048
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Spam is some unwanted material that may be put in the form of images. While many machine learning approaches are effective at detecting textual spam, this is not true for image spam. In this paper, a new framework for identifying image spams is proposed. The images are divided into two categories: undesirable material contained in the form of images which is referred to as a spam image, whereas anything else is referred to as a ham image. Our proposed methodology is based on applying different pre-trained deep learning models, including InceptionV3, Densely Connected Convolutional Networks 121(DenseNet121), Residual Networks (ResNet50), Visual Geometry Group (VGG16) and MobileNetV2, to filter out the unwanted spam images. Different standard test datasets such as Dredze Dataset, Image Spam Hunter (ISH) Dataset and Improved Dataset are utilized in this paper for performance testing. Furthermore, transfer learning and data augmentation are employed to address the issue of a shortage of labeled data. In our implementation, the fully connected (FC) layer in the aforementioned pre-trained models is replaced with a Support Vector Machine (SVM) classifier, resulting in an improved accuracy. The obtained results reveal that ResNet50 model yields the best performance achieving 99.87% accuracy, 99.88% area under the curve (AUC), 99.98% sensitivity, 99.79% precision, 98.99% F1 score and a computational testing time of in the order of one to two seconds for the ISH dataset. (c) 2023 The Authors. Published by Elsevier BV on behalf of Faculty of Engineering, Alexandria University This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/ licenses/by-nc-nd/4.0/).
引用
收藏
页码:461 / 468
页数:8
相关论文
共 50 条
  • [1] A survey of learning-based techniques of email spam filtering
    Enrico Blanzieri
    Anton Bryl
    Artificial Intelligence Review, 2008, 29 : 63 - 92
  • [2] A survey of learning-based techniques of email spam filtering
    Blanzieri, Enrico
    Bryl, Anton
    ARTIFICIAL INTELLIGENCE REVIEW, 2008, 29 (01) : 63 - 92
  • [3] PROTECTOR: An optimized deep learning-based framework for image spam detection and prevention
    Makkar, Aaisha
    Kumar, Neeraj
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2021, 125 : 41 - 58
  • [4] High Efficiency Spam Filtering: A Manifold Learning-Based Approach
    Wang, Chao
    Li, Qun
    Ren, Tian-yu
    Wang, Xiao-hu
    Guo, Guang-xin
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2021, 2021
  • [5] Dynamic PET Image Denoising With Deep Learning-Based Joint Filtering
    He, Yuru
    Cao, Shuangliang
    Zhang, Hongyan
    Sun, Hao
    Wang, Fanghu
    Zhu, Huobiao
    Lv, Wenbing
    Lu, Lijun
    IEEE ACCESS, 2021, 9 : 41998 - 42012
  • [6] Efficient spam and phishing emails filtering based on deep learning
    Magdy, Safaa
    Abouelseoud, Yasmine
    Mikhail, Mervat
    COMPUTER NETWORKS, 2022, 206
  • [7] Efficient spam and phishing emails filtering based on deep learning
    Magdy S.
    Abouelseoud Y.
    Mikhail M.
    Computer Networks, 2022, 206
  • [8] Spam Image Detection Model based on Deep Learning for Improving Spam Filter
    Nam, Seong-Guk
    Lee, Dong-Gun
    Seo, Yeong-Seok
    JOURNAL OF INFORMATION PROCESSING SYSTEMS, 2023, 19 (03): : 289 - 301
  • [9] Clustering-based Spam Image Filtering Considering Fuzziness of the Spam Image
    Prince, Master
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2016, 7 (12) : 268 - 270
  • [10] Deobfuscating Leetspeak With Deep Learning to Improve Spam Filtering
    de Mendizabal, Inaki Velez
    Vidriales, Xabier
    Basto-Fernandes, Vitor
    Ezpeleta, Enaitz
    Mendez, Jose R.
    Zurutuza, Urko
    INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2023, 8 (04): : 46 - 55