Enhancing phishing email detection with stylometric features and classifier stacking

Cited by: 0
Authors
Chanis, Ilias [1]
Arampatzis, Avi [1]
Affiliations
[1] Democritus Univ Thrace, Dept Elect & Comp Engn, Xanthi, Greece
Keywords
Phishing email; Phishing detection; Machine learning; Natural language processing; Stylometry; Classifier stacking; Supervised classification
DOI
10.1007/s10207-024-00928-7
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Phishing is the most common and potentially dangerous cyber attack that organizations must deal with on a constant basis, which makes detecting it automatically and as early as possible a necessity for the security of computer systems. Focusing on the email level, this work improves content-based phishing email detection by integrating stylometric features with commonly used vectorization techniques and by using classifier stacking. Leveraging a diverse set of stylometric features, we systematically explore different methods of combining them with vectorized text, as well as multiple stacking configurations for the machine learning algorithms. Our findings demonstrate that the proposed methods consistently outperform vectorization-only baselines on an imbalanced dataset, with a smaller improvement on a balanced one. Specifically, we achieved an $F_1$ measure of 0.9843 on the balanced set and 0.9656 on the imbalanced one by stacking multiple different classifiers that were trained on the content and stylometric features separately, improving on the baselines by more than 2.2% for the imbalanced dataset. As such, our work contributes to the ongoing efforts in cybersecurity by further enhancing the performance of state-of-the-art phishing email detection systems.
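To make the idea of stylometric features concrete, the following is a minimal Python sketch of how a handful of style statistics might be computed from an email body and appended to a content vector. The abstract does not list the features the authors used, so the feature set here, along with the names `stylometric_features` and `combine`, is a hypothetical illustration and not the paper's method. Simple concatenation is only one possible way of combining the two feature types.

```python
import string
from typing import Dict, List


def stylometric_features(text: str) -> Dict[str, float]:
    """Compute a few simple style statistics for one email body.

    Assumption: this feature set is illustrative only; it is not the
    set of stylometric features used in the paper.
    """
    words = text.split()
    n_chars = len(text)
    n_words = len(words)
    return {
        "char_count": float(n_chars),
        "word_count": float(n_words),
        # Mean length of whitespace-separated tokens (punctuation included).
        "avg_word_length": sum(len(w) for w in words) / n_words if n_words else 0.0,
        # Share of characters that are uppercase letters.
        "uppercase_ratio": sum(c.isupper() for c in text) / n_chars if n_chars else 0.0,
        # Share of characters that are digits.
        "digit_ratio": sum(c.isdigit() for c in text) / n_chars if n_chars else 0.0,
        # Share of characters that are ASCII punctuation.
        "punctuation_ratio": sum(c in string.punctuation for c in text) / n_chars if n_chars else 0.0,
        "exclamation_count": float(text.count("!")),
    }


def combine(content_vector: List[float], text: str) -> List[float]:
    """Append stylometric values to an existing content vector.

    Assumption: plain concatenation, with features in sorted-key order so
    that every email yields the same column layout.
    """
    style = stylometric_features(text)
    return list(content_vector) + [style[key] for key in sorted(style)]
```

In a stacking setup such as the one the abstract describes, the stylometric values would instead be passed to their own base classifier, and a meta-classifier would learn from the base classifiers' outputs.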
Pages: 16
Related papers
50 in total
  • [1] Phishing Email Detection Based on Hybrid Features
    Yang, Zhuorao
    Qiao, Chen
    Kan, Wanling
    Qiu, Junji
    2018 4TH INTERNATIONAL CONFERENCE ON ENVIRONMENTAL SCIENCE AND MATERIAL APPLICATION, 2019, 252
  • [2] Phishing Email Detection Technique by using Hybrid Features
    Form, Lew May
    Chiew, Kang Leng
    Sze, San Nah
    Tiong, Wei King
    2015 9TH INTERNATIONAL CONFERENCE ON IT IN ASIA (CITA), 2015,
  • [3] A hybrid firefly and support vector machine classifier for phishing email detection
    Adewumi, Oluyinka Aderemi
    Akinyelu, Ayobami Andronicus
    KYBERNETES, 2016, 45 (06) : 977 - 994
  • [4] Enhancing Phishing Email Detection through Ensemble Learning and Undersampling
    Qi, Qinglin
    Wang, Zhan
    Xu, Yijia
    Fang, Yong
    Wang, Changhui
    APPLIED SCIENCES-BASEL, 2023, 13 (15):
  • [5] Automatically Generating Classifier for Phishing Email Prediction
    Ma, Liping
    Torney, Rosemary
    Watters, Paul
    Brown, Simon
    2009 10TH INTERNATIONAL SYMPOSIUM ON PERVASIVE SYSTEMS, ALGORITHMS, AND NETWORKS (ISPAN 2009), 2009, : 779 - 783
  • [6] Overconfidence in Phishing Email Detection
    Wang, Jingguo
    Li, Yuan
    Rao, H. Raghav
    JOURNAL OF THE ASSOCIATION FOR INFORMATION SYSTEMS, 2016, 17 (11): : 759 - 783
  • [7] Email Embeddings for Phishing Detection
    Gutierrez, Luis Felipe
    Abri, Faranak
    Armstrong, Miriam
    Namin, Akbar Siami
    Jones, Keith S.
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 2087 - 2092
  • [8] Spear Phishing Email Detection with Multiple Reputation Features and Sample Enhancement
    Ling, Zhiting
    Feng, Huamin
    Ding, Xiong
    Wang, Xuren
    Gao, Chang
    Yang, Peian
    SCIENCE OF CYBER SECURITY, SCISEC 2022, 2022, 13580 : 522 - 538
  • [9] Cue Utilization, Phishing Feature and Phishing Email Detection
    Bayl-Smith, Piers
    Sturman, Daniel
    Wiggins, Mark
    FINANCIAL CRYPTOGRAPHY AND DATA SECURITY, FC 2020, 2020, 12063 : 56 - 70
  • [10] Analyzing Social and Stylometric Features to Identify Spear phishing Emails
    Dewan, Prateek
    Kashyap, Anand
    Kumaraguru, Ponnurangam
    PROCEEDINGS OF THE 2014 APWG SYMPOSIUM ON ELECTRONIC CRIME RESEARCH (ECRIME), 2014,