Detection of Phishing Emails using Data Mining Algorithms

被引:0
|
作者
Smadi, Sami [1 ]
Aslam, Nauman [1 ]
Zhang, Li [1 ]
Alasem, Rafe [2 ]
Hossain, M. A. [3 ]
机构
[1] Northumbria Univ, Fac Engn & Environm, Dept Comp Sci & Digital Technol, Newcastle Upon Tyne NE1 8ST, Tyne & Wear, England
[2] Imam Mohammad Ibn Saud Islam Univ, Fac Engn, Dept Elect Engn, Riyadh, Saudi Arabia
[3] Anglia Ruskin Univ, Fac Sci & Technol, Anglia Ruskin IT Res Inst, Cambridge, England
来源
2015 9TH INTERNATIONAL CONFERENCE ON SOFTWARE, KNOWLEDGE, INFORMATION MANAGEMENT AND APPLICATIONS (SKIMA) | 2015年
关键词
Phishing; Classification algorithms; Data mining;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes an intelligent model for detection of phishing emails which depends on a preprocessing phase that extracts a set of features concerning different email parts. The extracted features are classified using the J48 classification algorithm. We experimented with a total of 23 features that have been used in the literature. Ten-fold cross-validation was applied for training, testing and validation. The primary focus of this paper is to enhance the overall metrics values of email classification by focusing on the preprocessing phase and determine the best algorithm that can be used in this field. The results show the benefits of using our preprocessing phase to extract features from the dataset. The model achieved 98.87% accuracy for the random forest algorithm, which is the highest registered so far for an approved dataset. A comparison of ten different classification algorithms demonstrates their merits and capabilities through a set of experiments.
引用
收藏
页数:8
相关论文
共 50 条
  • [41] Data mining algorithms for land cover change detection: a review
    Panigrahi, Sangram
    Verma, Kesari
    Tripathi, Priyanka
    SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2017, 42 (12): : 2081 - 2097
  • [42] Data mining algorithms for land cover change detection: a review
    Sangram Panigrahi
    Kesari Verma
    Priyanka Tripathi
    Sādhanā, 2017, 42 : 2081 - 2097
  • [43] Climate change forecasting using data mining algorithms
    Khatri, Parul
    Arjariya, Tripti
    Mitra, Nikita Shivhare
    AQUA-WATER INFRASTRUCTURE ECOSYSTEMS AND SOCIETY, 2023, 72 (06) : 1065 - 1083
  • [44] Predicting user entries by using data mining algorithms
    Alhaj, Basel A.
    Maghari, Ashraf Y. A.
    2017 PALESTINIAN INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY (PICICT), 2017, : 110 - 114
  • [45] Prediction of Heart Diseases Using Data Mining Algorithms
    AL-Jammali K.
    Informatica (Slovenia), 2023, 47 (05): : 57 - 62
  • [46] Social Media Analytics Using Data Mining Algorithms
    Anand, Harnoor
    Mathur, Sandeep
    SUSTAINABLE COMMUNICATION NETWORKS AND APPLICATION, ICSCN 2019, 2020, 39 : 12 - 23
  • [47] Algorithms for Telemetry Data Mining using Discrete Attributes
    Ofer, Roy B.
    Eldar, Adi
    Shalev, Adi
    Resheff, Yehezkel S.
    ICPRAM: PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS, 2017, : 309 - 317
  • [48] Field Studies on the Impact of Cryptographic Signatures and Encryption on Phishing Emails
    Pham, Stefanie
    Schopp, Matthias
    Stiemert, Lars
    Seeber, Sebastian
    Poehn, Daniela
    Hommel, Wolfgang
    ICISSP: PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON INFORMATION SYSTEMS SECURITY AND PRIVACY, 2021, : 384 - 390
  • [49] To Recognize and Analyze Spam Domains from Spam Emails by Data Mining
    Patel, Kavita
    Dubey, Sanjay Kumar
    PROCEEDINGS OF THE 10TH INDIACOM - 2016 3RD INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT, 2016, : 4030 - 4035
  • [50] A DATA MINING APPROACH FOR THE ANALYSIS OF "STOCK-TOUTING" SPAM EMAILS
    Zaki, Mohamed
    Theodoulidis, Babis
    Solis, David Diaz
    INFORMATION TECHNOLOGIES' 2010, 2010, : 70 - 79