A high-accuracy phishing website detection method based on machine learning

被引:9
|
作者
Bahaghighat, Mahdi [1 ]
Ghasemi, Majid [1 ]
Ozen, Figen [2 ]
机构
[1] Imam Khomeini Int Univ, Dept Comp Engn, Qazvin, Iran
[2] Halic Univ, Istanbul, Turkiye
关键词
Phishing website detection; Cyber security; Machine learning; Classification; XGBoost;
D O I
10.1016/j.jisa.2023.103553
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The rapid development of e-commerce, e-banking, and social networks has made phishing attack detection one of the most critical technologies in all cyber security systems. To improve the efficiency of anti-phishing techniques, we present an improved predictive model based on machine learning. The proposed method uses six different algorithms; Logistic Regression, K-Nearest Neighbors, Naive Bayes, Random Forest, Support Vector Machine, and Extreme Gradient Boosting (XGBoost). The experiments are based on a public dataset of 58,000 legitimate websites and 30,647 phishing ones, including 112 attributes for each sample. Our evaluations in the feature selection process show that after balancing the dataset and dropping constant features, a noticeable improvement can be achieved. We conducted our evaluation found on eight major unique scenarios. The experimental results of our phishing websites detection (PWD) method indicate remarkable performances in which each algorithm reached an accuracy of more than 93%, and the XGBoost classifier outperforms others with 99.2% overall accuracy, 99.1% precision, 99.4% recall, and 99.1% specificity. In addition, the study achieved optimal run-time of about 1500 ms for the XGBoost algorithm without dimension reduction while using Principal Component Analysis (PCA) reduces it down to just 869 ms. As a result, the proposed approach would be practical in both offline and real-time applications.
引用
收藏
页数:17
相关论文
共 50 条
  • [41] High-Accuracy Wireless Traffic Prediction: A GP-Based Machine Learning Approach
    Xu, Yue
    Xu, Wenjun
    Yin, Feng
    Lin, Jiaru
    Cui, Shuguang
    GLOBECOM 2017 - 2017 IEEE GLOBAL COMMUNICATIONS CONFERENCE, 2017,
  • [42] A High-Accuracy Detection and Estimation Method of Intermodulated Sinusoids
    Yang, Zaiyue
    Chan, C. W.
    Wang, Yiwen
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2011, 58 (10) : 2477 - 2484
  • [43] A Machine-learning based Unbiased Phishing Detection Approach
    Shirazi, Hossein
    Zweigle, Landon
    Ray, Indrakshi
    PROCEEDINGS OF THE 17TH INTERNATIONAL JOINT CONFERENCE ON E-BUSINESS AND TELECOMMUNICATIONS (SECRYPT), VOL 1, 2020, : 423 - 430
  • [44] Phishing detection based on machine learning and feature selection methods
    Almseidin M.
    Abu Zuraiq A.M.
    Al-kasassbeh M.
    Alnidami N.
    International Journal of Interactive Mobile Technologies, 2019, 13 (12) : 71 - 183
  • [45] Phishing Attacks Detection A Machine Learning-Based Approach
    Salahdine, Fatima
    El Mrabet, Zakaria
    Kaabouch, Naima
    2021 IEEE 12TH ANNUAL UBIQUITOUS COMPUTING, ELECTRONICS & MOBILE COMMUNICATION CONFERENCE (UEMCON), 2021, : 250 - 255
  • [46] Feature Selections for the Machine Learning based Detection of Phishing Websites
    Buber, Ebubekir
    Demir, Onder
    Sahingoz, Ozgur Koray
    2017 INTERNATIONAL ARTIFICIAL INTELLIGENCE AND DATA PROCESSING SYMPOSIUM (IDAP), 2017,
  • [47] Highly accurate phishing URL detection based on machine learning
    Jalil S.
    Usman M.
    Fong A.
    Journal of Ambient Intelligence and Humanized Computing, 2023, 14 (07) : 9233 - 9251
  • [48] Feature Selection Approach for Phishing Detection Based on Machine Learning
    Wei, Yi
    Sekiya, Yuji
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON APPLIED CYBER SECURITY (ACS) 2021, 2022, 378 : 61 - 70
  • [49] High-accuracy positioning method based on reverse navigation solution in pipeline detection
    Huang F.
    Sun L.
    Guo L.
    Li Y.
    Qian F.
    Zhongguo Guanxing Jishu Xuebao/Journal of Chinese Inertial Technology, 2018, 26 (04): : 435 - 439
  • [50] Phishing Website Detection Algorithm Based on Link Structure
    Du Shu-Ying
    He Wang
    2019 INTERNATIONAL CONFERENCE ON ADVANCED ELECTRONIC MATERIALS, COMPUTERS AND MATERIALS ENGINEERING (AEMCME 2019), 2019, 563