Identification of Phishing URLs Using Machine Learning Models

被引:0
|
作者
Vivek, Meghashyam [1 ]
Premjith, Nithin [1 ]
Johnson, Aaron Antonio [1 ]
Maurya, Ashutosh Kumar [1 ]
Jingle, I. Diana Jeba [1 ]
机构
[1] Christ, Bangalore, Karnataka, India
来源
FOURTH CONGRESS ON INTELLIGENT SYSTEMS, VOL 3, CIS 2023 | 2024年 / 865卷
关键词
XGBoost; Phishing; Prediction; Machine learning; Classifier;
D O I
10.1007/978-981-99-9043-6_18
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this study, we provide a machine learning-based method for identifying phishing URLs. Sixteen features, including Have IP, Have At, URL Length, URL Depth, Non-standard double slash, HTTPS domain, Shortened URL, Hyphen Count, DNS Record, Domain age, Domain active, iFrame, Mouse Over, Right click, Web Forwards, and Label, were extracted from the 600,000 URLs we gathered as a dataset of legitimate and phishing URLs. We then used this dataset to train a variety of machine learning models. These included standalone models such Naive Bayes, Logistic Regression, Decision Trees, and K-Nearest Neighbors (KNN). We also used ensemble models like Hard Voting, XGBoost, Random Forests, and AdaBoost. Finally, we used deep learning models such as Artificial Neural Networks (ANN), Long Short-Term Memory (LSTM), Gated Recurrent Units (GRU) and Convolutional Neural Networks (CNN). On evaluation of performance metrics like accuracy, precision, recall, train time and prediction time it was found that XGBoost provides the best performance across all categories.
引用
收藏
页码:209 / 219
页数:11
相关论文
共 50 条
  • [31] Phishing Websites Detection using Machine Learning
    Kulkarni, Arun
    Brown, Leonard L., III
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2019, 10 (07) : 8 - 13
  • [32] Detecting Phishing Domains Using Machine Learning
    Alnemari, Shouq
    Alshammari, Majid
    APPLIED SCIENCES-BASEL, 2023, 13 (08):
  • [33] Prediction of phishing websites using machine learning
    Mithilesh Kumar Pandey
    Munindra Kumar Singh
    Saurabh Pal
    B. B. Tiwari
    Spatial Information Research, 2023, 31 : 157 - 166
  • [34] Detecting Phishing Websites Using Machine Learning
    Alswailem, Amani
    Alabdullah, Bashayr
    Alrumayh, Norah
    Alsedrani, Aram
    2019 2ND INTERNATIONAL CONFERENCE ON COMPUTER APPLICATIONS & INFORMATION SECURITY (ICCAIS), 2019,
  • [35] A Novel Algorithm to Detect Phishing URLs
    Hawanna, Varsharani Ramdas
    Kulkarni, V. Y.
    Rane, R. A.
    2016 INTERNATIONAL CONFERENCE ON AUTOMATIC CONTROL AND DYNAMIC OPTIMIZATION TECHNIQUES (ICACDOT), 2016, : 548 - 552
  • [36] A Comprehensive Survey on Identification and Analysis of Phishing Website based on Machine Learning Methods
    Alkawaz, Mohammed Hazim
    Steven, Stephanie Joanne
    Hajamydeen, Asif Iqbal
    Ramli, Rusyaizila
    11TH IEEE SYMPOSIUM ON COMPUTER APPLICATIONS & INDUSTRIAL ELECTRONICS (ISCAIE 2021), 2021, : 82 - 87
  • [37] Robust Ensemble Machine Learning Model for Filtering Phishing URLs: Expandable Random Gradient Stacked Voting Classifier (ERG-SVC)
    Indrasiri, Pubudu L.
    Halgamuge, Malka N.
    Mohammad, Azeem
    IEEE ACCESS, 2021, 9 : 150142 - 150161
  • [38] Detecting Phishing Sites Using URLs Collected from Emails
    Wang, Chuan-Sheng
    Hsu, Fu-Hau
    Chen, Shih-Jen
    Hwang, Yan-Ling
    Wu, Min-Hao
    APPLIED SCIENCE AND PRECISION ENGINEERING INNOVATION, PTS 1 AND 2, 2014, 479-480 : 916 - +
  • [39] An enhanced deep learning-based phishing detection mechanism to effectively identify malicious URLs using variational autoencoders
    Prabakaran, Manoj Kumar
    Chandrasekar, Abinaya Devi
    Meenakshi Sundaram, Parvathy
    IET INFORMATION SECURITY, 2023, 17 (03) : 423 - 440
  • [40] A hybrid DNN–LSTM model for detecting phishing URLs
    Alper Ozcan
    Cagatay Catal
    Emrah Donmez
    Behcet Senturk
    Neural Computing and Applications, 2023, 35 : 4957 - 4973