Identification of Phishing URLs Using Machine Learning Models

被引:0
|
作者
Vivek, Meghashyam [1 ]
Premjith, Nithin [1 ]
Johnson, Aaron Antonio [1 ]
Maurya, Ashutosh Kumar [1 ]
Jingle, I. Diana Jeba [1 ]
机构
[1] Christ, Bangalore, Karnataka, India
来源
FOURTH CONGRESS ON INTELLIGENT SYSTEMS, VOL 3, CIS 2023 | 2024年 / 865卷
关键词
XGBoost; Phishing; Prediction; Machine learning; Classifier;
D O I
10.1007/978-981-99-9043-6_18
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this study, we provide a machine learning-based method for identifying phishing URLs. Sixteen features, including Have IP, Have At, URL Length, URL Depth, Non-standard double slash, HTTPS domain, Shortened URL, Hyphen Count, DNS Record, Domain age, Domain active, iFrame, Mouse Over, Right click, Web Forwards, and Label, were extracted from the 600,000 URLs we gathered as a dataset of legitimate and phishing URLs. We then used this dataset to train a variety of machine learning models. These included standalone models such Naive Bayes, Logistic Regression, Decision Trees, and K-Nearest Neighbors (KNN). We also used ensemble models like Hard Voting, XGBoost, Random Forests, and AdaBoost. Finally, we used deep learning models such as Artificial Neural Networks (ANN), Long Short-Term Memory (LSTM), Gated Recurrent Units (GRU) and Convolutional Neural Networks (CNN). On evaluation of performance metrics like accuracy, precision, recall, train time and prediction time it was found that XGBoost provides the best performance across all categories.
引用
收藏
页码:209 / 219
页数:11
相关论文
共 50 条
  • [21] Classification of Malicious URLs Using Machine Learning
    Abad, Shayan
    Gholamy, Hassan
    Aslani, Mohammad
    SENSORS, 2023, 23 (18)
  • [22] Detection of malicious URLs using machine learning
    Reyes-Dorta, Nuria
    Caballero-Gil, Pino
    Rosa-Remedios, Carlos
    WIRELESS NETWORKS, 2024, 30 (09) : 7543 - 7560
  • [23] Detection of Phishing URLs by Using Deep Learning Approach and Multiple Features Combinations
    Rasymas, Tomas
    Dovydaitis, Laurynas
    BALTIC JOURNAL OF MODERN COMPUTING, 2020, 8 (03): : 471 - 483
  • [24] Phishing URL detection using machine learning methods
    Ahammad, S. K. Hasane
    Kale, Sunil D.
    Upadhye, Gopal D.
    Pande, Sandeep Dwarkanath
    Babu, E. Venkatesh
    Dhumane, Amol, V
    Bahadur, Dilip Kumar Jang
    ADVANCES IN ENGINEERING SOFTWARE, 2022, 173
  • [25] A Deep Learning for Arabic SMS Phishing Based on URLs Detection
    Alsufyani, Sadeem
    Alajmani, Samah
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2025, 16 (01) : 388 - 396
  • [26] Detecting Ambiguous Phishing Certificates using Machine Learning
    Homayoun, Sajad
    Hageman, Kaspar
    Afzal-Houshmand, Sam
    36TH INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING (ICOIN 2022), 2022, : 1 - 6
  • [27] Enhancing Phishing Detection: A Machine Learning Approach With Feature Selection and Deep Learning Models
    Nayak, Ganesh S.
    Muniyal, Balachandra
    Belavagi, Manjula C.
    IEEE ACCESS, 2025, 13 : 33308 - 33320
  • [28] Prediction of phishing websites using machine learning
    Pandey, Mithilesh Kumar
    Singh, Munindra Kumar
    Pal, Saurabh
    Tiwari, B. B.
    SPATIAL INFORMATION RESEARCH, 2023, 31 (02) : 157 - 166
  • [29] Detecting Malicious URLs Using Machine Learning Techniques: Review and Research Directions
    Aljabri, Malak
    Altamimi, Hanan S.
    Albelali, Shahd A.
    Al-Harbi, Maimunah
    Alhuraib, Haya T.
    Alotaibi, Najd K.
    Alahmadi, Amal A.
    Alhaidari, Fahd
    Mohammad, Rami Mustafa A.
    Salah, Khaled
    IEEE ACCESS, 2022, 10 : 121395 - 121417
  • [30] Detection of Phishing Websites using Machine Learning
    Razaque, Abdul
    Frej, Mohamed Ben Haj
    Sabyrov, Dauren
    Shaikhyn, Aidana
    Amsaad, Fathi
    Oun, Ahmed
    2020 IEEE CLOUD SUMMIT, 2020, : 103 - 107