Performance Assessment of Multiple Machine Learning Classifiers for Detecting the Phishing URLs

被引:3
|
作者
Rahman, Sheikh Shah Mohammad Motiur [1 ]
Rafiq, Fatama Binta [1 ]
Toma, Tapushe Rabaya [1 ]
Hossain, Syeda Sumbul [1 ]
Biplob, Khalid Been Badruzzaman [1 ]
机构
[1] Daffodil Int Univ, Dept Software Engn, Dhaka, Bangladesh
来源
DATA ENGINEERING AND COMMUNICATION TECHNOLOGY, ICDECT-2K19 | 2020年 / 1079卷
关键词
Phishing; Malicious URLs; Anti-Phishing; Phishing detection;
D O I
10.1007/978-981-15-1097-7_25
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the field of information security, phishing URLs detection and prevention has recently become egregious. For detecting, phishing attacks several anti-phishing systems have already been proposed by researchers. The performance of those systems can be affected due to the lack of proper selection of machine learning classifiers along with the types of feature sets. A details investigation on machine learning classifiers (KNN, DT, SVM, RF, ERT and GBT) along with three publicly available datasets with multidimensional feature sets have been presented on this paper. The performance of the classifiers has been evaluated by confusion matrix, precision, recall, F1-score, accuracy and misclassification rate. The best output obtained from Random Forest and Extremely Randomized Tree with dataset one and three (binary class feature set) of 97% and 98% accuracy accordingly. In multiclass feature set (dataset two), Gradient Boosting Tree provides highest performance with 92% accuracy.
引用
收藏
页码:285 / 296
页数:12
相关论文
共 50 条
  • [31] Evasion Attacks and Defense Mechanisms for Machine Learning-Based Web Phishing Classifiers
    Pillai, Manu J.
    Remya, S.
    Devika, V.
    Ramasubbareddy, Somula
    Cho, Yongyun
    IEEE ACCESS, 2024, 12 : 19375 - 19387
  • [32] Detecting malicious URLs. A semi-supervised machine learning system approach
    Gabriel, Anton Dan
    Gavrilut, Dragos Teodor
    Alexandru, Baetu Ioan
    Stefan, Popescu Adrian
    PROCEEDINGS OF 2016 18TH INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND NUMERIC ALGORITHMS FOR SCIENTIFIC COMPUTING (SYNASC), 2016, : 233 - 239
  • [33] CCrFS: Combine Correlation Features Selection for Detecting Phishing Websites Using Machine Learning
    Moedjahedy, Jimmy
    Setyanto, Arief
    Alarfaj, Fawaz Khaled
    Alreshoodi, Mohammed
    FUTURE INTERNET, 2022, 14 (08)
  • [34] Performance Comparison of Classifiers on Reduced Phishing Website Dataset
    Karabatak, Murat
    Mustafa, Twana
    2018 6TH INTERNATIONAL SYMPOSIUM ON DIGITAL FORENSIC AND SECURITY (ISDFS), 2018, : 232 - 236
  • [35] CANTINA+: A Feature-Rich Machine Learning Framework for Detecting Phishing Web Sites
    Xiang, Guang
    Hong, Jason
    Rose, Carolyn P.
    Cranor, Lorrie
    ACM TRANSACTIONS ON INFORMATION AND SYSTEM SECURITY, 2011, 14 (02)
  • [36] Phishing Attacks: Detecting and Preventing Infected E-mails Using Machine Learning Methods
    Ona, Diego
    Zapata, Lenin
    Fuertes, Walter
    Rodriguez, German
    Benavides, Eduardo
    Toulkeridis, Theofilos
    2019 3RD CYBER SECURITY IN NETWORKING CONFERENCE (CSNET), 2019,
  • [37] Robust Ensemble Machine Learning Model for Filtering Phishing URLs: Expandable Random Gradient Stacked Voting Classifier (ERG-SVC)
    Indrasiri, Pubudu L.
    Halgamuge, Malka N.
    Mohammad, Azeem
    IEEE ACCESS, 2021, 9 : 150142 - 150161
  • [38] Detecting Phishing SMS Based on Multiple Correlation Algorithms
    Sonowal G.
    SN Computer Science, 2020, 1 (6)
  • [39] Comparative evaluation of machine learning algorithms for phishing site detection
    Almujahid, Noura Fahad
    Haq, Mohd Anul
    Alshehri, Mohammed
    PEERJ COMPUTER SCIENCE, 2024, 10
  • [40] Machine learning models for phishing detection from TLS traffic
    Munish Kumar
    Cheemaladinne Kondaiah
    Alwyn Roshan Pais
    Routhu Srinivasa Rao
    Cluster Computing, 2023, 26 : 3263 - 3277