Performance Assessment of Multiple Machine Learning Classifiers for Detecting the Phishing URLs

被引：3

作者：

Rahman, Sheikh Shah Mohammad Motiur ^{[1
]}

Rafiq, Fatama Binta ^{[1
]}

Toma, Tapushe Rabaya ^{[1
]}

Hossain, Syeda Sumbul ^{[1
]}

Biplob, Khalid Been Badruzzaman ^{[1
]}

机构：

[1] Daffodil Int Univ, Dept Software Engn, Dhaka, Bangladesh

来源：

DATA ENGINEERING AND COMMUNICATION TECHNOLOGY, ICDECT-2K19 | 2020年 / 1079卷

关键词：

Phishing; Malicious URLs; Anti-Phishing; Phishing detection;

D O I：

10.1007/978-981-15-1097-7_25

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In the field of information security, phishing URLs detection and prevention has recently become egregious. For detecting, phishing attacks several anti-phishing systems have already been proposed by researchers. The performance of those systems can be affected due to the lack of proper selection of machine learning classifiers along with the types of feature sets. A details investigation on machine learning classifiers (KNN, DT, SVM, RF, ERT and GBT) along with three publicly available datasets with multidimensional feature sets have been presented on this paper. The performance of the classifiers has been evaluated by confusion matrix, precision, recall, F1-score, accuracy and misclassification rate. The best output obtained from Random Forest and Extremely Randomized Tree with dataset one and three (binary class feature set) of 97% and 98% accuracy accordingly. In multiclass feature set (dataset two), Gradient Boosting Tree provides highest performance with 92% accuracy.

引用

页码：285 / 296

页数：12

共 50 条

[31] Evasion Attacks and Defense Mechanisms for Machine Learning-Based Web Phishing Classifiers
Pillai, Manu J.
Remya, S.
Devika, V.
Ramasubbareddy, Somula
Cho, Yongyun
IEEE ACCESS, 2024, 12 : 19375 - 19387
[32] Detecting malicious URLs. A semi-supervised machine learning system approach
Gabriel, Anton Dan
Gavrilut, Dragos Teodor
Alexandru, Baetu Ioan
Stefan, Popescu Adrian
PROCEEDINGS OF 2016 18TH INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND NUMERIC ALGORITHMS FOR SCIENTIFIC COMPUTING (SYNASC), 2016, : 233 - 239
[33] CCrFS: Combine Correlation Features Selection for Detecting Phishing Websites Using Machine Learning
Moedjahedy, Jimmy
Setyanto, Arief
Alarfaj, Fawaz Khaled
Alreshoodi, Mohammed
FUTURE INTERNET, 2022, 14 (08)
[34] Performance Comparison of Classifiers on Reduced Phishing Website Dataset
Karabatak, Murat
Mustafa, Twana
2018 6TH INTERNATIONAL SYMPOSIUM ON DIGITAL FORENSIC AND SECURITY (ISDFS), 2018, : 232 - 236
[35] CANTINA+: A Feature-Rich Machine Learning Framework for Detecting Phishing Web Sites
Xiang, Guang
Hong, Jason
Rose, Carolyn P.
Cranor, Lorrie
ACM TRANSACTIONS ON INFORMATION AND SYSTEM SECURITY, 2011, 14 (02)
[36] Phishing Attacks: Detecting and Preventing Infected E-mails Using Machine Learning Methods
Ona, Diego
Zapata, Lenin
Fuertes, Walter
Rodriguez, German
Benavides, Eduardo
Toulkeridis, Theofilos
2019 3RD CYBER SECURITY IN NETWORKING CONFERENCE (CSNET), 2019,
[37] Robust Ensemble Machine Learning Model for Filtering Phishing URLs: Expandable Random Gradient Stacked Voting Classifier (ERG-SVC)
Indrasiri, Pubudu L.
Halgamuge, Malka N.
Mohammad, Azeem
IEEE ACCESS, 2021, 9 : 150142 - 150161
[38] Detecting Phishing SMS Based on Multiple Correlation Algorithms
Sonowal G.
SN Computer Science, 2020, 1 (6)
[39] Comparative evaluation of machine learning algorithms for phishing site detection
Almujahid, Noura Fahad
Haq, Mohd Anul
Alshehri, Mohammed
PEERJ COMPUTER SCIENCE, 2024, 10
[40] Machine learning models for phishing detection from TLS traffic
Munish Kumar
Cheemaladinne Kondaiah
Alwyn Roshan Pais
Routhu Srinivasa Rao
Cluster Computing, 2023, 26 : 3263 - 3277

← 1 2 3 4 5 →