BustedURL: Collaborative Multi-agent System for Real-Time Malicious URL Detection

被引:0
作者
Sundarraj, Jayaprakash Nariyambut [1 ]
Zhang, Yan [2 ]
Itharaju, Santosh Kapil Dev [1 ]
Saleh, Ahmed [1 ]
Ahmed, Saad [1 ]
Azam, Sami [2 ]
机构
[1] Charles Darwin Univ, Sydney Campus, Sydney, NSW 2000, Australia
[2] Charles Darwin Univ, Casuarina 0810, Australia
来源
DATABASES THEORY AND APPLICATIONS, ADC 2024 | 2025年 / 15449卷
关键词
Malicious URL detection; multi-agent system; phishing detection; reinforcement learning; real-time detection;
D O I
10.1007/978-981-96-1242-0_34
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Phishing is a major security issue in cyberspace, and time-stamping the detection of malicious URLs is crucial to safeguarding users and systems. Current approaches like URLNet and URLTran are reasonable, but the methods generally cannot adapt to quick changes and cannot be easily scaled as a standalone agent. In this paper, we present BustedURL, a sustainable framework designed to address these limitations. Leveraging a distributed, collaborative multi-agent architecture, BustedURL incorporates advanced methodologies, including transformers, ensemble learning, stacking, big-data aggregation, and sophisticated learning pipelines. These innovations enhance the framework's adaptability across diverse environments. We empirically evaluate the proposed architecture using real-time datasets from OpenPhish and various other sources, demonstrating that BustedURL outperforms current state-of-the-art solutions across various performance metrics in dynamic phishing contexts, owing to its distributed, scalable, and adaptive characteristics. Specifically, the scalability experiments demonstrated an approximately 50-fold improvement compared to the competitive baseline models.
引用
收藏
页码:463 / 476
页数:14
相关论文
共 36 条
[1]   A comprehensive performance analysis of Apache Hadoop and Apache Spark for large scale data sets using HiBench [J].
Ahmed, N. ;
Barczak, Andre L. C. ;
Susnjak, Teo ;
Rashid, Mohammed A. .
JOURNAL OF BIG DATA, 2020, 7 (01)
[2]   Effects of dataset size and interactions on the prediction performance of logistic regression and deep learning models [J].
Bailly, Alexandre ;
Blanc, Corentin ;
Francis, Elie ;
Guillotin, Thierry ;
Jamal, Fadi ;
Wakim, Bechara ;
Roy, Pascal .
COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2022, 213
[3]   PhishGNN: A Phishing Website Detection Framework using Graph Neural Networks [J].
Bilot, Tristan ;
Geis, Gregoire ;
Hammi, Badis .
SECRYPT : PROCEEDINGS OF THE 19TH INTERNATIONAL CONFERENCE ON SECURITY AND CRYPTOGRAPHY, 2022, :428-435
[4]   HELPHED: Hybrid Ensemble Learning PHishing Email Detection [J].
Bountakas, Panagiotis ;
Xenakis, Christos .
JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2023, 210
[5]   A Cooperative Multi-Agent Reinforcement Learning Method Based on Coordination Degree [J].
Cui, Haoyan ;
Zhang, Zhen .
IEEE ACCESS, 2021, 9 :123805-123814
[6]  
Devlin J, 2019, Arxiv, DOI [arXiv:1810.04805, DOI 10.48550/ARXIV.1810.04805, 10.48550/arXiv.1810.04805]
[7]   Magician's Corner: 9. Performance Metrics for Machine Learning Models [J].
Erickson, Bradley J. ;
Kitamura, Felipe .
RADIOLOGY-ARTIFICIAL INTELLIGENCE, 2021, 3 (03)
[8]   Improving the Latency of Python']Python-based Web Applications [J].
Esteves, Antonio ;
Fernandes, Joao .
WEBIST: PROCEEDINGS OF THE 15TH INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS AND TECHNOLOGIES, 2019, :193-201
[9]  
Garg N., 2013, Apache Kafka
[10]   Cyber Threat Intelligence-Based Malicious URL Detection Model Using Ensemble Learning [J].
Ghaleb, Fuad A. ;
Alsaedi, Mohammed ;
Saeed, Faisal ;
Ahmad, Jawad ;
Alasli, Mohammed .
SENSORS, 2022, 22 (09)