ChatPhishDetector: Detecting Phishing Sites Using Large Language Models

被引:2
作者
Koide, Takashi [1 ]
Nakano, Hiroki [2 ]
Chiba, Daiki
机构
[1] NTT Secur Holdings Corp, Tokyo 1010021, Japan
[2] NTT Corp, Tokyo 1010021, Japan
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Phishing; Uniform resource locators; Large language models; Crawlers; Codes; Web pages; Security; Accuracy; Visualization; Cognition; phishing sites; social engineering;
D O I
10.1109/ACCESS.2024.3483905
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Large Language Models (LLMs), such as ChatGPT, are significantly impacting various fields. While LLMs have been extensively studied for code generation and text synthesis, their application in detecting malicious web content, particularly phishing sites, remains largely unexplored. To counter the increasing cyber-attacks that leverage LLMs for creating more sophisticated and convincing phishing content, it is crucial to automate detection by harnessing LLMs' advanced capabilities. This paper introduces ChatPhishDetector, a novel system that employs LLMs to identify phishing sites. Our approach involves using a web crawler to collect website information, generating prompts for LLMs based on the gathered data, and extracting detection results from LLM responses. This system enables accurate detection of multilingual phishing sites by identifying impersonated brands and social engineering techniques within the entire website context, without requiring machine learning model training. We evaluated our system's performance using our own dataset and compared it with baseline systems and several LLMs. Experiments using GPT-4V showed exceptional results, achieving 98.7% precision and 99.6% recall, surpassing the detection performance of other LLMs and existing systems. These findings highlight the potential of LLMs for protecting users from online fraudulent activities and provide crucial insights for strengthening defenses against phishing attacks.
引用
收藏
页码:154381 / 154400
页数:20
相关论文
共 50 条
[41]   Using Large Language Models to Understand Telecom Standards [J].
Karapantelakis, Athanasios ;
Thakur, Mukesh ;
Nikou, Alexandros ;
Moradi, Farnaz ;
Olrog, Christian ;
Gaim, Fitsum ;
Holm, Henrik ;
Nimara, Doumitrou Daniil ;
Huang, Vincent .
2024 IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING FOR COMMUNICATION AND NETWORKING, ICMLCN 2024, 2024, :440-446
[42]   Corporate Event Predictions Using Large Language Models [J].
Xiao, Zhaomin ;
Mai, Zhelu ;
Xu, Zhuoer ;
Cui, Yachen ;
Li, Jiancheng .
2023 10TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING & MACHINE INTELLIGENCE, ISCMI, 2023, :193-197
[43]   Conversational Agents for Dementia using Large Language Models [J].
Favela, Jesus ;
Cruz-Sandoval, Dagoberto ;
Parra, Mario O. .
2023 MEXICAN INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE, ENC, 2024,
[44]   Using Large Language Models for the Interpretation of Building Regulations [J].
Fuchs, Stefan ;
Witbrock, Michael ;
Dimyadi, Johannes ;
Amor, Robert .
Journal of Engineering, Project, and Production Management, 2024, 14 (04)
[45]   Classifying legal interpretations using large language models [J].
Dugac, Gaspar ;
Altwicker, Tilmann .
ARTIFICIAL INTELLIGENCE AND LAW, 2025,
[46]   Using Large Language Models for Math Information Retrieval [J].
Mansouri, Behrooz ;
Maarefdoust, Reihaneh .
PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, :2693-2697
[47]   Discovering prerequisite relations using large language models [J].
Aytekin, Mehmet Cem ;
Saygin, Yucel .
INTERACTIVE LEARNING ENVIRONMENTS, 2025, 33 (02) :1670-1688
[48]   Investigating the Potential of Using Large Language Models for Scheduling [J].
Jobson, Deddy ;
Li, Yilin .
PROCEEDINGS OF THE 1ST ACM INTERNATIONAL CONFERENCE ON AI-POWERED SOFTWARE, AIWARE 2024, 2024, :170-171
[49]   Demystifying large language models in second language development research [J].
Cong, Yan .
COMPUTER SPEECH AND LANGUAGE, 2025, 89
[50]   Detecting Cloud-Based Phishing Attacks by Combining Deep Learning Models [J].
Jha, Birendra ;
Atre, Medha ;
Rao, Ashwini .
2022 IEEE 4TH INTERNATIONAL CONFERENCE ON TRUST, PRIVACY AND SECURITY IN INTELLIGENT SYSTEMS, AND APPLICATIONS, TPS-ISA, 2022, :130-139