ChatPhishDetector: Detecting Phishing Sites Using Large Language Models

被引:2
|
作者
Koide, Takashi [1 ]
Nakano, Hiroki [2 ]
Chiba, Daiki
机构
[1] NTT Secur Holdings Corp, Tokyo 1010021, Japan
[2] NTT Corp, Tokyo 1010021, Japan
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Phishing; Uniform resource locators; Large language models; Crawlers; Codes; Web pages; Security; Accuracy; Visualization; Cognition; phishing sites; social engineering;
D O I
10.1109/ACCESS.2024.3483905
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Large Language Models (LLMs), such as ChatGPT, are significantly impacting various fields. While LLMs have been extensively studied for code generation and text synthesis, their application in detecting malicious web content, particularly phishing sites, remains largely unexplored. To counter the increasing cyber-attacks that leverage LLMs for creating more sophisticated and convincing phishing content, it is crucial to automate detection by harnessing LLMs' advanced capabilities. This paper introduces ChatPhishDetector, a novel system that employs LLMs to identify phishing sites. Our approach involves using a web crawler to collect website information, generating prompts for LLMs based on the gathered data, and extracting detection results from LLM responses. This system enables accurate detection of multilingual phishing sites by identifying impersonated brands and social engineering techniques within the entire website context, without requiring machine learning model training. We evaluated our system's performance using our own dataset and compared it with baseline systems and several LLMs. Experiments using GPT-4V showed exceptional results, achieving 98.7% precision and 99.6% recall, surpassing the detection performance of other LLMs and existing systems. These findings highlight the potential of LLMs for protecting users from online fraudulent activities and provide crucial insights for strengthening defenses against phishing attacks.
引用
收藏
页码:154381 / 154400
页数:20
相关论文
共 50 条
  • [1] Devising and Detecting Phishing Emails Using Large Language Models
    Heiding, Fredrik
    Schneier, Bruce
    Vishwanath, Arun
    Bernstein, Jeremy
    Park, Peter S.
    IEEE ACCESS, 2024, 12 : 42131 - 42146
  • [2] Benchmarking and Evaluating Large Language Models in Phishing Detection for Small and Midsize Enterprises: A Comprehensive Analysis
    Zhang, Jun
    Wu, Peiqiao
    London, Jeffrey
    Tenney, Dan
    IEEE ACCESS, 2025, 13 : 28335 - 28352
  • [3] Detecting Phishing Sites Using URLs Collected from Emails
    Wang, Chuan-Sheng
    Hsu, Fu-Hau
    Chen, Shih-Jen
    Hwang, Yan-Ling
    Wu, Min-Hao
    APPLIED SCIENCE AND PRECISION ENGINEERING INNOVATION, PTS 1 AND 2, 2014, 479-480 : 916 - +
  • [4] Evaluating the Efficacy of Large Language Models in Identifying Phishing Attempts
    Patel, Het
    Reiman, Umair
    Iqbal, Farkhund
    2024 16TH INTERNATIONAL CONFERENCE ON HUMAN SYSTEM INTERACTION, HSI 2024, 2024,
  • [5] The Dual-Edged Sword of Large Language Models in Phishing
    Siemerink, Alec
    Jansen, Slinger
    Labunets, Katsiaryna
    SECURE IT SYSTEMS, NORDSEC 2024, 2025, 15396 : 258 - 279
  • [6] Detecting covert channels in cloud access control policies using Large Language Models
    Karmarkar, Hrishikesh
    Joshi, Vaibhavi
    Venkatesh, R.
    2024 IEEE INTERNATIONAL CONFERENCE ON CYBER SECURITY AND RESILIENCE, CSR, 2024, : 241 - 246
  • [7] Detecting Phishing Attacks Using Natural Language Processing And Machine Learning
    Banu, Reshma
    Anand, M.
    Kamath, Akshatha C.
    Ashika, S.
    Ujwala, H. S.
    Harshitha, S. N.
    PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICCS), 2019, : 1210 - 1214
  • [8] A Systematic Review: Detecting Phishing Websites Using Data Mining Models
    Jibat D.
    Jamjoom S.
    Al-Haija Q.A.
    Qusef A.
    Intelligent and Converged Networks, 2023, 4 (04): : 326 - 341
  • [9] Phishing Website Detection Using Deep Learning Models
    Zara, Ume
    Ayyub, Kashif
    Khan, Hikmat Ullah
    Daud, Ali
    Alsahfi, Tariq
    Ahmad, Saima Gulzar
    IEEE ACCESS, 2024, 12 : 167072 - 167087
  • [10] LEVA: Using Large Language Models to Enhance Visual Analytics
    Zhao, Yuheng
    Zhang, Yixing
    Zhang, Yu
    Zhao, Xinyi
    Wang, Junjie
    Shao, Zekai
    Turkay, Cagatay
    Chen, Siming
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2025, 31 (03) : 1830 - 1847