ChatPhishDetector: Detecting Phishing Sites Using Large Language Models

被引:2
作者
Koide, Takashi [1 ]
Nakano, Hiroki [2 ]
Chiba, Daiki
机构
[1] NTT Secur Holdings Corp, Tokyo 1010021, Japan
[2] NTT Corp, Tokyo 1010021, Japan
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Phishing; Uniform resource locators; Large language models; Crawlers; Codes; Web pages; Security; Accuracy; Visualization; Cognition; phishing sites; social engineering;
D O I
10.1109/ACCESS.2024.3483905
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Large Language Models (LLMs), such as ChatGPT, are significantly impacting various fields. While LLMs have been extensively studied for code generation and text synthesis, their application in detecting malicious web content, particularly phishing sites, remains largely unexplored. To counter the increasing cyber-attacks that leverage LLMs for creating more sophisticated and convincing phishing content, it is crucial to automate detection by harnessing LLMs' advanced capabilities. This paper introduces ChatPhishDetector, a novel system that employs LLMs to identify phishing sites. Our approach involves using a web crawler to collect website information, generating prompts for LLMs based on the gathered data, and extracting detection results from LLM responses. This system enables accurate detection of multilingual phishing sites by identifying impersonated brands and social engineering techniques within the entire website context, without requiring machine learning model training. We evaluated our system's performance using our own dataset and compared it with baseline systems and several LLMs. Experiments using GPT-4V showed exceptional results, achieving 98.7% precision and 99.6% recall, surpassing the detection performance of other LLMs and existing systems. These findings highlight the potential of LLMs for protecting users from online fraudulent activities and provide crucial insights for strengthening defenses against phishing attacks.
引用
收藏
页码:154381 / 154400
页数:20
相关论文
共 50 条
  • [31] Using Large Language Models in Business Processes
    Grisold, Thomas
    vom Brocke, Jan
    Kratsch, Wolfgang
    Mendling, Jan
    Vidgof, Maxim
    [J]. BUSINESS PROCESS MANAGEMENT, BPM 2023, 2023, 14159 : XXIX - XXXI
  • [32] Accelerating Pharmacovigilance using Large Language Models
    Prakash, Mukkamala Venkata Sai
    Parab, Ganesh
    Veeramalla, Meghana
    Reddy, Siddartha
    Varun, V.
    Gopalakrishnan, Saisubramaniam
    Pagidipally, Vishal
    Vaddina, Vishal
    [J]. PROCEEDINGS OF THE 17TH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, WSDM 2024, 2024, : 1182 - 1183
  • [33] Using Natural Language Processing for Phishing Detection
    Jonker, Richard Adolph Aires
    Poudel, Roshan
    Pedrosa, Tiago
    Lopes, Rui Pedro
    [J]. OPTIMIZATION, LEARNING ALGORITHMS AND APPLICATIONS, OL2A 2021, 2021, 1488 : 540 - 552
  • [34] Large Language Models and Computer Security
    Iyengar, Arun
    Kundu, Ashish
    [J]. 2023 5TH IEEE INTERNATIONAL CONFERENCE ON TRUST, PRIVACY AND SECURITY IN INTELLIGENT SYSTEMS AND APPLICATIONS, TPS-ISA, 2023, : 307 - 313
  • [35] Using Large Language Models to Improve Sentiment Analysis in Latvian Language
    Purvins, Pauls
    Urtans, Evalds
    Caune, Vairis
    [J]. BALTIC JOURNAL OF MODERN COMPUTING, 2024, 12 (02): : 165 - 175
  • [36] Detecting phishing attacks using a combined model of LSTM and CNN
    Ariyadasa, Subhash
    Fernando, Subha
    Fernando, Shantha
    [J]. INTERNATIONAL JOURNAL OF ADVANCED AND APPLIED SCIENCES, 2020, 7 (07): : 56 - 67
  • [37] Verbal lie detection using Large Language Models
    Loconte, Riccardo
    Russo, Roberto
    Capuozzo, Pasquale
    Pietrini, Pietro
    Sartori, Giuseppe
    [J]. SCIENTIFIC REPORTS, 2023, 13 (01)
  • [38] Using large language models to create narrative events
    Bartalesi, Valentina
    Lenzi, Emanuele
    De Martino, Claudio
    [J]. PEERJ COMPUTER SCIENCE, 2024, 10
  • [39] Agile Project Management Using Large Language Models
    Dhruva, G.
    Shettigar, Ishaan
    Parthasarthy, Srikrshna
    Sapna, V. M.
    [J]. 2024 5TH INTERNATIONAL CONFERENCE ON INNOVATIVE TRENDS IN INFORMATION TECHNOLOGY, ICITIIT 2024, 2024,
  • [40] Cyber Threat Hunting Using Large Language Models
    Tanksale, Vinayak
    [J]. PROCEEDINGS OF NINTH INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY, VOL 5, ICICT 2024, 2024, 1000 : 629 - 641