Devising and Detecting Phishing Emails Using Large Language Models

被引:9
|
作者
Heiding, Fredrik [1 ,2 ]
Schneier, Bruce [3 ]
Vishwanath, Arun [4 ]
Bernstein, Jeremy [5 ]
Park, Peter S. [5 ]
机构
[1] Harvard Univ, Harvard John A Paulson Sch Engn & Appl Sci, Cambridge, MA 02138 USA
[2] KTH Royal Inst Technol, S-11428 Stockholm, Sweden
[3] Harvard Univ, Harvard Kennedy Sch, Cambridge, MA 02138 USA
[4] Avant Res Grp, Buffalo, NY 14214 USA
[5] MIT, Cambridge, MA 02139 USA
关键词
Phishing; large language models; social engineering; artificial intelligence;
D O I
10.1109/ACCESS.2024.3375882
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
AI programs, built using large language models, make it possible to automatically create phishing emails based on a few data points about a user. The V-Triad is a set of rules for manually designing phishing emails to exploit our cognitive heuristics and biases. In this study, we compare the performance of phishing emails created automatically by GPT-4 and manually using the V-Triad. We also combine GPT-4 with the V-Triad to assess their combined potential. A fourth group, exposed to generic phishing emails, was our control group. We use a red teaming approach by simulating attackers and emailing 112 participants recruited for the study. The control group emails received a click-through rate between 19-28%, the GPT-generated emails 30-44%, emails generated by the V-Triad 69-79%, and emails generated by GPT and the V-Triad 43-81%. Each participant was asked to explain why they pressed or did not press a link in the email. These answers often contradict each other, highlighting the importance of personal differences. Next, we used four popular large language models (GPT, Claude, PaLM, and LLaMA) to detect the intention of phishing emails and compare the results to human detection. The language models demonstrated a strong ability to detect malicious intent, even in non-obvious phishing emails. They sometimes surpassed human detection, although often being slightly less accurate than humans. Finally, we analyze of the economic aspects of AI-enabled phishing attacks, showing how large language models increase the incentives of phishing and spear phishing by reducing their costs.
引用
收藏
页码:42131 / 42146
页数:16
相关论文
共 50 条
  • [31] Meaning and understanding in large language models
    Havlik, Vladimir
    SYNTHESE, 2024, 205 (01)
  • [32] Flying Into the Future With Large Language Models
    Kanjilal, Sanjat
    CLINICAL INFECTIOUS DISEASES, 2024, 78 (04) : 867 - 869
  • [33] A Surgical Perspective on Large Language Models
    Miller, Robert
    ANNALS OF SURGERY, 2023, 278 (02) : E211 - E213
  • [34] Eliciting metaknowledge in Large Language Models
    Longo, Carmelo Fabio
    Mongiovi, Misael
    Bulla, Luana
    Lieto, Antonio
    COGNITIVE SYSTEMS RESEARCH, 2025, 91
  • [35] Using Large Language Models to Improve Sentiment Analysis in Latvian Language
    Purvins, Pauls
    Urtans, Evalds
    Caune, Vairis
    BALTIC JOURNAL OF MODERN COMPUTING, 2024, 12 (02): : 165 - 175
  • [36] Detecting phishing attacks using a combined model of LSTM and CNN
    Ariyadasa, Subhash
    Fernando, Subha
    Fernando, Shantha
    INTERNATIONAL JOURNAL OF ADVANCED AND APPLIED SCIENCES, 2020, 7 (07): : 56 - 67
  • [37] From statistics to deep learning: Using large language models in psychiatric research
    Hua, Yining
    Beam, Andrew
    Chibnik, Lori B.
    Torous, John
    INTERNATIONAL JOURNAL OF METHODS IN PSYCHIATRIC RESEARCH, 2025, 34 (01)
  • [38] Corporate Event Predictions Using Large Language Models
    Xiao, Zhaomin
    Mai, Zhelu
    Xu, Zhuoer
    Cui, Yachen
    Li, Jiancheng
    2023 10TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING & MACHINE INTELLIGENCE, ISCMI, 2023, : 193 - 197
  • [39] Using Large Language Models to Understand Telecom Standards
    Karapantelakis, Athanasios
    Thakur, Mukesh
    Nikou, Alexandros
    Moradi, Farnaz
    Olrog, Christian
    Gaim, Fitsum
    Holm, Henrik
    Nimara, Doumitrou Daniil
    Huang, Vincent
    2024 IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING FOR COMMUNICATION AND NETWORKING, ICMLCN 2024, 2024, : 440 - 446
  • [40] Conversational Agents for Dementia using Large Language Models
    Favela, Jesus
    Cruz-Sandoval, Dagoberto
    Parra, Mario O.
    2023 MEXICAN INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE, ENC, 2024,