Evaluating GPT-4's proficiency in addressing cryptography examinations

被引:2
作者
Mikhalev, Vasily [1 ]
Kopal, Nils [1 ]
Esslinger, Bernhard [1 ]
机构
[1] Univ Siegen, Siegen, Germany
基金
瑞典研究理事会;
关键词
Artificial intelligence; ChatGPT; cryptographic examinations;
D O I
10.1080/01611194.2024.2320368
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In the rapidly advancing domain of artificial intelligence, ChatGPT, powered by the GPT-4 model, has emerged as a state-of-the-art interactive agent, exhibiting substantial capabilities across various domains. This paper aims to assess the efficacy of GPT-4 in addressing and solving problems found within cryptographic examinations. We devised a multi-faceted methodology, presenting the model with a series of cryptographic questions of varying complexities derived from real academic examinations. Our evaluation encompasses both classical and modern cryptographic challenges, focusing on the model's ability to understand, interpret, and generate correct solutions while discerning its limitations. The model was challenged with a spectrum of cryptographic tasks, earning 202 out of 208 points by solving fundamental queries inspired by an oral exam, 80.5 out of 90 points on a written Crypto 1 exam, and 287 out of 385 points on advanced exercises from the Crypto 2 course. The results demonstrate that while GPT-4 shows significant promise in grasping fundamental cryptographic concepts and techniques, certain intricate problems necessitate domain-specific knowledge that may sometimes lie beyond the model's general training. Insights from this study can provide educators, researchers, and examiners with a deeper understanding of how cutting-edge AI models can be both an asset and a potential concern in academic settings related to cryptology. To enhance the clarity and coherence of our work, we utilized ChatGPT-4 to help us in formulating sentences in this paper.
引用
收藏
页码:170 / 185
页数:16
相关论文
共 11 条
[1]  
[Anonymous], 2023, Gpt-4 technical report
[2]  
Choi JH, 2022, J LEGAL EDUC, V71, P387
[3]  
Frieder S., 2023, ARXIV
[4]   How Does ChatGPT Perform on the United States Medical Licensing Examination (USMLE)? The Implications of Large Language Models for Medical Education and Knowledge Assessment [J].
Gilson, Aidan ;
Safranek, Conrad W. ;
Huang, Thomas ;
Socrates, Vimig ;
Chi, Ling ;
Taylor, Richard Andrew ;
Chartash, David .
JMIR MEDICAL EDUCATION, 2023, 9
[5]   Performance of ChatGPT on USMLE: Potential for AI-assisted medical education using large language models [J].
Kung, Tiffany H. ;
Cheatham, Morgan ;
Medenilla, Arielle ;
Sillos, Czarina ;
De Leon, Lorie ;
Elepano, Camille ;
Madriaga, Maria ;
Aggabao, Rimel ;
Diaz-Candido, Giezel ;
Maningo, James ;
Tseng, Victor .
PLOS DIGITAL HEALTH, 2023, 2 (02)
[6]  
Nori H., 2023, ARXIV
[7]  
Ray P.P., 2023, Internet of Things and Cyber-Physical Systems, V3, P121, DOI DOI 10.1016/J.IOTCPS.2023.04.003
[8]  
Terwiesch C, 2023, A prediction based on its performance in the operations management course
[9]  
Varanasi Lakshmi ., 2023, Business Insider
[10]  
Wardat Y., 2023, Eurasia Journal of Mathematics, Science and Technology Education, V19, pm2286, DOI DOI 10.29333/EJMSTE/13272