Variability in Large Language Models' Responses to Medical Licensing and Certification Examinations. Comment on "How Does ChatGPT Perform on the United States Medical Licensing Examination? The Implications of Large Language Models for Medical Education and Knowledge Assessment"

Cited by: 13
Authors
Epstein, Richard H. [1 ,3 ]
Dexter, Franklin [2 ]
Affiliations
[1] Univ Miami, Dept Anesthesiol Perioperat Med & Pain Management, Miller Sch Med, Miami, FL USA
[2] Univ Iowa, Dept Anesthesia, Div Management Consulting, Iowa City, IA USA
[3] Univ Miami, Dept Anesthesiol Perioperat Med & Pain Management, Miller Sch Med, 1400 NW 12th Ave,Suite 4022F, Miami, FL 33136 USA
Keywords
natural language processing; NLP; MedQA; generative pre-trained transformer; GPT; medical education; chatbot; artificial intelligence; AI; education technology; ChatGPT; Google Bard; conversational agent; machine learning; large language models; knowledge assessment
DOI
10.2196/48305
Chinese Library Classification (CLC) number
G40 [Education]
Discipline classification codes
040101; 120403
Pages: 2
Related papers
3 items in total
[1] How Does ChatGPT Perform on the United States Medical Licensing Examination (USMLE)? The Implications of Large Language Models for Medical Education and Knowledge Assessment [J].
Gilson, Aidan; Safranek, Conrad W.; Huang, Thomas; Socrates, Vimig; Chi, Ling; Taylor, Richard Andrew; Chartash, David.
JMIR Medical Education, 2023, 9
[2] google, GOOGL SEARCH HELP
[3] McEvoy M, 2020, WEB PRESENCE SOLUTIO