ChatGPT efficacy for answering musculoskeletal anatomy questions: a study evaluating quality and consistency between raters and timepoints

被引:3
作者
Mantzou, Nikolaos [1 ]
Ediaroglou, Vasileios [1 ]
Drakonaki, Elena [2 ]
Syggelos, Spyros A. [3 ]
Karageorgos, Filippos F. [1 ]
Totlis, Trifon [4 ]
机构
[1] Aristotle Univ Thessaloniki, Fac Hlth Sci, Sch Med, Thessaloniki 54124, Greece
[2] Clin Radiologist Univ Crete, Dept Anat, Iraklion, Greece
[3] Univ Patras, Sch Med, Dept Anat Histol Embryol, Patras, Greece
[4] Aristotle Univ Thessaloniki, Fac Hlth Sci, Sch Med, Dept Anat & Surg Anat, Thessaloniki 54124, Greece
关键词
ChatGPT; Anatomy; Artificial intelligence; Large language models;
D O I
10.1007/s00276-024-03477-9
中图分类号
R602 [外科病理学、解剖学]; R32 [人体形态学];
学科分类号
100101 ;
摘要
PurposeThere is increasing interest in the use of digital platforms such as ChatGPT for anatomy education. This study aims to evaluate the efficacy of ChatGPT in providing accurate and consistent responses to questions focusing on musculoskeletal anatomy across various time points (hours and days).MethodsA selection of 6 Anatomy-related questions were asked to ChatGPT 3.5 in 4 different timepoints. All answers were rated blindly by 3 expert raters for quality according to a 5 -point Likert Scale. Difference of 0 or 1 points in Likert scale scores between raters was considered as agreement and between different timepoints was considered as consistent indicating good reproducibility.ResultsThere was significant variation in the quality of the answers ranging from extremely good to very poor quality. There was also variation of consistency levels between different timepoints. Answers were rated as good quality (>= 3 in Likert scale) in 50% of cases (3/6) and as consistent in 66.6% (4/6) of cases. In the low-quality answers, significant mistakes, conflicting data or lack of information were encountered.ConclusionAs of the time of this article, the quality and consistency of the ChatGPT v3.5 answers is variable, thus limiting its utility as independent and reliable resource of learning musculoskeletal anatomy. Validating information by reviewing the anatomical literature is highly recommended.
引用
收藏
页码:1885 / 1890
页数:6
相关论文
共 19 条
[1]   Case report: absence of the right piriformis muscle in a woman [J].
Brenner, Erich ;
Tripoli, Massimiliano ;
Scavo, Elia ;
Cordova, Adriana .
SURGICAL AND RADIOLOGIC ANATOMY, 2019, 41 (07) :845-848
[2]   How Does ChatGPT Perform on the United States Medical Licensing Examination (USMLE)? The Implications of Large Language Models for Medical Education and Knowledge Assessment [J].
Gilson, Aidan ;
Safranek, Conrad W. ;
Huang, Thomas ;
Socrates, Vimig ;
Chi, Ling ;
Taylor, Richard Andrew ;
Chartash, David .
JMIR MEDICAL EDUCATION, 2023, 9
[3]   Influence on the accuracy in ChatGPT: Differences in the amount of information per medical field [J].
Haze, Tatsuya ;
Kawano, Rina ;
Takase, Hajime ;
Suzuki, Shota ;
Hirawa, Nobuhito ;
Tamura, Kouichi .
INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2023, 180
[4]   ChatGPT Is Equivalent to First-Year Plastic Surgery Residents: Evaluation of ChatGPT on the Plastic Surgery In-service Examination [J].
Humar, Pooja ;
Asaad, Malke ;
Bengur, Fuat Baris ;
Nguyen, Vu .
AESTHETIC SURGERY JOURNAL, 2023, 43 (12) :NP1085-NP1089
[5]  
Hyland S, 2023, STATPEARLS
[6]   The Significance of Artificial Intelligence Platforms in Anatomy Education: An Experience With ChatGPT and Google Bard [J].
Ilgaz, Hasan B. ;
Celik, Zehra .
CUREUS JOURNAL OF MEDICAL SCIENCE, 2023, 15 (09)
[7]  
Johnson Douglas, 2023, Res Sq, DOI 10.21203/rs.3.rs-2566942/v1
[8]   The Advent of Generative Language Models in Medical Education [J].
Karabacak, Mert ;
Ozkara, Burak Berksu ;
Margetis, Konstantinos ;
Wintermark, Max ;
Bisdas, Sotirios .
JMIR MEDICAL EDUCATION, 2023, 9
[9]   Challenge, integration, and change: ChatGPT and future anatomical education [J].
Leng, Lige .
MEDICAL EDUCATION ONLINE, 2024, 29 (01)
[10]   Initial impressions of ChatGPT for anatomy education [J].
Mogali, Sreenivasulu Reddy .
ANATOMICAL SCIENCES EDUCATION, 2024, 17 (02) :444-447