Evaluation of ChatGPT's Performance in the Turkish Board of Orthopaedic Surgery Examination

被引:0
|
作者
Yigitbay, Ahmet [1 ]
机构
[1] Siverek State Hosp, Clin Orthoped & Traumatol, Sanliurfa, Turkiye
来源
HASEKI TIP BULTENI-MEDICAL BULLETIN OF HASEKI | 2024年 / 62卷 / 04期
关键词
Artificial intelligence; humans; orthopedics; specialty boards; ARTIFICIAL-INTELLIGENCE;
D O I
10.4274/haseki.galenos.2024.10038
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Aim: Technological advances lead to significant changes in education and evaluation processes in medicine. In particular, artificial intelligence and natural language processing developments offer new opportunities in the health sector. This article evaluates Chat Generative Pre-Trained Transformer's (ChatGPT) performance in the Turkish Orthopaedics and Traumatology Education Council (TOTEK) Qualifying Written Examination and its applicability. Methods: To evaluate ChatGPT's performance, TOTEK Qualifying Written Examination questions from the last five years were entered as data. The results of ChatGPT were assessed under four parameters and compared with the actual exam results. The results were analyzed statistically. Results: Of the 500 questions, 458 were used as data in this study. Chat Generative Pre-Trained Transformer scored 40.2%, 26.3%, 37.3%, 32.9%, and 35.8% in the 2019, 2020, 2021, 2022, and 2023 TOTEK Qualifying Written Examination, respectively. When the correct answer percentages of ChatGPT according to years and the simple linear regression model applied to these data were analyzed, it was determined that there was a slightly decreasing trend in the correct answer rates as the years progressed. ChatGPT's TOTEK Qualifying Written Examination performance showed a statistically significant difference from the actual exam results. It was observed that the correct answer percentage of ChatGPT was below the general average success scores of the exam for each year. Conclusions: This analysis of artificial intelligence's applicability in the field and its role in training processes is essential to assess ChatGPT's potential uses and limitations. Chat Generative Pre-Trained Transformer can be a training tool, especially for knowledgebased and logical questions on specific topics. Still, its current performance is not at a level that can replace human decision-making in specialized medical fields.
引用
收藏
页码:243 / 249
页数:7
相关论文
共 50 条
  • [41] A Comprehensive Examination of ChatGPT's Contribution to the Healthcare Sector and Hepatology
    Kumari, Kabita
    Pahuja, Sharvan Kumar
    Kumar, Sanjeev
    DIGESTIVE DISEASES AND SCIENCES, 2024, : 4027 - 4043
  • [42] <hr>Inadequate Performance of ChatGPT on Orthopedic Board-Style Written Exams
    Sparks, Chandler A.
    Kraeutler, Matthew J.
    Chester, Grace A.
    Contrada, Edward, V
    Zhu, Eric
    Fasulo, Sydney M.
    Scillia, Anthony J.
    CUREUS JOURNAL OF MEDICAL SCIENCE, 2024, 16 (06)
  • [43] Performance of ChatGPT and Bard in self-assessment questions for nephrology board renewal
    Noda, Ryunosuke
    Izaki, Yuto
    Kitano, Fumiya
    Komatsu, Jun
    Ichikawa, Daisuke
    Shibagaki, Yugo
    CLINICAL AND EXPERIMENTAL NEPHROLOGY, 2024, 28 (05) : 465 - 469
  • [44] Performance of ChatGPT on the Chinese Postgraduate Examination for Clinical Medicine: Survey Study
    Yu, Peng
    Fang, Changchang
    Liu, Xiaolin
    Fu, Wanying
    Ling, Jitao
    Yan, Zhiwei
    Jiang, Yuan
    Cao, Zhengyu
    Wu, Maoxiong
    Chen, Zhiteng
    Zhu, Wengen
    Zhang, Yuling
    Abudukeremu, Ayiguli
    Wang, Yue
    Liu, Xiao
    Wang, Jingfeng
    JMIR MEDICAL EDUCATION, 2024, 10
  • [45] Evaluating ChatGPT-4's Performance in Identifying Radiological Anatomy in FRCR Part 1 Examination Questions
    Sarangi, Pradosh Kumar
    Datta, Suvrankar
    Panda, Braja Behari
    Panda, Swaha
    Mondal, Himel
    INDIAN JOURNAL OF RADIOLOGY AND IMAGING, 2024,
  • [46] ChatGPT Performance Evaluation on Chinese Language and Risk Measures
    Zhang H.
    Li L.
    Li C.
    Data Analysis and Knowledge Discovery, 2023, 7 (03) : 16 - 25
  • [47] ChatGPT's Success in the Board-Certified Pharmacotherapy Specialist (BCPS) Exam
    Al-Worafi, Yaser Mohammed
    Chooi, Wen Han
    Tan, Ching Siang
    Lua, Pei Lin
    Farrukh, Muhammad Junaid
    Zulkifly, Hanis Hanum
    Ming, Long Chiau
    JOURNAL OF RESEARCH IN PHARMACY, 2024, 28 (03): : 674 - 678
  • [48] A Novel Evaluation Model for Assessing ChatGPT on Otolaryngology-Head and Neck Surgery Certification Examinations: Performance Study
    Long, Cai
    Lowe, Kayle
    Zhang, Jessica
    dos Santos, Andre
    Alanazi, Alaa
    O'Brien, Daniel
    Wright, Erin
    Cote, David
    JMIR MEDICAL EDUCATION, 2024, 10
  • [49] Assessment Study of ChatGPT-3.5's Performance on the Final Polish Medical Examination: Accuracy in Answering 980 Questions
    Siebielec, Julia
    Ordak, Michal
    Oskroba, Agata
    Dworakowska, Anna
    Bujalska-Zadrozny, Magdalena
    HEALTHCARE, 2024, 12 (16)
  • [50] Evaluation of the performance of ChatGPT-4 and ChatGPT-4o as a learning tool in endodontics
    Ozturk, Esra Arili
    Gokduman, Ceren Turan
    Canakci, Burhan Can
    INTERNATIONAL ENDODONTIC JOURNAL, 2025,