Enhancements in artificial intelligence for medical examinations: A leap from ChatGPT 3.5 to ChatGPT 4.0 in the FRCS trauma & orthopaedics examination

被引:0
作者
Khan, Akib Majed [1 ]
Sarraf, Khaled Maher [1 ]
Simpson, Ashley Iain [2 ]
机构
[1] Imperial Coll Healthcare NHS Trust, Praed St, London W2 1NY, England
[2] Royal Natl Orthopaed Hosp, Brockley Hill, Stanmore HA7 4LP, England
来源
SURGEON-JOURNAL OF THE ROYAL COLLEGES OF SURGEONS OF EDINBURGH AND IRELAND | 2025年 / 23卷 / 01期
关键词
Artificial intelligence; ChatGPT; FRCS; Trauma & orthopaedics; Medical education;
D O I
10.1016/j.surge.2024.11.008
中图分类号
R61 [外科手术学];
学科分类号
摘要
Introduction: ChatGPT is a sophisticated AI model capable of generating human-like text based on the input it receives. ChatGPT 3.5 showed an inability to pass the FRCS (Tr&Orth) examination due to a lack of higher-order judgement in previous studies. Enhancements in ChatGPT 4.0 warrant an evaluation of its performance. Methodology: Questions from the UK-based December 2022 In-Training examination were input into ChatGPT 3.5 and 4.0. Methodology from a prior study was replicated to maintain consistency, allowing for a direct comparison between the two model versions. The performance threshold remained at 65.8 %, aligning with the November 2022 sitting of Section 1 of the FRCS (Tr&Orth).<br /> Results: ChatGPT 4.0 achieved a passing score (73.9 %), indicating an improvement in its ability to analyse clinical information and make decisions reflective of a competent trauma and orthopaedic consultant. Compared to ChatGPT 4.0, version 3.5 scored 38.1 % lower, which represents a significant difference (p < 0.0001; Chisquare). The breakdown by subspecialty further demonstrated version 4.0's enhanced understanding and application in complex clinical scenarios. ChatGPT 4.0 had a significantly significant improvement in answering image-based questions (p = 0.0069) compared to its predecessor.<br /> Conclusion: ChatGPT 4.0's success in passing Section One of the FRCS (Tr&Orth) examination highlights the rapid evolution of AI technologies and their potential applications in healthcare and education.
引用
收藏
页码:13 / 17
页数:5
相关论文
共 8 条
  • [1] ChatGPT sitting for FRCS Urology examination: Will artificial intelligence get certified?
    Desouky, Elsayed
    Jallad, Samer
    Bhardwa, Jeetesh
    Sharma, Harbinder
    Kalsi, Jas
    JOURNAL OF CLINICAL UROLOGY, 2024,
  • [2] Can ChatGPT 4.0 Diagnose Acute Aortic Dissection? Integrating Artificial Intelligence into Medical Diagnostics
    Goyal, Aman
    Tariq, Muhammad Daoud
    Ahsan, Areeba
    Brateanu, Andrei
    AMERICAN JOURNAL OF CARDIOLOGY, 2025, 239 : 90 - 92
  • [3] Evaluation of the quality and quantity of artificial intelligence-generated responses about anesthesia and surgery: using ChatGPT 3.5 and 4.0
    Choi, Jisun
    Oh, Ah Ran
    Park, Jungchan
    Kang, Ryung A.
    Yoo, Seung Yeon
    Lee, Dong Jae
    Yang, Kwangmo
    FRONTIERS IN MEDICINE, 2024, 11
  • [4] Artificial intelligence in orthopaedics: can Chat Generative Pre-trained Transformer (ChatGPT) pass Section 1 of the Fellowship of the Royal College of Surgeons (Trauma & Orthopaedics) examination?
    Cuthbert, Rory
    Simpson, Ashley, I
    POSTGRADUATE MEDICAL JOURNAL, 2023, 99 (1176) : 1110 - 1114
  • [5] Artificial Intelligence and Objective Structured Clinical Examinations: Using ChatGPT to Revolutionize Clinical Skills Assessment in Medical Education
    Misra, Sanghamitra M.
    Suresh, Srinivasan
    JOURNAL OF MEDICAL EDUCATION AND CURRICULAR DEVELOPMENT, 2024, 11
  • [6] Artificial fi cial Intelligence in Orthopaedics: Performance of ChatGPT on Text and Image Questions on a Complete AAOS Orthopaedic In-Training Examination (OITE)
    Hayes, Daniel S.
    Foster, Brian K.
    Makar, Gabriel
    Manzar, Shahid
    Ozdag, Yagiz
    Shultz, Mason
    Klena, Joel C.
    Grandizio, Louis C.
    JOURNAL OF SURGICAL EDUCATION, 2024, 81 (11) : 1645 - 1649
  • [7] Artificial intelligence in global health equity: an evaluation and discussion on the application of ChatGPT, in the Chinese National Medical Licensing Examination
    Tong, Wenting
    Guan, Yongfu
    Chen, Jinping
    Huang, Xixuan
    Zhong, Yuting
    Zhang, Changrong
    Zhang, Hui
    FRONTIERS IN MEDICINE, 2023, 10
  • [8] ChatGPT-4: An assessment of an upgraded artificial intelligence chatbot in the United States Medical Licensing Examination
    Mihalache, Andrew
    Huang, Ryan S.
    Popovic, Marko M.
    Muni, Rajeev H.
    MEDICAL TEACHER, 2024, 46 (03) : 366 - 372