Performance of large language models in oral and maxillofacial surgery examinations

被引:2
作者
Quah, B. [1 ,2 ]
Yong, C. W. [1 ,2 ]
Lai, C. W. M. [1 ]
Islam, I. [1 ,2 ]
机构
[1] Natl Univ Singapore, Fac Dent, 9 Lower Kent Ridge Rd, Singapore 119085, Singapore
[2] Natl Univ Ctr Oral Hlth, Discipline Oral & Maxillofacial Surg, Singapore, Singapore
关键词
Artificial intelligence; Oral surgery; Dental education; Academic performance; Dentistry;
D O I
10.1016/j.ijom.2024.06.003
中图分类号
R78 [口腔科学];
学科分类号
1003 ;
摘要
This study aimed to determine the accuracy of large language models (LLMs) in answering oral and maxillofacial surgery (OMS) multiple choice questions. A total of 259 questions from the university's question bank were answered by the LLMs (GPT-3.5, GPT-4, Llama 2, Gemini, and Copilot). The scores per category as well as the total score out of 259 were recorded and evaluated, with the passing score set at 50%. The mean overall score amongst all LLMs was 62.5%. GPT-4 performed the best (76.8%, 95% confidence interval (CI) 71.4-82.2%), followed by Copilot (72.6%, 95% CI 67.2-78.0%), GPT-3.5 (62.2%, 95% CI 56.4-68.0%), Gemini (58.7%, 95% CI 52.9-64.5%), and Llama 2 (42.5%, 95% CI 37.1-48.6%). There was a statistically significant difference between the scores of the five LLMs overall (chi(2) = 79.9, df = 4, P < 0.001) and within all categories except 'basic sciences' (P = 0.129), 'dentoalveolar and implant surgery' (P = 0.052), and 'oral medicine/pathology/radiology' (P = 0.801). The LLMs performed best in 'basic sciences' (68.9%) and poorest in 'pharmacology' (45.9%). The LLMs can be used as adjuncts in teaching, but should not be used for clinical decision-making until the models are further developed and validated.
引用
收藏
页码:881 / 886
页数:6
相关论文
共 50 条
  • [31] Prospective Implementation of Correction for Guessing in Oral and Maxillofacial Pathology Multiple-Choice Examinations: Did Student Performance Improve?
    Prihoda, Thomas J.
    Pinckard, R. Neal
    McMahan, C. Alex
    Littlefield, John H.
    Jones, Anne Cale
    JOURNAL OF DENTAL EDUCATION, 2008, 72 (10) : 1149 - 1159
  • [32] A systematic approach to improve oral and maxillofacial surgery education
    Rosen, A.
    Fors, U.
    Zary, N.
    Sejersen, R.
    Lund, B.
    EUROPEAN JOURNAL OF DENTAL EDUCATION, 2011, 15 (04) : 223 - 230
  • [33] A nationwide survey of undergraduate training in oral and maxillofacial surgery
    Seifert L.B.
    Hoefer S.H.
    Flammiger S.
    Rüsseler M.
    Thieringer F.
    Ehrenfeld M.
    Sader R.
    Oral and Maxillofacial Surgery, 2018, 22 (3) : 289 - 296
  • [34] An opinion on education and scope of oral and maxillofacial surgery in China
    Chen, Jiayi
    JOURNAL OF DENTAL SCIENCES, 2025, 20 (02) : 1317 - 1317
  • [35] Medication for Gravid and Nursing Oral and Maxillofacial Surgery Patients
    Nudell, Yoav
    Miller, Jared
    ORAL AND MAXILLOFACIAL SURGERY CLINICS OF NORTH AMERICA, 2022, 34 (01) : 201 - 212
  • [36] INNOVATIONS IN ORAL AND MAXILLOFACIAL SURGERY: BIOMIMETICS MEETS PHYSIOLOGY
    Cicconetti, A.
    Passaretti, A.
    Rastelli, C.
    Rastelli, E.
    Fausi, G.
    JOURNAL OF BIOLOGICAL REGULATORS AND HOMEOSTATIC AGENTS, 2019, 33 (05) : 1609 - 1613
  • [37] Qualitative comparison of curricula in oral and maxillofacial surgery training. Part 2: oral surgery
    Walker, T. W. M.
    Varley, T. S.
    Argiris, K.
    Magennis, P.
    BRITISH JOURNAL OF ORAL & MAXILLOFACIAL SURGERY, 2012, 50 (05) : 468 - 469
  • [38] The global reach of social media in oral and maxillofacial surgery
    Harris, Jack A.
    Beck, Nicole A.
    Niedziela, Cassi J.
    Alvarez, Gerardo A.
    Danquah, Sheridan A.
    Afshar, Salim
    ORAL AND MAXILLOFACIAL SURGERY-HEIDELBERG, 2023, 27 (03): : 513 - 517
  • [39] Efficacy/Safety of the Use of Glucocorticoids in Oral and Maxillofacial Surgery
    Nils, Heilyn Joanna
    Arce Recatala, Cristina
    Castano, Antonio
    Ribas, David
    Flores-Fraile, Javier
    DENTISTRY JOURNAL, 2023, 11 (10)
  • [40] Barriers to research among residents in oral and maxillofacial surgery
    Ho, Annie H.
    Sansevere, Matthew J.
    Chou, Joli C.
    JOURNAL OF DENTAL EDUCATION, 2024, 88 (06) : 755 - 764