ScholarGPT's performance in oral and maxillofacial surgery

被引:2
作者
Balel, Yunus [1 ]
机构
[1] Sivas Cumhuriyet Univ, Fac Dent, Dept Oral & Maxillofacial Surg, TR-58000 Sivas, Turkiye
关键词
Artificial intelligence; GPT; Quality; CHALLENGES;
D O I
10.1016/j.jormas.2024.102114
中图分类号
R78 [口腔科学];
学科分类号
1003 ;
摘要
Objective: The purpose of this study is to evaluate the performance of Scholar GPT in answering technical questions in the field of oral and maxillofacial surgery and to conduct a comparative analysis with the results of a previous study that assessed the performance of ChatGPT. Materials and Methods: Scholar GPT was accessed via ChatGPT (www.chatgpt.com) on March 20, 2024. A total of 60 technical questions (15 each on impacted teeth, dental implants, temporomandibular joint disorders, and orthognathic surgery) from our previous study were used. Scholar GPT's responses were evaluated using a modified Global Quality Scale (GQS). The questions were randomized before scoring using an online randomizer (www.randomizer.org). A single researcher performed the evaluations at three different times, three weeks apart, with each evaluation preceded by a new randomization. In cases of score discrepancies, a fourth evaluation was conducted to determine the final score. Results: Scholar GPT performed well across all technical questions, with an average GQS score of 4.48 (SD=0.93). Comparatively, ChatGPT's average GQS score in previous study was 3.1 (SD=1.492). The Wilcoxon Signed-Rank Test indicated a statistically significant higher average score for Scholar GPT compared to ChatGPT (Mean Difference = 2.00, SE = 0.163, p < 0.001). The Kruskal-Wallis Test showed no statistically significant differences among the topic groups (x(2) = 0.799, df= 3, p = 0.850, epsilon(2) = 0.0135). Conclusion: Scholar GPT demonstrated a generally high performance in technical questions within oral and maxillofacial surgery and produced more consistent and higher-quality responses compared to ChatGPT. The findings suggest that GPT models based on academic databases can provide more accurate and reliable information. Additionally, developing a specialized GPT model for oral and maxillofacial surgery could ensure higher quality and consistency in artificial intelligence-generated information. (c) 2024 Elsevier Masson SAS. All rights are reserved, including those for text and data mining, AI training, and similar technologies.
引用
收藏
页数:4
相关论文
共 22 条
  • [1] Aydin N, 2022, 2022 3 INT INF SOFTW, P1
  • [2] Comparison Between ChatGPT and Google Search as Sources of Postoperative Patient Instructions
    Ayoub, Noel F.
    Lee, Yu-Jin
    Grimm, David
    Balakrishnan, Karthik
    [J]. JAMA OTOLARYNGOLOGY-HEAD & NECK SURGERY, 2023, 149 (06) : 556 - +
  • [3] Can ChatGPT be used in oral and maxillofacial surgery?
    Balel, Yunus
    [J]. JOURNAL OF STOMATOLOGY ORAL AND MAXILLOFACIAL SURGERY, 2023, 124 (05)
  • [4] Bohr A, 2020, Artificial Intelligence in Healthcare, P25, DOI DOI 10.1016/B978-0-12-818438-7.00002-2
  • [5] Chowdhary KR., 2020, FUNDAMENTALS ARTIFIC, P603, DOI [DOI 10.1007/978-81-322-3972-7_19, DOI 10.1007/978-81-322-3972-719]
  • [6] Artificial intelligence for decision making in the era of Big Data - evolution, challenges and research agenda
    Duan, Yanqing
    Edwards, John S.
    Dwivedi, Yogesh K.
    [J]. INTERNATIONAL JOURNAL OF INFORMATION MANAGEMENT, 2019, 48 : 63 - 71
  • [7] GARFIELD E, 1974, CURR CONTENTS, P5
  • [8] Gromova E.A., 2023, REV BRASILEIRA ALTER, V5, P153, DOI DOI 10.52028/RBADR.V5I10
  • [9] Habuza Tetiana, 2021, Informatics in Medicine Unlocked, V24, DOI 10.1016/j.imu.2021.100596
  • [10] Teaching and learning in digital environments: The resurgence of resource-based learning
    Hill, JR
    Hannafin, MJ
    [J]. ETR&D-EDUCATIONAL TECHNOLOGY RESEARCH AND DEVELOPMENT, 2001, 49 (03): : 37 - 52