ScholarGPT's performance in oral and maxillofacial surgery

被引：2

作者：

Balel, Yunus ^{[1
]}

机构：

[1] Sivas Cumhuriyet Univ, Fac Dent, Dept Oral & Maxillofacial Surg, TR-58000 Sivas, Turkiye

来源：

JOURNAL OF STOMATOLOGY ORAL AND MAXILLOFACIAL SURGERY | 2025年 / 126卷 / 04期

关键词：

Artificial intelligence; GPT; Quality; CHALLENGES;

D O I：

10.1016/j.jormas.2024.102114

中图分类号：

R78 [口腔科学];

学科分类号：

1003 ;

摘要：

Objective: The purpose of this study is to evaluate the performance of Scholar GPT in answering technical questions in the field of oral and maxillofacial surgery and to conduct a comparative analysis with the results of a previous study that assessed the performance of ChatGPT. Materials and Methods: Scholar GPT was accessed via ChatGPT (www.chatgpt.com) on March 20, 2024. A total of 60 technical questions (15 each on impacted teeth, dental implants, temporomandibular joint disorders, and orthognathic surgery) from our previous study were used. Scholar GPT's responses were evaluated using a modified Global Quality Scale (GQS). The questions were randomized before scoring using an online randomizer (www.randomizer.org). A single researcher performed the evaluations at three different times, three weeks apart, with each evaluation preceded by a new randomization. In cases of score discrepancies, a fourth evaluation was conducted to determine the final score. Results: Scholar GPT performed well across all technical questions, with an average GQS score of 4.48 (SD=0.93). Comparatively, ChatGPT's average GQS score in previous study was 3.1 (SD=1.492). The Wilcoxon Signed-Rank Test indicated a statistically significant higher average score for Scholar GPT compared to ChatGPT (Mean Difference = 2.00, SE = 0.163, p < 0.001). The Kruskal-Wallis Test showed no statistically significant differences among the topic groups (x(2) = 0.799, df= 3, p = 0.850, epsilon(2) = 0.0135). Conclusion: Scholar GPT demonstrated a generally high performance in technical questions within oral and maxillofacial surgery and produced more consistent and higher-quality responses compared to ChatGPT. The findings suggest that GPT models based on academic databases can provide more accurate and reliable information. Additionally, developing a specialized GPT model for oral and maxillofacial surgery could ensure higher quality and consistency in artificial intelligence-generated information. (c) 2024 Elsevier Masson SAS. All rights are reserved, including those for text and data mining, AI training, and similar technologies.

引用

页数：4

共 22 条

[1] Aydin N, 2022, 2022 3 INT INF SOFTW, P1
[2] Comparison Between ChatGPT and Google Search as Sources of Postoperative Patient Instructions
Ayoub, Noel F.
Lee, Yu-Jin
Grimm, David
Balakrishnan, Karthik
[J]. JAMA OTOLARYNGOLOGY-HEAD & NECK SURGERY, 2023, 149 (06) : 556 - +
[3] Can ChatGPT be used in oral and maxillofacial surgery?
Balel, Yunus
[J]. JOURNAL OF STOMATOLOGY ORAL AND MAXILLOFACIAL SURGERY, 2023, 124 (05)
[4] Bohr A, 2020, Artificial Intelligence in Healthcare, P25, DOI DOI 10.1016/B978-0-12-818438-7.00002-2
[5] Chowdhary KR., 2020, FUNDAMENTALS ARTIFIC, P603, DOI [DOI 10.1007/978-81-322-3972-7_19, DOI 10.1007/978-81-322-3972-719]
[6] Artificial intelligence for decision making in the era of Big Data - evolution, challenges and research agenda
Duan, Yanqing
Edwards, John S.
Dwivedi, Yogesh K.
[J]. INTERNATIONAL JOURNAL OF INFORMATION MANAGEMENT, 2019, 48 : 63 - 71
[7] GARFIELD E, 1974, CURR CONTENTS, P5
[8] Gromova E.A., 2023, REV BRASILEIRA ALTER, V5, P153, DOI DOI 10.52028/RBADR.V5I10
[9] Habuza Tetiana, 2021, Informatics in Medicine Unlocked, V24, DOI 10.1016/j.imu.2021.100596
[10] Teaching and learning in digital environments: The resurgence of resource-based learning
Hill, JR
Hannafin, MJ
[J]. ETR&D-EDUCATIONAL TECHNOLOGY RESEARCH AND DEVELOPMENT, 2001, 49 (03): : 37 - 52

← 1 2 3 →