Can Large Language Models Serve as Reliable Tools for Information in Dentistry? A Systematic Review

被引:0
作者
Alhazmi, Nora [1 ]
Alshehri, Aram [2 ]
Bahammam, Fahad [2 ]
Philip, Manju [3 ]
Nadeem, Muhammad [4 ]
Khanagar, Sanjeev [1 ]
机构
[1] King Saud bin Abdulaziz Univ Hlth Sci, King Abdullah Int Med Res Ctr, Dept Prevent Dent Sci, Minist Natl Guard Hlth Affairs,Coll Dent, Riyadh, Saudi Arabia
[2] Minist Natl Guard Hlth Affairs, King Abdullah Int Med Res Ctr, Hlth Sci, Riyadh, Saudi Arabia
[3] King Saud bin Abdulaziz Univ Hlth Sci, King Abdullah Int Med Res Ctr, Dept Maxillofacial Surg & Diagnost Sci, Minist Natl Guard Hlth Affairs, Riyadh, Saudi Arabia
[4] King Saud bin Abdulaziz Univ Hlth Sci, Coll Dent, King Abdullah Int Med Res Ctr, Minist Natl Guard Hlth Affairs, Riyadh, Saudi Arabia
关键词
Large language models; Dentistry; Performance; Accuracy; ARTIFICIAL-INTELLIGENCE; CHATGPT; QUESTIONS; RESPONSES;
D O I
10.1016/j.identj.2025.04.015
中图分类号
R78 [口腔科学];
学科分类号
1003 ;
摘要
Large language models (LLMs) have gained popularity among dental students for generating subject-related answers. However, their widespread use raises significant concerns about misinformation. This systematic review aims to critically evaluate studies assessing the performance of LLMs in dentistry. A comprehensive electronic search was conducted in PubMed/ Medline, Scopus, Embase, Web of Science, Google Scholar, and the Saudi Digital Library to identify studies published up to September 2024. The study quality was assessed using the Prediction Model Risk of Bias Assessment Tool (PROBAST). A total of 2030 studies have been identified. After removing 907 duplicate records, 1123 studies remained for screening. Ultimately, 31 studies met the inclusion criteria. Approximately half of these studies were classified as "high risk," while the remainder were classified as "low risk." The applicability of the findings was rated as "low concern." The primary limitations of LLMs include their inability to specify information sources and their tendency to generate fabricated citations. Based on this review, LLMs hold promise as supplementary educational tools in dentistry. Evidence indicates that students using LLMs may achieve improved academic performance compared to traditional methods. However, concerns about occasional inaccuracies and unreliable citations underscore the need for further research, integration with validated sources, and adherence to ethical guidelines. Ultimately, LLMs should be viewed as complementary tools within dental education, with careful consideration of their limitations. (c) 2025 The Authors. Published by Elsevier Inc. on behalf of FDI World Dental Federation. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
引用
收藏
页数:16
相关论文
共 57 条
[1]   Large Language Models in Medical Education: Opportunities, Challenges, and Future Directions [J].
Abd-alrazaq, Alaa ;
AlSaad, Rawan ;
Alhuwail, Dari ;
Ahmed, Arfan ;
Healy, Padraig Mark ;
Latifi, Syed ;
Aziz, Sarah ;
Damseh, Rafat ;
Alrazak, Sadam Alabed ;
Sheikh, Javaid .
JMIR MEDICAL EDUCATION, 2023, 9
[2]   Comparing Artificial Intelligence and Senior Residents in Oral Lesion Diagnosis: A Comparative Study [J].
Albagieh, Hamad ;
Alzeer, Zaid O. ;
Alasmari, Osama N. ;
Alkadhi, Abdullah A. ;
Naitah, Abdulaziz N. ;
Almasaad, Khaled F. ;
Alshahrani, Turki S. ;
Alshahrani, Khalid S. ;
Almahmoud, Mohammed I. .
CUREUS JOURNAL OF MEDICAL SCIENCE, 2024, 16 (01)
[3]   ChatGPT-A double-edged sword for healthcare education? Implications for assessments of dental students [J].
Ali, Kamran ;
Barhom, Noha ;
Tamimi, Faleh ;
Duggal, Monty .
EUROPEAN JOURNAL OF DENTAL EDUCATION, 2024, 28 (01) :206-211
[4]   Evaluation of AI-generated responses by different artificial intelligence chatbots to the clinical decision-making case-based questions in oral and maxillofacial surgery [J].
Azadi, Ali ;
Gorjinejad, Fatemeh ;
Mohammad-Rahimi, Hossein ;
Tabrizi, Reza ;
Alam, Mostafa ;
Golkar, Mohsen .
ORAL SURGERY ORAL MEDICINE ORAL PATHOLOGY ORAL RADIOLOGY, 2024, 137 (06) :587-593
[5]  
BaHammam F, 2024, OSF, P30, DOI [10.17605/OSF.IO/J4V8C, DOI 10.17605/OSF.IO/J4V8C]
[6]   Artificial Intelligence in Medicine [J].
Beam, Andrew L. ;
Drazen, Jeffrey M. ;
Kohane, Isaac S. ;
Leong, Tze-Yun ;
Manrai, Arjun K. ;
Rubin, Eric J. .
NEW ENGLAND JOURNAL OF MEDICINE, 2023, 388 (13) :1220-1221
[7]   Assessing the performance of Bing Chat artificial intelligence: Dental exams, clinical guidelines, and patients' frequent questions [J].
Brozovic, Juraj ;
Mikulic, Barbara ;
Tomas, Matej ;
Juzbas, Martina ;
Blaskovic, Marko .
JOURNAL OF DENTISTRY, 2024, 144
[8]   Performance of Generative Artificial Intelligence in Dental Licensing Examinations [J].
Chau, Reinhard Chun Wang ;
Thu, Khaing Myat ;
Yu, Ollie Yiru ;
Hsung, Richard Tai-Chiu ;
Lo, Edward Chin Man ;
Lam, Walter Yu Hang .
INTERNATIONAL DENTAL JOURNAL, 2024, 74 (03) :616-621
[9]   Artificial intelligence in dental education: ChatGPT's performance on the periodontic in-service examination [J].
Danesh, Arman ;
Pazouki, Hirad ;
Danesh, Farzad ;
Danesh, Arsalan ;
Vardar-Sengul, Saynur .
JOURNAL OF PERIODONTOLOGY, 2024, 95 (07) :682-687
[10]   The performance of arti fi cial intelligence models in generating responses to general orthodontic questions: ChatGPT vs Google Bard [J].
Daraqel, Baraa ;
Wafaie, Khaled ;
Mohammed, Hisham ;
Cao, Li ;
Mheissen, Samer ;
Liu, Yang ;
Zheng, Leilei .
AMERICAN JOURNAL OF ORTHODONTICS AND DENTOFACIAL ORTHOPEDICS, 2024, 165 (06) :652-662