Charting new AI education in gastroenterology: Cross-sectional evaluation of ChatGPT and perplexity AI in medical residency exam

被引：18

作者：

Gravina, Antonietta Gerarda ^{[1
]}

Pellegrino, Raffaele ^{[1
]}

Palladino, Giovanna ^{[1
]}

Imperio, Giuseppe ^{[1
]}

Ventura, Andrea ^{[1
]}

Federico, Alessandro ^{[1
]}

机构：

[1] Univ Campania Luigi Vanvitelli, Dept Precis Med, Hepatogastroenterol Div, Via Luigi de Crecchio, I-80138 Naples, Italy

来源：

DIGESTIVE AND LIVER DISEASE | 2024年 / 56卷 / 08期

关键词：

Artificial intelligence; Chatbots; Education; Medical residency;

D O I：

10.1016/j.dld.2024.02.019

中图分类号：

R57 [消化系及腹部疾病];

学科分类号：

摘要：

Background: Conversational chatbots, fueled by large language models, spark debate over their potential in education and medical career exams. There is debate in the literature about the scientific integrity of the outputs produced by these chatbots. Aims: This study evaluates ChatGPT 3.5 and Perplexity AI's cross-sectional performance in responding to questions from the 2023 Italian national residency admission exam (SSM23), comparing results and chatbots' concordance with previous years SSMs. Methods: Gastroenterology-related SSM23 questions were input into ChatGPT 3.5 and Perplexity AI, evaluating their performance in correct responses and total scores. This process was repeated with questions from the three preceding years. Additionally, chatbot concordance was assessed using Cohen's method. Results: In SSM23, ChatGPT 3.5 outperforms Perplexity AI with 94.11% correct responses, demonstrating consistency across years. Concordance weakened in 2023 ( kappa= 0.203, P = 0.148), but ChatGPT consistently maintains a high standard compared to Perplexity AI. Conclusion: ChatGPT 3.5 and Perplexity AI exhibit promise in addressing gastroenterological queries, emphasizing potential educational roles. However, their variable performance mandates cautious use as supplementary tools alongside conventional study methods. Clear guidelines are crucial for educators to balance traditional approaches and innovative systems, enhancing educational standards. (c) 2024 Editrice Gastroenterologica Italiana S.r.l. Published by Elsevier Ltd. All rights reserved.

引用

页码：1304 / 1311

页数：8

共 20 条

[1] Exploring ChatGPT for information of cardiopulmonary resuscitation
Ahn, Chiwon
[J]. RESUSCITATION, 2023, 185
[2] Artificial Hallucinations in ChatGPT: Implications in Scientific Writing
Alkaissi, Hussam
McFarlane, Samy I.
[J]. CUREUS JOURNAL OF MEDICAL SCIENCE, 2023, 15 (02)
[3] How Does ChatGPT Perform on the Italian Residency Admission National Exam Compared to 15,869 Medical Graduates?
Bonetti, Mario Alessandri
Giorgino, Riccardo
Afflitto, Gabriele Gallo
De Lorenzi, Francesca
Egro, Francesco M.
[J]. ANNALS OF BIOMEDICAL ENGINEERING, 2024, 52 (04) : 745 - 749
[4] Evaluating the Feasibility of ChatGPT in Healthcare: An Analysis of Multiple Clinical and Research Scenarios
Cascella, Marco
Montomoli, Jonathan
Bellini, Valentina
Bignami, Elena
[J]. JOURNAL OF MEDICAL SYSTEMS, 2023, 47 (01)
[5] Artificial intelligence-based text generators in hepatology: ChatGPT is just the beginning
Ge, Jin
Lai, Jennifer C.
[J]. HEPATOLOGY COMMUNICATIONS, 2023, 7 (04)
[6] A SWOT (Strengths, Weaknesses, Opportunities, and Threats) Analysis of ChatGPT in the Medical Literature: Concise Review
Goedde, Daniel
Noehl, Sophia
Wolf, Carina
Rupert, Yannick
Rimkus, Lukas
Ehlers, Jan
Breuckmann, Frank
Sellmann, Timur
[J]. JOURNAL OF MEDICAL INTERNET RESEARCH, 2023, 25
[7] May ChatGPT be a tool producing medical information for common inflammatory bowel disease patients' questions? An evidence-controlled analysis
Gravina, Antonietta Gerarda
Pellegrino, Raffaele
Cipullo, Marina
Palladino, Giovanna
Imperio, Giuseppe
Ventura, Andrea
Auletta, Salvatore
Ciamarra, Paola
Federico, Alessandro
[J]. WORLD JOURNAL OF GASTROENTEROLOGY, 2024, 30 (01) : 17 - 33
[8] Evaluating the Efficacy of ChatGPT in Navigating the Spanish Medical Residency Entrance Examination (MIR): Promising Horizons for AI in Clinical Medicine
Guillen-Grima, Francisco
Guillen-Aguinaga, Sara
Guillen-Aguinaga, Laura
Alas-Brun, Rosa
Onambele, Luc
Ortega, Wilfrido
Montejo, Rocio
Aguinaga-Ontoso, Enrique
Barach, Paul
Aguinaga-Ontoso, Ines
[J]. CLINICS AND PRACTICE, 2023, 13 (06) : 1460 - 1487
[9] Survey of Hallucination in Natural Language Generation
Ji, Ziwei
Lee, Nayeon
Frieske, Rita
Yu, Tiezheng
Su, Dan
Xu, Yan
Ishii, Etsuko
Bang, Ye Jin
Madotto, Andrea
Fung, Pascale
[J]. ACM COMPUTING SURVEYS, 2023, 55 (12)
[10] Use of ChatGPT on Taiwan's Examination for Medical Doctors
Kao, Yung-Shuo
Chuang, Wei-Kai
Yang, Jen
[J]. ANNALS OF BIOMEDICAL ENGINEERING, 2024, 52 (03) : 455 - 457

← 1 2 →