Charting new AI education in gastroenterology: Cross-sectional evaluation of ChatGPT and perplexity AI in medical residency exam

被引:18
作者
Gravina, Antonietta Gerarda [1 ]
Pellegrino, Raffaele [1 ]
Palladino, Giovanna [1 ]
Imperio, Giuseppe [1 ]
Ventura, Andrea [1 ]
Federico, Alessandro [1 ]
机构
[1] Univ Campania Luigi Vanvitelli, Dept Precis Med, Hepatogastroenterol Div, Via Luigi de Crecchio, I-80138 Naples, Italy
关键词
Artificial intelligence; Chatbots; Education; Medical residency;
D O I
10.1016/j.dld.2024.02.019
中图分类号
R57 [消化系及腹部疾病];
学科分类号
摘要
Background: Conversational chatbots, fueled by large language models, spark debate over their potential in education and medical career exams. There is debate in the literature about the scientific integrity of the outputs produced by these chatbots. Aims: This study evaluates ChatGPT 3.5 and Perplexity AI's cross-sectional performance in responding to questions from the 2023 Italian national residency admission exam (SSM23), comparing results and chatbots' concordance with previous years SSMs. Methods: Gastroenterology-related SSM23 questions were input into ChatGPT 3.5 and Perplexity AI, evaluating their performance in correct responses and total scores. This process was repeated with questions from the three preceding years. Additionally, chatbot concordance was assessed using Cohen's method. Results: In SSM23, ChatGPT 3.5 outperforms Perplexity AI with 94.11% correct responses, demonstrating consistency across years. Concordance weakened in 2023 ( kappa= 0.203, P = 0.148), but ChatGPT consistently maintains a high standard compared to Perplexity AI. Conclusion: ChatGPT 3.5 and Perplexity AI exhibit promise in addressing gastroenterological queries, emphasizing potential educational roles. However, their variable performance mandates cautious use as supplementary tools alongside conventional study methods. Clear guidelines are crucial for educators to balance traditional approaches and innovative systems, enhancing educational standards. (c) 2024 Editrice Gastroenterologica Italiana S.r.l. Published by Elsevier Ltd. All rights reserved.
引用
收藏
页码:1304 / 1311
页数:8
相关论文
共 20 条
  • [2] Artificial Hallucinations in ChatGPT: Implications in Scientific Writing
    Alkaissi, Hussam
    McFarlane, Samy I.
    [J]. CUREUS JOURNAL OF MEDICAL SCIENCE, 2023, 15 (02)
  • [3] How Does ChatGPT Perform on the Italian Residency Admission National Exam Compared to 15,869 Medical Graduates?
    Bonetti, Mario Alessandri
    Giorgino, Riccardo
    Afflitto, Gabriele Gallo
    De Lorenzi, Francesca
    Egro, Francesco M.
    [J]. ANNALS OF BIOMEDICAL ENGINEERING, 2024, 52 (04) : 745 - 749
  • [4] Evaluating the Feasibility of ChatGPT in Healthcare: An Analysis of Multiple Clinical and Research Scenarios
    Cascella, Marco
    Montomoli, Jonathan
    Bellini, Valentina
    Bignami, Elena
    [J]. JOURNAL OF MEDICAL SYSTEMS, 2023, 47 (01)
  • [5] Artificial intelligence-based text generators in hepatology: ChatGPT is just the beginning
    Ge, Jin
    Lai, Jennifer C.
    [J]. HEPATOLOGY COMMUNICATIONS, 2023, 7 (04)
  • [6] A SWOT (Strengths, Weaknesses, Opportunities, and Threats) Analysis of ChatGPT in the Medical Literature: Concise Review
    Goedde, Daniel
    Noehl, Sophia
    Wolf, Carina
    Rupert, Yannick
    Rimkus, Lukas
    Ehlers, Jan
    Breuckmann, Frank
    Sellmann, Timur
    [J]. JOURNAL OF MEDICAL INTERNET RESEARCH, 2023, 25
  • [7] May ChatGPT be a tool producing medical information for common inflammatory bowel disease patients' questions? An evidence-controlled analysis
    Gravina, Antonietta Gerarda
    Pellegrino, Raffaele
    Cipullo, Marina
    Palladino, Giovanna
    Imperio, Giuseppe
    Ventura, Andrea
    Auletta, Salvatore
    Ciamarra, Paola
    Federico, Alessandro
    [J]. WORLD JOURNAL OF GASTROENTEROLOGY, 2024, 30 (01) : 17 - 33
  • [8] Evaluating the Efficacy of ChatGPT in Navigating the Spanish Medical Residency Entrance Examination (MIR): Promising Horizons for AI in Clinical Medicine
    Guillen-Grima, Francisco
    Guillen-Aguinaga, Sara
    Guillen-Aguinaga, Laura
    Alas-Brun, Rosa
    Onambele, Luc
    Ortega, Wilfrido
    Montejo, Rocio
    Aguinaga-Ontoso, Enrique
    Barach, Paul
    Aguinaga-Ontoso, Ines
    [J]. CLINICS AND PRACTICE, 2023, 13 (06) : 1460 - 1487
  • [9] Survey of Hallucination in Natural Language Generation
    Ji, Ziwei
    Lee, Nayeon
    Frieske, Rita
    Yu, Tiezheng
    Su, Dan
    Xu, Yan
    Ishii, Etsuko
    Bang, Ye Jin
    Madotto, Andrea
    Fung, Pascale
    [J]. ACM COMPUTING SURVEYS, 2023, 55 (12)
  • [10] Use of ChatGPT on Taiwan's Examination for Medical Doctors
    Kao, Yung-Shuo
    Chuang, Wei-Kai
    Yang, Jen
    [J]. ANNALS OF BIOMEDICAL ENGINEERING, 2024, 52 (03) : 455 - 457