ChatGPT vs UpToDate: comparative study of usefulness and reliability of Chatbot in common clinical presentations of otorhinolaryngology-head and neck surgery

被引:10
作者
Karimov, Ziya [1 ]
Allahverdiyev, Irshad [2 ]
Agayarov, Ozlem Yagiz [3 ]
Demir, Dogukan [3 ]
Almuradova, Elvina [4 ,5 ]
机构
[1] Ege Univ, Med Program, Fac Med, TR-35100 Izmir, Turkiye
[2] Istanbul Univ, Istanbul Fac Med, Program Med, Istanbul, Turkiye
[3] Hlth Sci Univ, Izmir Tepecik Educ & Res Hosp, Dept Otolaryngol Head & Neck Surg, Izmir, Turkiye
[4] Ege Univ, Fac Med, Dept Med Oncol, Izmir, Turkiye
[5] Medicana Int Hosp, Dept Oncol, Izmir, Turkiye
关键词
Artificial intelligence; Chatbot; ChatGPT; ENT; UpToDate; Otorhinolaryngology and head and neck surgery; EPIDEMIOLOGY; AGREEMENT;
D O I
10.1007/s00405-023-08423-w
中图分类号
R76 [耳鼻咽喉科学];
学科分类号
100213 ;
摘要
Purpose The usage of Chatbots as a kind of Artificial Intelligence in medicine is getting to increase in recent years. UpToDate (R) is another well-known search tool established on evidence-based knowledge and is used daily by doctors worldwide. In this study, we aimed to investigate the usefulness and reliability of ChatGPT compared to UpToDate in Otorhinolaryngology and Head and Neck Surgery (ORL-HNS).Materials and methods ChatGPT-3.5 and UpToDate were interrogated for the management of 25 common clinical case scenarios (13 males/12 females) recruited from literature considering the daily observation at the Department of Otorhinolaryngology of Ege University Faculty of Medicine. Scientific references for the management were requested for each clinical case. The accuracy of the references in the ChatGPT answers was assessed on a 0-2 scale and the usefulness of the ChatGPT and UpToDate answers was assessed with 1-3 scores by reviewers. UpToDate and ChatGPT 3.5 responses were compared.Results ChatGPT did not give references in some questions in contrast to UpToDate. Information on the ChatGPT was limited to 2021. UpToDate supported the paper with subheadings, tables, figures, and algorithms. The mean accuracy score of references in ChatGPT answers was 0.25-weak/unrelated. The median (Q1-Q3) was 1.00 (1.25-2.00) for ChatGPT and 2.63 (2.75-3.00) for UpToDate, the difference was statistically significant (p < 0.001). UpToDate was observed more useful and reliable than ChatGPT.Conclusions ChatGPT has the potential to support the physicians to find out the information but our results suggest that ChatGPT needs to be improved to increase the usefulness and reliability of medical evidence-based knowledge.
引用
收藏
页码:2145 / 2151
页数:7
相关论文
共 35 条
[1]   How doctors make use of online, point-of-care clinical decision support systems: a case study of UpToDate© [J].
Addison, John ;
Whitcombe, Jo ;
Glover, Steven William .
HEALTH INFORMATION AND LIBRARIES JOURNAL, 2013, 30 (01) :13-22
[2]   A comparison of answer retrieval through four evidence-based textbooks (ACP PIER, Essential Evidence Plus, First Consult, and UpToDate): A randomized controlled trial [J].
Ahmadi, Seyed-Foad ;
Faghankhani, Masoomeh ;
Javanbakht, Anna ;
Akbarshahi, Maryam ;
Mirghorbani, Maryam ;
Safarnejad, Bahareh ;
Baradaran, Hamid .
MEDICAL TEACHER, 2011, 33 (09) :724-730
[3]  
[Anonymous], 2023, UPTODATE SUBSCRIPTIO
[4]  
[Anonymous], 2023, PRICING
[5]   Ethical Considerations in the Advent of Artificial Intelligence in Otolaryngology [J].
Arambula, Alexandra M. ;
Bur, Andres M. .
OTOLARYNGOLOGY-HEAD AND NECK SURGERY, 2020, 162 (01) :38-39
[6]   Head-to-Head Comparison of ChatGPT Versus Google Search for Medical Knowledge Acquisition [J].
Ayoub, Noel F. ;
Lee, Yu-Jin ;
Grimm, David ;
Divi, Vasu .
OTOLARYNGOLOGY-HEAD AND NECK SURGERY, 2024, 170 (06) :1484-1491
[7]   Association of a clinical knowledge support system with improved patient safety, reduced complications and shorter length of stay among Medicare beneficiaries in acute care hospitals in the United States [J].
Bonis, Peter A. ;
Pickens, Gary T. ;
Rind, David M. ;
Foster, David A. .
INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2008, 77 (11) :745-753
[8]   The role of ChatGPT in enhancing ENT surgical training - a trainees' perspective [J].
Brennan, Laura ;
Balakumar, Ramkishan ;
Bennett, Warren .
JOURNAL OF LARYNGOLOGY AND OTOLOGY, 2023, :480-486
[9]   Clinical Practice Guideline: Sudden Hearing Loss (Update) [J].
Chandrasekhar, Sujana S. ;
Do, Betty S. Tsai ;
Schwartz, Seth R. ;
Bontempo, Laura J. ;
Faucett, Erynne A. ;
Finestone, Sandra A. ;
Hollingsworth, Deena B. ;
Kelley, David M. ;
Kmucha, Steven T. ;
Moonis, Gul ;
Poling, Gayla L. ;
Roberts, J. Kirk ;
Stachler, Robert J. ;
Zeitler, Daniel M. ;
Corrigan, Maureen D. ;
Nnacheta, Lorraine C. ;
Satterfield, Lisa .
OTOLARYNGOLOGY-HEAD AND NECK SURGERY, 2019, 161 :S1-S45
[10]   Exploring the potential of Chat-GPT as a supportive tool for sialendoscopy clinical decision making and patient information support [J].
Chiesa-Estomba, Carlos M. ;
Lechien, Jerome R. ;
Vaira, Luigi A. ;
Brunet, Aina ;
Cammaroto, Giovanni ;
Mayo-Yanez, Miguel ;
Sanchez-Barrueco, Alvaro ;
Saga-Gutierrez, Carlos .
EUROPEAN ARCHIVES OF OTO-RHINO-LARYNGOLOGY, 2023, 281 (4) :2081-2086