Mind

被引:12
作者
Ayoub, Marc [1 ,2 ]
Ballout, Ahmad A. [3 ]
Zayek, Rosana A. [4 ]
Ayoub, Noel F. [5 ]
机构
[1] Northshore Univ Hosp, Neurocrit Care, Northwell, Manhasset, NY USA
[2] Elmhurst Hosp Ctr, Mt Sinai Sch Med, Internal Med, New York, NY USA
[3] Donald & Barbara Zucker Sch Med Hofstra Northwell, Neurol, Long Isl City, NY USA
[4] Torrance Mem Med Ctr, Internal Med, Torrance, CA USA
[5] Stanford Hlth Care, Otolaryngol Head & Neck Surg, Palo Alto, CA 94305 USA
关键词
large language model; generative artificial intelligence; triage; clinical decision support system; artificial intelligence in healthcare; chatgpt;
D O I
10.7759/cureus.43690
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Background Generative artificial intelligence (AI) has integrated into various industries as it has demonstrated enormous potential in automating elaborate processes and enhancing complex decision-making. The ability of these chatbots to critically triage, diagnose, and manage complex medical conditions, remains unknown and requires further research.Objective This cross-sectional study sought to quantitatively analyze the appropriateness of ChatGPT (OpenAI, San Francisco, CA, US) in its ability to triage, synthesize differential diagnoses, and generate treatment plans for nine diverse but common clinical scenarios.Methods Various common clinical scenarios were developed. Each was input into ChatGPT, and the chatbot was asked to develop diagnostic and treatment plans. Five practicing physicians independently scored ChatGPT's responses to the clinical scenarios.Results The average overall score for the triage ranking was 4.2 (SD 0.7). The lowest overall score was for the completeness of the differential diagnosis at 4.1 (0.5). The highest overall scores were seen with the accuracy of the differential diagnosis, initial treatment plan, and overall usefulness of the response (all with an average score of 4.4). Variance among physician scores ranged from 0.24 for accuracy of the differential diagnosis to 0.49 for appropriateness of triage ranking.Discussion ChatGPT has the potential to augment clinical decision-making. More extensive research, however, is needed to ensure accuracy and appropriate recommendations are provided.
引用
收藏
页数:4
相关论文
共 9 条
[1]   Will ChatGPT transform healthcare? [J].
不详 .
NATURE MEDICINE, 2023, 29 (03) :505-506
[2]   Comparing Physician and Artificial Intelligence Chatbot Responses to Patient Questions Posted to a Public Social Media Forum [J].
Ayers, John W. ;
Poliak, Adam ;
Dredze, Mark ;
Leas, Eric C. ;
Zhu, Zechariah ;
Kelley, Jessica B. ;
Faix, Dennis J. ;
Goodman, Aaron M. ;
Longhurst, Christopher A. ;
Hogarth, Michael ;
Smith, Davey M. .
JAMA INTERNAL MEDICINE, 2023, 183 (06) :589-596
[3]   Comparison Between ChatGPT and Google Search as Sources of Postoperative Patient Instructions [J].
Ayoub, Noel F. ;
Lee, Yu-Jin ;
Grimm, David ;
Balakrishnan, Karthik .
JAMA OTOLARYNGOLOGY-HEAD & NECK SURGERY, 2023, 149 (06) :556-+
[4]   ChatGPT in medicine: an overview of its applications, advantages, limitations, future prospects, and ethical considerations [J].
Dave, Tirth ;
Athaluri, Sai Anirudh ;
Singh, Satyam .
FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2023, 6
[5]   Prompt Engineering with ChatGPT: A Guide for Academic Writers [J].
Giray, Louie .
ANNALS OF BIOMEDICAL ENGINEERING, 2023, 51 (12) :2629-2633
[6]   Performance of ChatGPT on USMLE: Potential for AI-assisted medical education using large language models [J].
Kung, Tiffany H. ;
Cheatham, Morgan ;
Medenilla, Arielle ;
Sillos, Czarina ;
De Leon, Lorie ;
Elepano, Camille ;
Madriaga, Maria ;
Aggabao, Rimel ;
Diaz-Candido, Giezel ;
Maningo, James ;
Tseng, Victor .
PLOS DIGITAL HEALTH, 2023, 2 (02)
[7]   Medical error-the third leading cause of death in the US [J].
Makary, Martin A. ;
Daniel, Michael .
BMJ-BRITISH MEDICAL JOURNAL, 2016, 353
[8]   Artificial intelligence to diagnose ischemic stroke and identify large vessel occlusions: a systematic review [J].
Murray, Nick M. ;
Unberath, Mathias ;
Hager, Gregory D. ;
Hui, Ferdinand K. .
JOURNAL OF NEUROINTERVENTIONAL SURGERY, 2020, 12 (02) :156-+
[9]   Appropriateness of Cardiovascular Disease Prevention Recommendations Obtained From a Popular Online Chat-Based Artificial Intelligence Model [J].
Sarraju, Ashish ;
Bruemmer, Dennis ;
Van Iterson, Erik ;
Cho, Leslie ;
Rodriguez, Fatima ;
Laffin, Luke .
JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2023, 329 (10) :842-844