Assessing the precision of artificial intelligence in emergency department triage decisions: Insights from a study with ChatGPT

被引:18
作者
Paslia, Sinan [1 ,6 ]
Sahin, Abdul Samet [2 ]
Beser, Muhammet Fatih [3 ]
Topcuoglu, Hazal [4 ]
Yadigaroglu, Metin
Imamoglu, Melih [1 ,5 ]
机构
[1] Karadeniz Tech Univ, Fac Med, Dept Emergency Med, Trabzon, Turkiye
[2] Carsamba State Hosp, Emergency Dept, Samsun, Turkiye
[3] Business & Decis Life Sci, Brussels, Belgium
[4] Siirt Educ & Res Hosp, Dept Emergency Med, Siirt, Turkiye
[5] Samsun Univ, Fac Med, Dept Emergency Med, Samsun, Turkiye
[6] Karadeniz Tech Univ, Dept Emergency Med, TR-61080 Trabzon, Turkiye
关键词
Artificial intelligence; ChatGPT; Chatbot; Emergency department; Triage;
D O I
10.1016/j.ajem.2024.01.037
中图分类号
R4 [临床医学];
学科分类号
1002 ; 100602 ;
摘要
Background: The rise in emergency department presentations globally poses challenges for efficient patient management. To address this, various strategies aim to expedite patient management. Artificial intelligence's (AI) consistent performance and rapid data interpretation extend its healthcare applications, especially in emergencies. The introduction of a robust AI tool like ChatGPT, based on GPT-4 developed by OpenAI, can benefit patients and healthcare professionals by improving the speed and accuracy of resource allocation. This study examines ChatGPT's capability to predict triage outcomes based on local emergency department rules. Methods: This study is a single -center prospective observational study. The study population consists of all patients who presented to the emergency department with any symptoms and agreed to participate. The study was conducted on three non-consecutive days for a total of 72 h. Patients' chief complaints, vital parameters, medical history and the area to which they were directed by the triage team in the emergency department were recorded. Concurrently, an emergency medicine physician inputted the same data into previously trained GPT-4, according to local rules. According to this data, the triage decisions made by GPT-4 were recorded. In the same process, an emergency medicine specialist determined where the patient should be directed based on the data collected, and this decision was considered the gold standard. Accuracy rates and reliability for directing patients to specific areas by the triage team and GPT-4 were evaluated using Cohen's kappa test. Furthermore, the accuracy of the patient triage process performed by the triage team and GPT-4 was assessed by receiver operating characteristic (ROC) analysis. Statistical analysis considered a value of p < 0.05 as significant. Results: The study was carried out on 758 patients. Among the participants, 416 (54.9%) were male and 342 (45.1%) were female. Evaluating the primary endpoints of our study - the agreement between the decisions of the triage team, GPT-4 decisions in emergency department triage, and the gold standard - we observed almost perfect agreement both between the triage team and the gold standard and between GPT-4 and the gold standard (Cohen's Kappa 0.893 and 0.899, respectively; p < 0.001 for each). Conclusion: Our findings suggest GPT-4 possess outstanding predictive skills in triaging patients in an emergency setting. GPT-4 can serve as an effective tool to support the triage process. (c) 2024 Elsevier Inc. All rights reserved.
引用
收藏
页码:170 / 175
页数:6
相关论文
共 15 条
  • [2] Evaluation of the emergency severity index (version 3) triage algorithm in pediatric patients
    Baumann, MR
    Strout, TD
    [J]. ACADEMIC EMERGENCY MEDICINE, 2005, 12 (03) : 219 - 224
  • [3] How artificial intelligence could transform emergency department operations
    Berlyand, Yosef
    Raja, Ali S.
    Dorner, Stephen C.
    Prabhakar, Anand M.
    Sonis, Jonathan D.
    Gottumukkala, Ravi V.
    Succi, Marc David
    Yun, Brian J.
    [J]. AMERICAN JOURNAL OF EMERGENCY MEDICINE, 2018, 36 (08) : 1515 - 1517
  • [4] Revisions to the Canadian Emergency Department Triage and Acuity Scale (CTAS) Guidelines 2016
    Bullard, Michael J.
    Musgrave, Erin
    Warren, David
    Unger, Bernard
    Skeldon, Thora
    Grierson, Rob
    van der Linde, Etienne
    Swain, Janel
    [J]. CANADIAN JOURNAL OF EMERGENCY MEDICINE, 2017, 19 : S18 - S27
  • [5] A Literature Survey of Recent Advances in Chatbots
    Caldarini, Guendalina
    Jaf, Sardar
    McGarry, Kenneth
    [J]. INFORMATION, 2022, 13 (01)
  • [6] ChatGPT, 2024, Optimizing Language Models for Dialogue
  • [7] Considine J, 2000, Accid Emerg Nurs, V8, P201, DOI 10.1054/aaen.2000.0166
  • [8] Farahmand S, 2017, ADV J EMERG MED, V1, DOI [10.22114/ajem.v1i1.11, 10.22114/AJEM.v1i1.11]
  • [9] Enhancing Triage Efficiency and Accuracy in Emergency Rooms for Patients with Metastatic Prostate Cancer: A Retrospective Analysis of Artificial Intelligence-Assisted Triage Using ChatGPT 4.0
    Gebrael, Georges
    Sahu, Kamal Kant
    Chigarira, Beverly
    Tripathi, Nishita
    Thomas, Vinay Mathew
    Sayegh, Nicolas
    Maughan, Benjamin L. L.
    Agarwal, Neeraj
    Swami, Umang
    Li, Haoran
    [J]. CANCERS, 2023, 15 (14)
  • [10] Diagnostic accuracy of a large language model in rheumatology: comparison of physician and ChatGPT-4
    Krusche, Martin
    Callhoff, Johnna
    Knitza, Johannes
    Ruffer, Nikolas
    [J]. RHEUMATOLOGY INTERNATIONAL, 2024, 44 (02) : 303 - 306