Evaluating the accuracy of ChatGPT-4 in predicting ASA scores: A prospective multicentric study ChatGPT-4 in ASA score prediction

被引:9
作者
Turan, Engin Ihsan [1 ,3 ]
Baydemir, Abdurrahman Engin [2 ]
Ozcan, Funda Gumus
Sahin, Ayca Sultan [1 ]
机构
[1] Istanbul Hlth Sci Univ, Dept Anesthesiol, Kanuni Sultan Suleyman Educ & Training Hosp, Istanbul, Turkiye
[2] Basaksehir Cam ve Sakura City Hosp, Dept Anesthesiol, Istanbul, Turkiye
[3] Istanbul Hlth Sci Univ, Anesthesiol & Reanimat Dept, Dept Gastroenterol, Kanuni Sultan Suleyman Hosp, Atakent Mahallesi Turgut Ozal Bulvari 46-1, TR-34303 Istanbul, Turkiye
关键词
D O I
10.1016/j.jclinane.2024.111475
中图分类号
R614 [麻醉学];
学科分类号
100217 ;
摘要
Background: This study investigates the potential of ChatGPT-4, developed by OpenAI, in enhancing medical decision-making processes, particularly in preoperative assessments using the American Society of Anesthesiologists (ASA) scoring system. The ASA score, a critical tool in evaluating patients' health status and anesthesia risks before surgery, categorizes patients from I to VI based on their overall health and risk factors. Despite its widespread use, determining accurate ASA scores remains a subjective process that may benefit from AI-supported assessments. This research aims to evaluate ChatGPT-4's capability to predict ASA scores accurately compared to expert anesthesiologists' assessments. Methods: In this prospective multicentric study, ethical board approval was obtained, and the study was registered with clinicaltrials.gov (NCT06321445). We included 2851 patients from anesthesiology outpatient clinics, spanning neonates to all age groups and genders, with ASA scores between I-IV. Exclusion criteria were set for ASA V and VI scores, emergency operations, and insufficient information for ASA score determination. Data on patients' demographics, health conditions, and ASA scores by anesthesiologists were collected and anonymized. ChatGPT-4 was then tasked with assigning ASA scores based on the standardized patient data. Results: Our results indicate a high level of concordance between ChatGPT-4 predictions and anesthesiologists' evaluations, with Cohen's kappa analysis showing a kappa value of 0.858 ( p = 0.000). While the model demonstrated over 90% accuracy in predicting ASA scores I to III, it showed a notable variance in ASA IV scores, suggesting a potential limitation in assessing patients with more complex health conditions. Discussion: The findings suggest that ChatGPT-4 can significantly contribute to the medical field by supporting anesthesiologists in preoperative assessments. This study not only demonstrates ChatGPT-4's efficacy in medical data analysis and decision-making but also opens new avenues for AI applications in healthcare, particularly in enhancing patient safety and optimizing surgical outcomes. Further research is needed to refine AI models for complex case assessments and integrate them seamlessly into clinical workflows.
引用
收藏
页数:7
相关论文
共 13 条
  • [1] Boulos K, 2023, JMIR Med Educ, V9
  • [2] ASA physical status assignment by non-anesthesia providers: Do surgeons consistently downgrade the ASA score preoperatively?
    Curatolo, Christopher
    Goldberg, Andrew
    Maerz, David
    Lin, Hung-Mo
    Shah, Hardikkumar
    Muoi Trinh
    [J]. JOURNAL OF CLINICAL ANESTHESIA, 2017, 38 : 123 - 128
  • [3] ChatGPT in medicine: an overview of its applications, advantages, limitations, future prospects, and ethical considerations
    Dave, Tirth
    Athaluri, Sai Anirudh
    Singh, Satyam
    [J]. FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2023, 6
  • [4] Comorbidity and the association with 1-year mortality in hip fracture patients: can the ASA score and the Charlson Comorbidity Index be used interchangeably?
    Ek, Stina
    Meyer, Anna C.
    Hedstrom, Margareta
    Modig, Karin
    [J]. AGING CLINICAL AND EXPERIMENTAL RESEARCH, 2022, 34 (01) : 129 - 136
  • [5] Fleisher LA, 2014, J AM COLL CARDIOL, V64, P2373, DOI [10.1016/j.jacc.2014.07.945, 10.1016/j.jacc.2014.07.944]
  • [6] Impact of ASA score misclassification on NSQIP predicted mortality: a retrospective analysis
    Helkin, Alex
    Jain, Sumeet V.
    Gruessner, Angelika
    Fleming, Maureen
    Kohman, Leslie
    Costanza, Michael
    Cooney, Robert N.
    [J]. PERIOPERATIVE MEDICINE, 2017, 6
  • [7] Identifying depression and its determinants upon initiating treatment: ChatGPT versus primary care physicians
    Levkovich, Inbar
    Elyoseph, Zohar
    [J]. FAMILY MEDICINE AND COMMUNITY HEALTH, 2023, 11 (04)
  • [8] Machine Learning Algorithm to Perform the American Society of Anesthesiologists Physical Status Classification
    Lew, Michael W.
    Pozhitkov, Alex
    Rossi, Lorenzo
    Raytis, John
    Kidambi, Trilokesh
    [J]. CUREUS JOURNAL OF MEDICAL SCIENCE, 2023, 15 (10)
  • [9] Utility of ChatGPT in Clinical Practice
    Liu, Jialin
    Wang, Changyu
    Liu, Siru
    [J]. JOURNAL OF MEDICAL INTERNET RESEARCH, 2023, 25
  • [10] ASA score and procedure type predict complications and costs in maxillofacial reconstructive surgery: a retrospective study using a hospital administrative database
    Mehra, Tarun
    Schoenegg, Daphne
    Ebner, Julian
    Moos, Rudolf M.
    Schumann, Paul
    Gander, Thomas
    Essig, Harald
    Lanzer, Martin
    [J]. SWISS MEDICAL WEEKLY, 2021, 151