The Potential of Using Generative AI/NLP to Identify and Analyse Critical Incidents in a Critical Incident Reporting System (CIRS): A Feasibility Case-Control Study

被引:0
作者
Hoelzing, Carlos Ramon [1 ]
Rumpf, Sebastian [1 ]
Huber, Stephan [2 ]
Papenfuss, Nathalie [2 ]
Meybohm, Patrick [1 ]
Happel, Oliver [1 ]
机构
[1] Univ Hosp Wurzburg, Dept Anaesthesiol Intens Care Emergency & Pain Med, Oberdurrbacher Str 6, D-97080 Wurzburg, Germany
[2] Univ Wurzburg, Psychol Ergon, D-97070 Wurzburg, Germany
关键词
patient safety; healthcare quality improvement; human factors; human error; safety culture;
D O I
10.3390/healthcare12191964
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: To enhance patient safety in healthcare, it is crucial to address the underreporting of issues in Critical Incident Reporting Systems (CIRSs). This study aims to evaluate the effectiveness of generative Artificial Intelligence and Natural Language Processing (AI/NLP) in reviewing CIRS cases by comparing its performance with human reviewers and categorising these cases into relevant topics. Methods: A case-control feasibility study was conducted using CIRS cases from the German CIRS-Anaesthesiology subsystem. Each case was reviewed by a human expert and by an AI/NLP model (ChatGPT-3.5). Two CIRS experts blindly assessed these reviews, rating them on linguistic quality, recognisable expertise, logical derivability, and overall quality using six-point Likert scales. Results: On average, the CIRS experts correctly classified 80% of human CIRS reviews as created by a human and misclassified 45.8% of AI reviews as written by a human. Ratings on a scale of 1 (very good) to 6 (failed) revealed a comparable performance between human- and AI-generated reviews across the dimensions of linguistic expression (p = 0.39), recognisable expertise (p = 0.89), logical derivability (p = 0.84), and overall quality (p = 0.87). The AI model was able to categorise the cases into relevant topics independently. Conclusions: This feasibility study demonstrates the potential of generative AI/NLP in analysing and categorising cases from the CIRS. This could have implications for improving incident reporting in healthcare. Therefore, additional research is required to verify and expand upon these discoveries.
引用
收藏
页数:8
相关论文
共 23 条
  • [1] A Comprehensive Evaluation of AI-Assisted Diagnostic Tools in ENT Medicine: Insights and Perspectives from Healthcare Professionals
    Alshehri, Sarah
    Alahmari, Khalid A.
    Alasiry, Areej
    [J]. JOURNAL OF PERSONALIZED MEDICINE, 2024, 14 (04):
  • [2] [Anonymous], 2020, Patient safety incident reporting and learning systems: technical report and guidance
  • [3] Development of a theoretical framework of factors affecting patient safety incident reporting: a theoretical review of the literature
    Archer, Stephanie
    Hull, Louise
    Soukup, Tayana
    Mayer, Erik
    Athanasiou, Thanos
    Sevdalis, Nick
    Darzi, Ara
    [J]. BMJ OPEN, 2017, 7 (12):
  • [4] Feedback from incident reporting: information and action to improve patient safety
    Benn, J.
    Koutantji, M.
    Wallace, L.
    Spurgeon, P.
    Rejman, M.
    Healey, A.
    Vincent, C.
    [J]. QUALITY & SAFETY IN HEALTH CARE, 2009, 18 (01): : 11 - U33
  • [5] Role of Artificial Intelligence in Patient Safety Outcomes: Systematic Literature Review
    Choudhury, Avishek
    Asan, Onur
    [J]. JMIR MEDICAL INFORMATICS, 2020, 8 (07)
  • [6] Use of natural language processing method to identify regional anesthesia from clinical notes
    Graham, Laura A.
    Illarmo, Samantha S.
    Wren, Sherry M.
    Odden, Michelle C.
    Mudumbai, Seshadri C.
    [J]. REGIONAL ANESTHESIA AND PAIN MEDICINE, 2024, : 271 - 275
  • [7] Artificial Intelligence (AI) for the early detection of breast cancer: a scoping review to assess AI's potential in breast screening practice
    Houssami, Nehmat
    Kirkpatrick-Jones, Georgia
    Noguchi, Naomi
    Lee, Christoph I.
    [J]. EXPERT REVIEW OF MEDICAL DEVICES, 2019, 16 (05) : 351 - 362
  • [8] Influence of augmentation on the performance of the double ResNet-based model for chest X-ray classification
    Kloska, Anna
    Tarczewska, Martyna
    Gielczyk, Agata
    Kloska, Sylwester Micha
    Michalski, Adrian
    Serafin, Zbigniew
    Wozniak, Marcin
    [J]. POLISH JOURNAL OF RADIOLOGY, 2023, 88 : E244 - E250
  • [9] Kohn L, 1999, To Err Is Human: Building a Safer Health System, DOI [10.17226/ 9728, 10.17226/9728]
  • [10] The problem with incident reporting
    Macrae, Carl
    [J]. BMJ QUALITY & SAFETY, 2016, 25 (02) : 71 - 75