Feasibility of GPT-3 and GPT-4 for in-Depth Patient Education Prior to Interventional Radiological Procedures: A Comparative Analysis

被引:18
作者
Scheschenja, Michael [1 ]
Viniol, Simon [1 ]
Bastian, Moritz B. [1 ]
Wessendorf, Joel [1 ]
Koenig, Alexander M. [1 ]
Mahnken, Andreas H. [1 ]
机构
[1] Philipps Univ Marburg, Univ Hosp Marburg, Dept Diagnost & Intervent Radiol, Baldingerstr 1, D-35043 Marburg, Germany
关键词
Artificial intelligence; Patient education; Interventional radiology; Chat-GPT; Large language models;
D O I
10.1007/s00270-023-03563-2
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
PurposeThis study explores the utility of the large language models, GPT-3 and GPT-4, for in-depth patient education prior to interventional radiology procedures. Further, differences in answer accuracy between the models were assessed.Materials and methodsA total of 133 questions related to three specific interventional radiology procedures (Port implantation, PTA and TACE) covering general information as well as preparation details, risks and complications and post procedural aftercare were compiled. Responses of GPT-3 and GPT-4 were assessed for their accuracy by two board-certified radiologists using a 5-point Likert scale. The performance difference between GPT-3 and GPT-4 was analyzed.ResultsBoth GPT-3 and GPT-4 responded with (5) "completely correct" (4) "very good" answers for the majority of questions ((5) 30.8% + (4) 48.1% for GPT-3 and (5) 35.3% + (4) 47.4% for GPT-4). GPT-3 and GPT-4 provided (3) "acceptable" responses 15.8% and 15.0% of the time, respectively. GPT-3 provided (2) "mostly incorrect" responses in 5.3% of instances, while GPT-4 had a lower rate of such occurrences, at just 2.3%. No response was identified as potentially harmful. GPT-4 was found to give significantly more accurate responses than GPT-3 (p = 0.043).ConclusionGPT-3 and GPT-4 emerge as relatively safe and accurate tools for patient education in interventional radiology. GPT-4 showed a slightly better performance. The feasibility and accuracy of these models suggest their promising role in revolutionizing patient care. Still, users need to be aware of possible limitations.
引用
收藏
页码:245 / 250
页数:6
相关论文
共 14 条
  • [1] Exploring the Boundaries of Reality: Investigating the Phenomenon of Artificial Intelligence Hallucination in Scientific Writing Through ChatGPT References
    Athaluri, Sai Anirudh
    Manthena, Sandeep Varma
    Kesapragada, V. S. R. Krishna Manoj
    Yarlagadda, Vineel
    Dave, Tirth
    Duddumpudi, Rama Tulasi Siri
    [J]. CUREUS JOURNAL OF MEDICAL SCIENCE, 2023, 15 (04)
  • [2] Gertz Roman Johannes, 2023, Radiology, V307, pe230877, DOI 10.1148/radiol.230877
  • [3] An Evaluation of Trends in Patient and Public Awareness of IR
    Heister, David
    Jackson, Susan
    Doherty-Simor, Margaret
    Newton, Isabel
    [J]. JOURNAL OF VASCULAR AND INTERVENTIONAL RADIOLOGY, 2018, 29 (05) : 661 - 668
  • [4] Koski Eileen, 2021, Stud Health Technol Inform, V284, P295, DOI 10.3233/SHTI210726
  • [5] Revolutionizing radiology with GPT-based models: Current applications, future possibilities and limitations of ChatGPT
    Lecler, Augustin
    Duron, Loic
    Soyer, Philippe
    [J]. DIAGNOSTIC AND INTERVENTIONAL IMAGING, 2023, 104 (06) : 269 - 274
  • [6] Translating radiology reports into plain language using ChatGPT and GPT-4 with prompt learning: results, limitations, and potential
    Lyu, Qing
    Tan, Josh
    Zapadka, Michael E.
    Ponnatapura, Janardhana
    Niu, Chuang
    Myers, Kyle J.
    Wang, Ge
    Whitlow, Christopher T.
    [J]. VISUAL COMPUTING FOR INDUSTRY BIOMEDICINE AND ART, 2023, 6 (01)
  • [7] CIRSE Clinical Practice Manual
    Mahnken, Andreas H.
    Boullosa Seoane, Esther
    Cannavale, Allesandro
    de Haan, Michiel W.
    Dezman, Rok
    Kloeckner, Roman
    O'Sullivan, Gerard
    Ryan, Anthony
    Tsoumakidou, Georgia
    [J]. CARDIOVASCULAR AND INTERVENTIONAL RADIOLOGY, 2021, 44 (09) : 1323 - 1353
  • [8] Large language models for structured reporting in radiology: performance of GPT-4, ChatGPT-3.5, Perplexity and Bing
    Mallio, Carlo A.
    Sertorio, Andrea C.
    Bernetti, Caterina
    Beomonte Zobel, Bruno
    [J]. RADIOLOGIA MEDICA, 2023, 128 (07): : 808 - 812
  • [9] Evaluation of an Artificial Intelligence Chatbot for Delivery of IR Patient Education Material: A Comparison with Societal Website Content
    McCarthy, Colin J.
    Berkowitz, Seth
    Ramalingam, Vijay
    Ahmed, Muneeb
    [J]. JOURNAL OF VASCULAR AND INTERVENTIONAL RADIOLOGY, 2023, 34 (10) : 1760 - +