Generative Pre-trained Transformer 4 makes cardiovascular magnetic resonance reports easy to understand

被引:13
|
作者
Salam, Babak [1 ,2 ]
Kravchenko, Dmitrij [1 ,2 ]
Nowak, Sebastian [1 ,2 ]
Sprinkart, Alois M. [1 ,2 ]
Weinhold, Leonie [3 ]
Odenthal, Anna [1 ]
Mesropyan, Narine [1 ,2 ]
Bischoff, Leon M. [1 ,2 ]
Attenberger, Ulrike [1 ]
Kuetting, Daniel L. [1 ,2 ]
Luetkens, Julian A. [1 ,2 ]
Isaak, Alexander [1 ,2 ]
机构
[1] Univ Hosp Bonn, Dept Diagnost & Intervent Radiol, Venusberg Campus 1, D-53127 Bonn, Germany
[2] Univ Hosp Bonn, Quant Imaging Lab Bonn QILaB, Venusberg Campus 1, D-53127 Bonn, Germany
[3] Univ Hosp Bonn, Dept Med Biometry Informat & Epidemiol, Venusberg Campus 1, D-53127 Bonn, Germany
关键词
Generative Pre-trained Transformers; Cardiovascular magnetic resonance; Artificial intelligence; Text simplification; Large language models; RADIOLOGY REPORTS; READABILITY;
D O I
10.1016/j.jocmr.2024.101035
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Background: Patients are increasingly using Generative Pre-trained Transformer 4 (GPT-4) to better understand their own radiology findings. Purpose: To evaluate the performance of GPT-4 in transforming cardiovascular magnetic resonance (CMR) reports into text that is comprehensible to medical laypersons. Methods: ChatGPT with GPT-4 architecture was used to generate three different explained versions of 20 various CMR reports (n = 60) using the same prompt: "Explain the radiology report in a language understandable to a medical layperson". Two cardiovascular radiologists evaluated understandability, factual correctness, completeness of relevant findings, and lack of potential harm, while 13 medical laypersons evaluated the understandability of the original and the GPT-4 reports on a Likert scale (1 "strongly disagree", 5 "strongly agree"). Readability was measured using the Automated Readability Index (ARI). Linear mixed-effects models (values given as median [interquartile range]) and intraclass correlation coefficient (ICC) were used for statistical analysis. Results: GPT-4 reports were generated on average in 52 s +/- 13. GPT-4 reports achieved a lower ARI score (10 [9-12] vs 5 [4-6]; p < 0.001) and were subjectively easier to understand for laypersons than original reports (1 [1] vs 4 [4,5]; p < 0.001). Eighteen out of 20 (90%) standard CMR reports and 2/60 (3%) GPT-generated reports had an ARI score corresponding to the 8th grade level or higher. Radiologists' ratings of the GPT-4 reports reached high levels for correctness (5 [4, 5]), completeness (5 [5]), and lack of potential harm (5 [5]); with "strong agreement" for factual correctness in 94% (113/120) and completeness of relevant findings in 81% (97/120) of reports. Test-retest agreement for layperson understandability ratings between the three simplified reports generated from the same original report was substantial (ICC: 0.62; p < 0.001). Interrater agreement between radiologists was almost perfect for lack of potential harm (ICC: 0.93, p < 0.001) and moderate to substantial for completeness (ICC: 0.76, p < 0.001) and factual correctness (ICC: 0.55, p < 0.001). Conclusion: GPT-4 can reliably transform complex CMR reports into more understandable, layperson-friendly language while largely maintaining factual correctness and completeness, and can thus help convey patientrelevant radiology information in an easy-to-understand manner.
引用
收藏
页数:8
相关论文
共 37 条
  • [11] Performance of a Generative Pre-Trained Transformer in Generating Scientific Abstracts in Dentistry: A Comparative Observational Study
    Alencar-Palha, Caio
    Ocampo, Thais
    Silva, Thaisa Pinheiro
    Neves, Frederico Sampaio
    Oliveira, Matheus L.
    EUROPEAN JOURNAL OF DENTAL EDUCATION, 2025, 29 (01) : 149 - 154
  • [12] Performance of Chat Generative Pre-trained Transformer-4o in the Adult Clinical Cardiology Self-Assessment Program
    Malik, Abdulaziz
    Madias, Christopher
    Wessler, Benjamin S.
    EUROPEAN HEART JOURNAL - DIGITAL HEALTH, 2024, 6 (01): : 155 - 158
  • [13] Using the Chat Generative Pre-trained Transformer in academic writing in health: a scoping review
    Costa, Isabelle Cristinne Pinto
    do Nascimento, Murilo Cesar
    Treviso, Patricia
    Chini, Lucelia Terra
    Roza, Bartira de Aguiar
    Barbosa, Sayonara De Fatima Faria
    Mendes, Karina Dal Sasso
    REVISTA LATINO-AMERICANA DE ENFERMAGEM, 2024, 32
  • [14] Using the Chat Generative Pre-trained Transformer in academic health writing: a scoping review
    Pinto Costa, Isabelle Cristinne
    do Nascimento, Murilo Cesar
    Treviso, Patricia
    Chini, Lucelia Terra
    Roza, Bartira de Aguiar
    Barbosa, Sayonara De Fatima Faria
    Mendes, Karina Dal Sasso
    REVISTA LATINO-AMERICANA DE ENFERMAGEM, 2024, 32
  • [15] Performance of Chat Generative Pre-Trained Transformer on Personal Review of Learning in Obstetrics and Gynecology
    Cohen, Adam
    Burns, Jersey
    Gabra, Martina
    Gordon, Alex
    Deebel, Nicholas
    Terlecki, Ryan
    Woodburn, Katherine L.
    SOUTHERN MEDICAL JOURNAL, 2025, 118 (02) : 102 - 105
  • [16] Using the Chat Generative Pre-trained Transformer in academic writing in health: a scoping review
    Costa, Isabelle Cristinne Pinto
    do Nascimento, Murilo Cesar
    Treviso, Patricia
    Chini, Lucelia Terra
    Roza, Bartira de Aguiar
    Barbosa, Sayonara De Fatima Faria
    Mendes, Karina Dal Sasso
    REVISTA LATINO-AMERICANA DE ENFERMAGEM, 2024, 32
  • [17] Evolutionary Game Analysis of Artificial Intelligence Such as the Generative Pre-Trained Transformer in Future Education
    You, Yanwei
    Chen, Yuquan
    You, Yujun
    Zhang, Qi
    Cao, Qiang
    SUSTAINABILITY, 2023, 15 (12)
  • [18] Performance of Generative Pre-trained Transformer-4 (GPT-4) in Membership of the Royal College of General Practitioners (MRCGP)-style examination questions
    Armitage, Richard C.
    POSTGRADUATE MEDICAL JOURNAL, 2024, 100 (1182) : 274 - 275
  • [19] Advancement of Generative Pre-trained Transformer Chatbots in Answering Clinical Questions in the Practical Rhinoplasty Guideline
    Shiraishi, Makoto
    Tsuruda, Saori
    Tomioka, Yoko
    Chang, Jinwoo
    Hori, Asei
    Ishii, Saaya
    Fujinaka, Rei
    Ando, Taku
    Ohba, Jun
    Okazaki, Mutsumi
    AESTHETIC PLASTIC SURGERY, 2024, : 1874 - 1880
  • [20] The Impact of Chat Generative Pre-trained Transformer (ChatGPT) on Oncology: Application, Expectations, and Future Prospects
    Li, Yanxing
    Gao, Wentao
    Luan, Zhenhua
    Zhou, Zhi
    Li, Jianjun
    CUREUS JOURNAL OF MEDICAL SCIENCE, 2023, 15 (11)