Generative Pre-trained Transformer 4 makes cardiovascular magnetic resonance reports easy to understand

Cited by: 13
Authors
Salam, Babak [1, 2]
Kravchenko, Dmitrij [1, 2]
Nowak, Sebastian [1, 2]
Sprinkart, Alois M. [1, 2]
Weinhold, Leonie [3]
Odenthal, Anna [1]
Mesropyan, Narine [1, 2]
Bischoff, Leon M. [1, 2]
Attenberger, Ulrike [1]
Kuetting, Daniel L. [1, 2]
Luetkens, Julian A. [1, 2]
Isaak, Alexander [1, 2]
Affiliations
[1] Univ Hosp Bonn, Dept Diagnost & Intervent Radiol, Venusberg Campus 1, D-53127 Bonn, Germany
[2] Univ Hosp Bonn, Quant Imaging Lab Bonn QILaB, Venusberg Campus 1, D-53127 Bonn, Germany
[3] Univ Hosp Bonn, Dept Med Biometry Informat & Epidemiol, Venusberg Campus 1, D-53127 Bonn, Germany
Keywords
Generative Pre-trained Transformers; Cardiovascular magnetic resonance; Artificial intelligence; Text simplification; Large language models; Radiology reports; Readability
DOI
10.1016/j.jocmr.2024.101035
Chinese Library Classification number
R5 [Internal Medicine]
Discipline classification code
1002; 100201
Abstract
Background: Patients are increasingly using Generative Pre-trained Transformer 4 (GPT-4) to better understand their own radiology findings.
Purpose: To evaluate the performance of GPT-4 in transforming cardiovascular magnetic resonance (CMR) reports into text that is comprehensible to medical laypersons.
Methods: ChatGPT with GPT-4 architecture was used to generate three different explained versions of each of 20 different CMR reports (n = 60) using the same prompt: "Explain the radiology report in a language understandable to a medical layperson". Two cardiovascular radiologists evaluated understandability, factual correctness, completeness of relevant findings, and lack of potential harm, while 13 medical laypersons evaluated the understandability of the original and the GPT-4 reports on a Likert scale (1 "strongly disagree", 5 "strongly agree"). Readability was measured using the Automated Readability Index (ARI). Linear mixed-effects models and the intraclass correlation coefficient (ICC) were used for statistical analysis; values are given as median [interquartile range].
Results: GPT-4 reports were generated in 52 ± 13 s on average. GPT-4 reports had a lower ARI score than the original reports (5 [4-6] vs 10 [9-12]; p < 0.001) and were subjectively easier for laypersons to understand (4 [4, 5] vs 1 [1]; p < 0.001). Eighteen out of 20 (90%) standard CMR reports and 2/60 (3%) GPT-generated reports had an ARI score corresponding to the 8th grade level or higher. Radiologists' ratings of the GPT-4 reports reached high levels for correctness (5 [4, 5]), completeness (5 [5]), and lack of potential harm (5 [5]), with "strong agreement" for factual correctness in 94% (113/120) and completeness of relevant findings in 81% (97/120) of reports. Test-retest agreement for layperson understandability ratings between the three simplified reports generated from the same original report was substantial (ICC: 0.62; p < 0.001). Interrater agreement between radiologists was almost perfect for lack of potential harm (ICC: 0.93; p < 0.001) and moderate to substantial for completeness (ICC: 0.76; p < 0.001) and factual correctness (ICC: 0.55; p < 0.001).
Conclusion: GPT-4 can reliably transform complex CMR reports into more understandable, layperson-friendly language while largely maintaining factual correctness and completeness, and can thus help convey patient-relevant radiology information in an easy-to-understand manner.
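The readability measure named in the abstract, the Automated Readability Index, is conventionally defined as ARI = 4.71 × (characters / words) + 0.5 × (words / sentences) − 21.43, where the result approximates a US school grade level. The following is a minimal Python sketch of that formula. The tokenization rules (whitespace-separated words, letters and digits counted as characters, sentences ending in ".", "!" or "?" followed by whitespace) and the function name `automated_readability_index` are assumptions made for illustration; the abstract does not state which ARI implementation the authors used.

```python
import re


def automated_readability_index(text: str) -> float:
    """Return the ARI of `text` using simple, assumed tokenization rules."""
    # Words: whitespace-separated tokens (assumption).
    words = text.split()
    # Characters: letters and digits only, punctuation excluded (assumption).
    characters = sum(ch.isalnum() for ch in text)
    # Sentences: split at ., ! or ? followed by whitespace or end of text,
    # so decimals such as "0.5" are not treated as sentence breaks.
    sentences = [s for s in re.split(r"[.!?]+(?:\s+|$)", text) if s.strip()]
    if not words or not sentences:
        raise ValueError("text must contain at least one word and one sentence")
    return (
        4.71 * characters / len(words)
        + 0.5 * len(words) / len(sentences)
        - 21.43
    )


# Hypothetical example texts, not taken from the study.
plain = "The heart pumps well. No scar was seen."
technical = (
    "Cardiovascular magnetic resonance demonstrates preserved biventricular "
    "systolic function without evidence of myocardial fibrosis."
)
print(automated_readability_index(plain))
print(automated_readability_index(technical))
```

Short words and short sentences lower the score, which is the direction of change the abstract reports for the GPT-4 versions. Scores from this sketch would not necessarily match published values, since ARI tools differ in how they count characters and sentences.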
Pages: 8