Use of large language models in radiological reports: A study on simplifying turkish MRI findings

被引:0
作者
Cesur, Turay [1 ]
Camur, Eren [2 ]
Gunes, Yasin Celal [3 ]
机构
[1] Ankara Mamak State Hosp, Dept Radiol, Ankara, Turkiye
[2] Ankara 29 Mayis State Hosp, Dept Radiol, Minist Hlth, Ankara, Turkiye
[3] Yuksek Ihtisas Hosp, Dept Radiol, Kirikkale, Turkiye
来源
ANNALS OF CLINICAL AND ANALYTICAL MEDICINE | 2024年 / 15卷 / 08期
关键词
Large Language Model; Radiology Reports; Readability; Health Communication;
D O I
10.4328/ACAM.22266
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Aim: Advanced Large Language Models (LLMs), like ChatGPT, are known for their human-like expression and reasoning abilities. They are used in many fields, including radiology. This study is pioneering in evaluating and comparing the effectiveness of LLMs in simplifying Magnetic Resonance Imaging (MRI) findings in Turkish. Material and Methods: In our study, we simplified 50 fictional MRI findings in Turkish language using different LLMs, including ChatGPT-4, Gemini Pro 1.5, Claude 3 Opus and Perplexity. We compared the responses based on Ate & scedil;man's readability index and word count. Additionally, three radiologists assessed the medical accuracy, consistency of suggestions, and comprehensibility of the answers, scoring each model on a scale of 1 to 5. Results: There was no statistically significant difference between the scores of Gemini 1.5 Pro (average: 4.9; median: 5.0), Opus (average: 4.8; median: 5.0), and ChatGPT-4 (average: 4.8; median: 5.0) (p>0.05). However, there was a significant difference between the scores of Gemini 1.5 Pro and Perplexity (average: 3.7; median: 4.0) (p<0.001). According to the readability index, Gemini 1.5 Pro had the highest average score of 59.3, which was significantly higher than the other LLMs (p<0.005). In terms of word count, ChatGPT-4 used the most words (151.5), while Perplexity used the fewest (88.4). Discussion: This study is the first to evaluate the ability of LLMs to simplify MRI findings in Turkish. The results suggest that radiologists find these models effective in making radiology reports more understandable. However, additional research is necessary to confirm these findings.
引用
收藏
页码:586 / 590
页数:5
相关论文
共 50 条
  • [1] A COMPARATIVE STUDY: PERFORMANCE OF LARGE LANGUAGE MODELS IN SIMPLIFYING TURKISH COMPUTED TOMOGRAPHY REPORTS
    Camur, Eren
    Cesur, Turay
    Gunes, Yasin Celal
    JOURNAL OF ISTANBUL FACULTY OF MEDICINE-ISTANBUL TIP FAKULTESI DERGISI, 2024, 87 (04): : 321 - 326
  • [2] Comparative Analysis of Large Language Models in Simplifying Turkish Ultrasound Reports to Enhance Patient Understanding
    Gunes, Yasin Celal
    Cesur, Turay
    Camur, Eren
    EUROPEAN JOURNAL OF THERAPEUTICS, 2024,
  • [3] Generating colloquial radiology reports with large language models
    Tang, Cynthia Crystal
    Nagesh, Supriya
    Fussell, David A.
    Glavis-Bloom, Justin
    Mishra, Nina
    Li, Charles
    Cortes, Gillean
    Hill, Robert
    Zhao, Jasmine
    Gordon, Angellica
    Wright, Joshua
    Troutt, Hayden
    Tarrago, Rod
    Chow, Daniel S.
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2024, 31 (11) : 2660 - 2667
  • [4] Evaluation of large language models performance against humans for summarizing MRI knee radiology reports: A feasibility study
    Lopez-Ubeda, Pilar
    Martin-Noguerol, Teodoro
    Diaz-Angulo, Carolina
    Luna, Antonio
    INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2024, 187
  • [5] Automated classification of brain MRI reports using fine-tuned large language models
    Kanzawa, Jun
    Yasaka, Koichiro
    Fujita, Nana
    Fujiwara, Shin
    Abe, Osamu
    NEURORADIOLOGY, 2024, 66 (12) : 2177 - 2183
  • [6] Data augmentation based on large language models for radiological report classification
    Collado-Montanez, Jaime
    Martin-Valdivia, Maria-Teresa
    Martinez-Camara, Eugenio
    KNOWLEDGE-BASED SYSTEMS, 2025, 308
  • [7] The use of large language models for program repair
    Zubair, Fida
    Al-Hitmi, Maryam
    Catal, Cagatay
    COMPUTER STANDARDS & INTERFACES, 2025, 93
  • [8] Large language models for efficient whole-organ MRI score-based reports and categorization in knee osteoarthritis
    Yuxue Xie
    Zhonghua Hu
    Hongyue Tao
    Yiwen Hu
    Haoyu Liang
    Xinmin Lu
    Lei Wang
    Xiangwen Li
    Shuang Chen
    Insights into Imaging, 16 (1)
  • [9] Large Language Models for Simplified Interventional Radiology Reports: A Comparative Analysis
    Can, Elif
    Uller, Wibke
    Vogt, Katharina
    Doppler, Michael C.
    Busch, Felix
    Bayerl, Nadine
    Ellmann, Stephan
    Kader, Avan
    Elkilany, Aboelyazid
    Makowski, Marcus R.
    Bressem, Keno K.
    Adams, Lisa C.
    ACADEMIC RADIOLOGY, 2025, 32 (02) : 888 - 898
  • [10] Comparative Evaluation of Large Language Models for Translating Radiology Reports into Hindi
    Gupta, Amit
    Rastogi, Ashish
    Malhotra, Hema
    Rangarajan, Krithika
    INDIAN JOURNAL OF RADIOLOGY AND IMAGING, 2025, 35 (01) : 88 - 96