Influence of prior probability information on large language model performance in radiological diagnosis

Authors
Fukushima, Takahiro [1]
Kurokawa, Ryo [1]
Hagiwara, Akifumi [1]
Sonoda, Yuki [1]
Asari, Yusuke [1]
Kurokawa, Mariko [1]
Kanzawa, Jun [1]
Gonoi, Wataru [1]
Abe, Osamu [1]
Affiliation
[1] Univ Tokyo, Grad Sch Med, Dept Radiol, 7-3-1 Hongo,Bunkyo Ku, Tokyo 1138655, Japan
Keywords
Large language model; Artificial intelligence; Claude 3.5 Sonnet; Bayes' theorem
DOI
10.1007/s11604-025-01743-3
Chinese Library Classification (CLC) number
R8 [Special medicine]; R445 [Diagnostic imaging]
Subject classification code
1002; 100207; 1009
Abstract
Purpose: Large language models (LLMs) show promise in radiological diagnosis, but their performance may be affected by the context of the cases presented. Our purpose is to investigate how providing information about prior probabilities influences the diagnostic performance of an LLM in radiological quiz cases.
Materials and methods: We analyzed 322 consecutive cases from Radiology's "Diagnosis Please" quiz using Claude 3.5 Sonnet under three conditions: without context (Condition 1), informed as quiz cases (Condition 2), and presented as primary care cases (Condition 3). Diagnostic accuracy was compared using McNemar's test.
Results: The overall accuracy rate significantly improved in Condition 2 compared to Condition 1 (70.2% vs. 64.9%, p = 0.029). Conversely, the accuracy rate significantly decreased in Condition 3 compared to Condition 1 (59.9% vs. 64.9%, p = 0.027).
Conclusions: Providing information that may influence prior probabilities significantly affects the diagnostic performance of the LLM in radiological cases. This suggests that LLMs may incorporate Bayesian-like principles and adjust the weighting of their diagnostic responses based on prior information, highlighting the potential for optimizing LLM performance in clinical settings by providing relevant contextual information.
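The abstract names McNemar's test for the paired comparison of conditions but does not say which variant was used or report the discordant-pair counts. As a minimal sketch only, assuming the exact (binomial) form of the test, the comparison could look like the following. The function name `mcnemar_exact` and the counts `b` and `c` are hypothetical and are not taken from the study.

```python
from math import comb


def mcnemar_exact(b: int, c: int) -> float:
    """Two-sided exact McNemar p-value.

    b: cases answered correctly under condition A only (hypothetical input)
    c: cases answered correctly under condition B only (hypothetical input)
    Cases answered the same way under both conditions do not enter the test.
    """
    n = b + c
    if n == 0:
        return 1.0  # no discordant pairs, so no evidence of a difference
    k = min(b, c)
    # Under the null hypothesis, each discordant pair falls on either side
    # with probability 0.5, so the smaller count follows Binomial(n, 0.5).
    tail = sum(comb(n, i) for i in range(k + 1)) / 2 ** n
    return min(1.0, 2 * tail)


if __name__ == "__main__":
    # Illustrative counts only; the abstract does not report these values.
    b, c = 10, 3
    print(f"exact McNemar p-value for b={b}, c={c}: {mcnemar_exact(b, c):.4f}")
```

Because the same 322 cases were answered under each condition, a paired test of this kind uses only the cases where the two conditions disagree, rather than comparing the two accuracy rates as if they came from independent samples.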
Pages: 934-939
Page count: 6