Influence of prior probability information on large language model performance in radiological diagnosis

被引:0
作者
Fukushima, Takahiro [1 ]
Kurokawa, Ryo [1 ]
Hagiwara, Akifumi [1 ]
Sonoda, Yuki [1 ]
Asari, Yusuke [1 ]
Kurokawa, Mariko [1 ]
Kanzawa, Jun [1 ]
Gonoi, Wataru [1 ]
Abe, Osamu [1 ]
机构
[1] Univ Tokyo, Grad Sch Med, Dept Radiol, 7-3-1 Hongo,Bunkyo Ku, Tokyo 1138655, Japan
关键词
Large language model; Artificial intelligence; Claude; 3.5; Sonnet; Bayes' theorem;
D O I
10.1007/s11604-025-01743-3
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
PurposeLarge language models (LLMs) show promise in radiological diagnosis, but their performance may be affected by the context of the cases presented. Our purpose is to investigate how providing information about prior probabilities influences the diagnostic performance of an LLM in radiological quiz cases.Materials and methodsWe analyzed 322 consecutive cases from Radiology's "Diagnosis Please" quiz using Claude 3.5 Sonnet under three conditions: without context (Condition 1), informed as quiz cases (Condition 2), and presented as primary care cases (Condition 3). Diagnostic accuracy was compared using McNemar's test.ResultsThe overall accuracy rate significantly improved in Condition 2 compared to Condition 1 (70.2% vs. 64.9%, p = 0.029). Conversely, the accuracy rate significantly decreased in Condition 3 compared to Condition 1 (59.9% vs. 64.9%, p = 0.027).ConclusionsProviding information that may influence prior probabilities significantly affects the diagnostic performance of the LLM in radiological cases. This suggests that LLMs may incorporate Bayesian-like principles and adjust the weighting of their diagnostic responses based on prior information, highlighting the potential for optimizing LLM's performance in clinical settings by providing relevant contextual information.
引用
收藏
页码:934 / 939
页数:6
相关论文
共 50 条
  • [1] Large Language Model as Unsupervised Health Information Retriever
    Jiang, Keyuan
    Mujtaba, Mohammed M.
    Bernard, Gordon R.
    CARING IS SHARING-EXPLOITING THE VALUE IN DATA FOR HEALTH AND INNOVATION-PROCEEDINGS OF MIE 2023, 2023, 302 : 833 - 834
  • [2] Large Language Model Powered Agents for Information Retrieval
    Zhang, An
    Deng, Yang
    Lin, Yankai
    Chen, Xu
    Wen, Ji-Rong
    Chua, Tat-Seng
    PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 2989 - 2992
  • [3] Large Language Model Applications for Health Information Extraction in Oncology: Scoping Review
    Chen, David
    Alnassar, Saif Addeen
    Avison, Kate Elizabeth
    Huang, Ryan S.
    Raman, Srinivas
    JMIR CANCER, 2025, 11
  • [4] Large language model may assist diagnosis of SAPHO syndrome by bone scintigraphy
    Mori, Yu
    Izumiyama, Takuya
    Kanabuchi, Ryuichi
    Mori, Naoko
    Aizawa, Toshimi
    MODERN RHEUMATOLOGY, 2024, 34 (05) : 1043 - 1046
  • [5] Experience with Large Language Model Applications for Information Retrieval from Enterprise Proprietary Data
    Yu, Liang
    Alegroth, Emil
    Chatzipetrou, Panagiota
    Gorschek, Tony
    PRODUCT-FOCUSED SOFTWARE PROCESS IMPROVEMENT, PROFES 2024, 2025, 15452 : 92 - 107
  • [6] High-performance automated abstract screening with large language model ensembles
    Sanghera, Rohan
    Thirunavukarasu, Arun James
    El Khoury, Marc
    O'Logbon, Jessica
    Chen, Yuqing
    Watt, Archie
    Mahmood, Mustafa
    Butt, Hamid
    Nishimura, George
    Soltan, Andrew A. S.
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2025,
  • [7] Query Expansion and Verification with Large Language Model for Information Retrieval
    Zhang, Wenjing
    Liu, Zhaoxiang
    Wang, Kai
    Lian, Shiguo
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT IV, ICIC 2024, 2024, 14878 : 341 - 351
  • [8] PneumoLLM: Harnessing the power of large language model for pneumoconiosis diagnosis
    Song, Meiyue
    Wang, Jiarui
    Yu, Zhihua
    Wang, Jiaxin
    Yang, Le
    Lu, Yuting
    Li, Baicun
    Wang, Xue
    Wang, Xiaoxu
    Huang, Qinghua
    Li, Zhijun
    Kanellakis, Nikolaos I.
    Liu, Jiangfeng
    Wang, Jing
    Wang, Binglu
    Yang, Juntao
    MEDICAL IMAGE ANALYSIS, 2024, 97
  • [9] Visual large language model for wheat disease diagnosis in the wild
    Zhang, Kunpeng
    Ma, Li
    Cui, Beibei
    Li, Xin
    Zhang, Boqiang
    Xie, Na
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2024, 227
  • [10] Performance of large language models (LLMs) in providing prostate cancer information
    Alasker, Ahmed
    Alsalamah, Seham
    Alshathri, Nada
    Almansour, Nura
    Alsalamah, Faris
    Alghafees, Mohammad
    Alkhamees, Mohammad
    Alsaikhan, Bader
    BMC UROLOGY, 2024, 24 (01):