Influence of prior probability information on large language model performance in radiological diagnosis

被引：0

作者：

Fukushima, Takahiro ^{[1
]}

Kurokawa, Ryo ^{[1
]}

Hagiwara, Akifumi ^{[1
]}

Sonoda, Yuki ^{[1
]}

Asari, Yusuke ^{[1
]}

Kurokawa, Mariko ^{[1
]}

Kanzawa, Jun ^{[1
]}

Gonoi, Wataru ^{[1
]}

Abe, Osamu ^{[1
]}

机构：

[1] Univ Tokyo, Grad Sch Med, Dept Radiol, 7-3-1 Hongo,Bunkyo Ku, Tokyo 1138655, Japan

来源：

JAPANESE JOURNAL OF RADIOLOGY | 2025年

关键词：

Large language model; Artificial intelligence; Claude; 3.5; Sonnet; Bayes' theorem;

D O I：

10.1007/s11604-025-01743-3

中图分类号：

R8 [特种医学]; R445 [影像诊断学];

学科分类号：

1002 ; 100207 ; 1009 ;

摘要：

PurposeLarge language models (LLMs) show promise in radiological diagnosis, but their performance may be affected by the context of the cases presented. Our purpose is to investigate how providing information about prior probabilities influences the diagnostic performance of an LLM in radiological quiz cases.Materials and methodsWe analyzed 322 consecutive cases from Radiology's "Diagnosis Please" quiz using Claude 3.5 Sonnet under three conditions: without context (Condition 1), informed as quiz cases (Condition 2), and presented as primary care cases (Condition 3). Diagnostic accuracy was compared using McNemar's test.ResultsThe overall accuracy rate significantly improved in Condition 2 compared to Condition 1 (70.2% vs. 64.9%, p = 0.029). Conversely, the accuracy rate significantly decreased in Condition 3 compared to Condition 1 (59.9% vs. 64.9%, p = 0.027).ConclusionsProviding information that may influence prior probabilities significantly affects the diagnostic performance of the LLM in radiological cases. This suggests that LLMs may incorporate Bayesian-like principles and adjust the weighting of their diagnostic responses based on prior information, highlighting the potential for optimizing LLM's performance in clinical settings by providing relevant contextual information.

引用

页码：934 / 939

页数：6

共 50 条

[1] Large Language Model as Unsupervised Health Information Retriever
Jiang, Keyuan
Mujtaba, Mohammed M.
Bernard, Gordon R.
CARING IS SHARING-EXPLOITING THE VALUE IN DATA FOR HEALTH AND INNOVATION-PROCEEDINGS OF MIE 2023, 2023, 302 : 833 - 834
[2] Large Language Model Powered Agents for Information Retrieval
Zhang, An
Deng, Yang
Lin, Yankai
Chen, Xu
Wen, Ji-Rong
Chua, Tat-Seng
PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 2989 - 2992
[3] Large Language Model Applications for Health Information Extraction in Oncology: Scoping Review
Chen, David
Alnassar, Saif Addeen
Avison, Kate Elizabeth
Huang, Ryan S.
Raman, Srinivas
JMIR CANCER, 2025, 11
[4] Large language model may assist diagnosis of SAPHO syndrome by bone scintigraphy
Mori, Yu
Izumiyama, Takuya
Kanabuchi, Ryuichi
Mori, Naoko
Aizawa, Toshimi
MODERN RHEUMATOLOGY, 2024, 34 (05) : 1043 - 1046
[5] Experience with Large Language Model Applications for Information Retrieval from Enterprise Proprietary Data
Yu, Liang
Alegroth, Emil
Chatzipetrou, Panagiota
Gorschek, Tony
PRODUCT-FOCUSED SOFTWARE PROCESS IMPROVEMENT, PROFES 2024, 2025, 15452 : 92 - 107
[6] High-performance automated abstract screening with large language model ensembles
Sanghera, Rohan
Thirunavukarasu, Arun James
El Khoury, Marc
O'Logbon, Jessica
Chen, Yuqing
Watt, Archie
Mahmood, Mustafa
Butt, Hamid
Nishimura, George
Soltan, Andrew A. S.
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2025,
[7] Query Expansion and Verification with Large Language Model for Information Retrieval
Zhang, Wenjing
Liu, Zhaoxiang
Wang, Kai
Lian, Shiguo
ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT IV, ICIC 2024, 2024, 14878 : 341 - 351
[8] PneumoLLM: Harnessing the power of large language model for pneumoconiosis diagnosis
Song, Meiyue
Wang, Jiarui
Yu, Zhihua
Wang, Jiaxin
Yang, Le
Lu, Yuting
Li, Baicun
Wang, Xue
Wang, Xiaoxu
Huang, Qinghua
Li, Zhijun
Kanellakis, Nikolaos I.
Liu, Jiangfeng
Wang, Jing
Wang, Binglu
Yang, Juntao
MEDICAL IMAGE ANALYSIS, 2024, 97
[9] Visual large language model for wheat disease diagnosis in the wild
Zhang, Kunpeng
Ma, Li
Cui, Beibei
Li, Xin
Zhang, Boqiang
Xie, Na
COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2024, 227
[10] Performance of large language models (LLMs) in providing prostate cancer information
Alasker, Ahmed
Alsalamah, Seham
Alshathri, Nada
Almansour, Nura
Alsalamah, Faris
Alghafees, Mohammad
Alkhamees, Mohammad
Alsaikhan, Bader
BMC UROLOGY, 2024, 24 (01):

← 1 2 3 4 5 →