Influence of prior probability information on large language model performance in radiological diagnosis

被引：0

作者：

Fukushima, Takahiro ^{[1
]}

Kurokawa, Ryo ^{[1
]}

Hagiwara, Akifumi ^{[1
]}

Sonoda, Yuki ^{[1
]}

Asari, Yusuke ^{[1
]}

Kurokawa, Mariko ^{[1
]}

Kanzawa, Jun ^{[1
]}

Gonoi, Wataru ^{[1
]}

Abe, Osamu ^{[1
]}

机构：

[1] Univ Tokyo, Grad Sch Med, Dept Radiol, 7-3-1 Hongo,Bunkyo Ku, Tokyo 1138655, Japan

来源：

JAPANESE JOURNAL OF RADIOLOGY | 2025年

关键词：

Large language model; Artificial intelligence; Claude; 3.5; Sonnet; Bayes' theorem;

D O I：

10.1007/s11604-025-01743-3

中图分类号：

R8 [特种医学]; R445 [影像诊断学];

学科分类号：

1002 ; 100207 ; 1009 ;

摘要：

PurposeLarge language models (LLMs) show promise in radiological diagnosis, but their performance may be affected by the context of the cases presented. Our purpose is to investigate how providing information about prior probabilities influences the diagnostic performance of an LLM in radiological quiz cases.Materials and methodsWe analyzed 322 consecutive cases from Radiology's "Diagnosis Please" quiz using Claude 3.5 Sonnet under three conditions: without context (Condition 1), informed as quiz cases (Condition 2), and presented as primary care cases (Condition 3). Diagnostic accuracy was compared using McNemar's test.ResultsThe overall accuracy rate significantly improved in Condition 2 compared to Condition 1 (70.2% vs. 64.9%, p = 0.029). Conversely, the accuracy rate significantly decreased in Condition 3 compared to Condition 1 (59.9% vs. 64.9%, p = 0.027).ConclusionsProviding information that may influence prior probabilities significantly affects the diagnostic performance of the LLM in radiological cases. This suggests that LLMs may incorporate Bayesian-like principles and adjust the weighting of their diagnostic responses based on prior information, highlighting the potential for optimizing LLM's performance in clinical settings by providing relevant contextual information.

引用

页码：934 / 939

页数：6

共 50 条

[31] Performance of a Large Language Model on Japanese Emergency Medicine Board Certification Examinations
Igarashi, Yutaka
Nakahara, Kyoichi
Norii, Tatsuya
Miyake, Nodoka
Tagami, Takashi
Yokobori, Shoji
JOURNAL OF NIPPON MEDICAL SCHOOL, 2024, 91 (02) : 155 - 161
[32] Large language model evaluation for high-performance computing software development
Godoy, William F.
Valero-Lara, Pedro
Teranishi, Keita
Balaprakash, Prasanna
Vetter, Jeffrey S.
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2024, 36 (26)
[33] Development and Performance of a Large Language Model for the Quality Evaluation of Multi-Language Medical Imaging Guidelines and Consensus
Wang, Zhixiang
Sun, Jing
Liu, Hui
Luo, Xufei
Li, Jia
He, Wenjun
Yang, Zhenhua
Lv, Han
Chen, Yaolong
Wang, Zhenchang
JOURNAL OF EVIDENCE BASED MEDICINE, 2025, 18 (02)
[34] Artificial intelligence large language model ChatGPT: is it a trustworthy and reliable source of information for sarcoma patients?
Valentini, Marisa
Szkandera, Joanna
Smolle, Maria
Scheipl, Susanne
Leithner, Andreas
Andreou, Dimosthenis
FRONTIERS IN PUBLIC HEALTH, 2024, 12
[35] Multimodal Large Language Model-Based Fault Detection and Diagnosis in Context of Industry 4.0
Alsaif, Khalid M.
Albeshri, Aiiad A.
Khemakhem, Maher A.
Eassa, Fathy E.
ELECTRONICS, 2024, 13 (24):
[36] Performance of a Large Language Model in the Generation of Clinical Guidelines for Antibiotic Prophylaxis in Spine Surgery
Zaidat, Bashar
Shrestha, Nancy
Rosenberg, Ashley M.
Ahmed, Wasil
Rajjoub, Rami
Hoang, Timothy
Mejia, Mateo Restrepo
Duey, Akiro H.
Tang, Justin E.
Kim, Jun S.
Cho, Samuel K.
NEUROSPINE, 2024, 21 (01) : 128 - 146
[37] Performance of a novel multimodal large language model in ınterpreting meibomian glands quantitatively and qualitatively
Pelin Kiyat
Melis Palamar
International Ophthalmology, 45 (1)
[38] A chatbot based question and answer system for the auxiliary diagnosis of chronic diseases based on large language model
Zhang, Sainan
Song, Jisung
SCIENTIFIC REPORTS, 2024, 14 (01):
[39] Large language model assisted fine-grained knowledge graph construction for robotic fault diagnosis
Liao, Xingming
Chen, Chong
Wang, Zhuowei
Liu, Ying
Wang, Tao
Cheng, Lianglun
ADVANCED ENGINEERING INFORMATICS, 2025, 65
[40] Information extraction of UV-NIR spectral data in waste water based on Large Language Model
Liang, Jiheng
Yu, Xiangyang
Hong, Weibin
Cai, Yefan
SPECTROCHIMICA ACTA PART A-MOLECULAR AND BIOMOLECULAR SPECTROSCOPY, 2024, 318

← 1 2 3 4 5 →