Empowering PET imaging reporting with retrieval-augmented large language models and reading reports database: a pilot single center study

被引:0
作者
Choi, Hongyoon [1 ,2 ,3 ]
Lee, Dongjoo [3 ]
Kang, Yeon-koo [1 ]
Suh, Minseok [1 ,2 ]
机构
[1] Seoul Natl Univ Hosp, Dept Nucl Med, 101 Daehak Ro, Seoul 03080, South Korea
[2] Seoul Natl Univ, Coll Med, Dept Nucl Med, Seoul, South Korea
[3] Portrai Inc, Seoul, South Korea
关键词
PET reports; Large language model; Retrieval-augmented generation; Artificial intelligence;
D O I
10.1007/s00259-025-07101-9
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
PurposeThe potential of Large Language Models (LLMs) in enhancing a variety of natural language tasks in clinical fields includes medical imaging reporting. This pilot study examines the efficacy of a retrieval-augmented generation (RAG) LLM system considering zero-shot learning capability of LLMs, integrated with a comprehensive database of PET reading reports, in improving reference to prior reports and decision making.MethodsWe developed a custom LLM framework with retrieval capabilities, leveraging a database of over 10 years of PET imaging reports from a single center. The system uses vector space embedding to facilitate similarity-based retrieval. Queries prompt the system to generate context-based answers and identify similar cases or differential diagnoses. From routine clinical PET readings, experienced nuclear medicine physicians evaluated the performance of system in terms of the relevance of queried similar cases and the appropriateness score of suggested potential diagnoses.ResultsThe system efficiently organized embedded vectors from PET reports, showing that imaging reports were accurately clustered within the embedded vector space according to the diagnosis or PET study type. Based on this system, a proof-of-concept chatbot was developed and showed the framework's potential in referencing reports of previous similar cases and identifying exemplary cases for various purposes. From routine clinical PET readings, 84.1% of the cases retrieved relevant similar cases, as agreed upon by all three readers. Using the RAG system, the appropriateness score of the suggested potential diagnoses was significantly better than that of the LLM without RAG. Additionally, it demonstrated the capability to offer differential diagnoses, leveraging the vast database to enhance the completeness and precision of generated reports.ConclusionThe integration of RAG LLM with a large database of PET imaging reports suggests the potential to support clinical practice of nuclear medicine imaging reading by various tasks of AI including finding similar cases and deriving potential diagnoses from them. This study underscores the potential of advanced AI tools in transforming medical imaging reporting practices.
引用
收藏
页码:2452 / 2462
页数:11
相关论文
共 24 条
  • [1] Large language models (LLM) and ChatGPT: what will the impact on nuclear medicine be?
    Alberts, Ian L.
    Mercolli, Lorenzo
    Pyka, Thomas
    Prenosil, George
    Shi, Kuangyu
    Rominger, Axel
    Afshar-Oromieh, Ali
    [J]. EUROPEAN JOURNAL OF NUCLEAR MEDICINE AND MOLECULAR IMAGING, 2023, 50 (06) : 1549 - 1552
  • [2] Artificial Hallucinations in ChatGPT: Implications in Scientific Writing
    Alkaissi, Hussam
    McFarlane, Samy I.
    [J]. CUREUS JOURNAL OF MEDICAL SCIENCE, 2023, 15 (02)
  • [3] Content-based Image Retrieval by Using Deep Learning for Interstitial Lung Disease Diagnosis with Chest CT
    Choe, Jooae
    Hwang, Hye Jeon
    Seo, Joon Beom
    Lee, Sang Min
    Yun, Jihye
    Kim, Min-Ju
    Jeong, Jewon
    Lee, Youngsoo
    Jin, Kiok
    Park, Rohee
    Kim, Jihoon
    Jeon, Howook
    Kim, Namkug
    Yi, Jaeyoun
    Yu, Donghoon
    Kim, Byeongsoo
    [J]. RADIOLOGY, 2022, 302 (01) : 187 - 197
  • [4] ChatGPT in Nuclear Medicine Education
    Currie, Geoffrey
    Barry, Kym
    [J]. JOURNAL OF NUCLEAR MEDICINE TECHNOLOGY, 2023, 51 (03) : 247 - 254
  • [5] Doshi R, 2023, medRxiv, V2023
  • [6] Elkassem AA, 2023, Am J Roentgenol
  • [7] Integrating artificial intelligence and natural language processing for computer-assisted reporting and report understanding in nuclear cardiology
    Garcia, Ernest, V
    [J]. JOURNAL OF NUCLEAR CARDIOLOGY, 2023, 30 (03) : 1180 - 1190
  • [8] Extractive summarization of clinical trial descriptions
    Gulden, Christian
    Kirchner, Melanie
    Schuettler, Christina
    Hinderer, Marc
    Kampf, Marvin
    Prokosch, Hans-Ulrich
    Toddenroth, Dennis
    [J]. INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2019, 129 : 114 - 121
  • [9] Domain-adapted Large Language Models for Classifying Nuclear Medicine Reports
    Huemann, Zachary
    Lee, Changhee
    Hu, Junjie
    Cho, Steve Y.
    Bradshaw, Tyler J.
    [J]. RADIOLOGY-ARTIFICIAL INTELLIGENCE, 2023, 5 (06)
  • [10] Survey of Hallucination in Natural Language Generation
    Ji, Ziwei
    Lee, Nayeon
    Frieske, Rita
    Yu, Tiezheng
    Su, Dan
    Xu, Yan
    Ishii, Etsuko
    Bang, Ye Jin
    Madotto, Andrea
    Fung, Pascale
    [J]. ACM COMPUTING SURVEYS, 2023, 55 (12)