A scoping review of large language model based approaches for information extraction from radiology reports

被引:15
作者
Reichenpfader, Daniel [1 ,2 ]
Muller, Henning [3 ,4 ]
Denecke, Kerstin [1 ]
机构
[1] Bern Univ Appl Sci, Inst Patient Ctr Digital Hlth, Biel, Switzerland
[2] Univ Geneva, Fac Med, Geneva, Switzerland
[3] Univ Geneva, Dept Radiol & Med Informat, Geneva, Switzerland
[4] HES SO Valais Wallis, Informat Inst, Sierre, Switzerland
关键词
Metadata;
D O I
10.1038/s41746-024-01219-0
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Radiological imaging is a globally prevalent diagnostic method, yet the free text contained in radiology reports is not frequently used for secondary purposes. Natural Language Processing can provide structured data retrieved from these reports. This paper provides a summary of the current state of research on Large Language Model (LLM) based approaches for information extraction (IE) from radiology reports. We conduct a scoping review that follows the PRISMA-ScR guideline. Queries of five databases were conducted on August 1st 2023. Among the 34 studies that met inclusion criteria, only pre-transformer and encoder-based models are described. External validation shows a general performance decrease, although LLMs might improve generalizability of IE approaches. Reports related to CT and MRI examinations, as well as thoracic reports, prevail. Most common challenges reported are missing validation on external data and augmentation of the described methods. Different reporting granularities affect the comparability and transparency of approaches.
引用
收藏
页数:12
相关论文
共 122 条
[1]  
Abdin Marah, 2024, Phi-3 technical report: A highly capable language model locally on your phone
[2]   Leveraging GPT-4 for Post Hoc Transformation of Free-text Radiology Reports into Structured Reporting: A Multilingual Feasibility Study [J].
Adams, Lisa C. ;
Truhn, Daniel ;
Busch, Felix ;
Kader, Avan ;
Niehues, Stefan M. ;
Makowski, Marcus R. ;
Bressem, Keno K. .
RADIOLOGY, 2023, 307 (04)
[3]  
Agrawal M., 2022, P C EMP METH NAT LAN
[4]  
Alsentzer Emily., 2019, Proceedings of the 2nd Clinical Natural Language Processing Workshop, P72, DOI 10.18653/v1/W19-1909
[5]  
[Anonymous], 2024, Llama-3.Meta
[6]  
[Anonymous], 2021, Rayyan-AI Powered Tool for Systematic Literature Reviews
[7]  
Beltagy I, 2019, 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019), P3615
[8]   Science in the age of large language models [J].
Birhane, Abeba ;
Kasirzadeh, Atoosa ;
Leslie, David ;
Wachter, Sandra .
NATURE REVIEWS PHYSICS, 2023, 5 (05) :277-280
[9]   medBERT.de: A comprehensive German BERT model for the medical domain [J].
Bressem, Keno K. ;
Papaioannou, Jens-Michalis ;
Grundmann, Paul ;
Borchert, Florian ;
Adams, Lisa C. ;
Liu, Leonhard ;
Busch, Felix ;
Xu, Lina ;
Loyen, Jan P. ;
Niehues, Stefan M. ;
Augustin, Moritz ;
Grosser, Lennart ;
Makowski, Marcus R. ;
Aerts, Hugo J. W. L. ;
Loeser, Alexander .
EXPERT SYSTEMS WITH APPLICATIONS, 2024, 237
[10]   Highly accurate classification of chest radiographic reports using a deep learning natural language model pre-trained on 3.8 million text reports [J].
Bressem, Keno K. ;
Adams, Lisa C. ;
Gaudin, Robert A. ;
Troeltzsch, Daniel ;
Hamm, Bernd ;
Makowski, Marcus R. ;
Schuele, Chan-Yong ;
Vahldiek, Janis L. ;
Niehues, Stefan M. .
BIOINFORMATICS, 2020, 36 (21) :5255-5261