Adapting Large Language Models for Automatic Annotation of Radiology Reports for Metastases Detection

被引：0

作者：

Barabadi, Maede Ashofteh ^{[1
,2
]}

Chan, Wai Yip ^{[1
,2
]}

Zhu, Xiaodan ^{[1
,2
]}

Simpson, Amber L. ^{[3
,4
]}

Do, Richard K. G. ^{[5
]}

机构：

[1] Queens Univ, Dept Elect & Comp Engn, Kingston, ON, Canada

[2] Queens Univ, Ingenu Labs Res Inst, Kingston, ON, Canada

[3] Queens Univ, Sch Comp, Kingston, ON, Canada

[4] Queens Univ, Dept Biomed & Mol Sci, Kingston, ON, Canada

[5] Mem Sloan Kettering Canc Ctr, Dept Radiol, 1275 York Ave, New York, NY 10021 USA

来源：

2024 IEEE CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, CCECE 2024 | 2024年

关键词：

metastases detection; natural language processing; parameter-efficient tuning; large language models;

D O I：

10.1109/CCECE59415.2024.10667245

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Automatic identification of metastatic sites in cancer patients from electronic health records is a challenging yet crucial task with significant implications for diagnosis and treatment. In this study, we propose a method to detect metastases from non-structured radiology report texts by accessing only their impression section. We build models based on pre-trained large language models and parameter-efficient fine-tuning. We compare model performances between utilizing non-structured reports and reports following institutional-level templates. By incorporating patient historical data and their timeline into the model, we bridge the gap between structured and non-structured reports. Our experiments are conducted on data gathered at Memorial Sloan Kettering Cancer Center (MSKCC) which have been annotated for metastases presence in three organs: liver, lung, and adrenal glands. Our results suggest that access to previous reports significantly improves model performance, with an average improvement of 7.7 points in terms of F1-score over all datasets. Additionally, incorporating temporal information enhances the accuracy of metastasis detection by 0.4 and 1.1 points on liver and adrenal glands data, respectively. Our method shows potential for automating radiology report labeling on a large scale in an efficient manner, with the potential to deploy on low-cost hardware.

引用

页码：340 / 345

页数：6

共 14 条

[1] Barabadi M. A., 2023, P CAN C ART INT JAN, DOI [10.21428/594757db.8beel2fd, DOI 10.21428/594757DB.8BEEL2FD]
[2] Developing a Cancer Digital Twin: Supervised Metastases Detection From Consecutive Structured Radiology Reports
Batch, Karen E.
Yue, Jianwei
Darcovich, Alex
Lupton, Kaelan
Liu, Corinne C.
Woodlock, David P.
El Amine, Mohammad Ali K.
Causa-Andrieu, Pamela I.
Gazit, Lior
Nguyen, Gary H.
Zulkernine, Farhana
Do, Richard K. G.
Simpson, Amber L.
[J]. FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2022, 5
[3] Cai XR, 2018, PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P3984
[4] Devlin J, 2019, Arxiv, DOI arXiv:1810.04805
[5] Parameter-efficient fine-tuning of large-scale pre-trained language models
Ding, Ning
Qin, Yujia
Yang, Guang
Wei, Fuchao
Yang, Zonghan
Su, Yusheng
Hu, Shengding
Chen, Yulin
Chan, Chi-Min
Chen, Weize
Yi, Jing
Zhao, Weilin
Wang, Xiaozhi
Liu, Zhiyuan
Zheng, Hai-Tao
Chen, Jianfei
Liu, Yang
Tang, Jie
Li, Juanzi
Sun, Maosong
[J]. NATURE MACHINE INTELLIGENCE, 2023, 5 (03) : 220 - +
[6] Patterns of Metastatic Disease in Patients with Cancer Derived from Natural Language Processing of Structured CT Radiology Reports over a 10-year Period
Do, Richard K. G.
Lupton, Kaelan
Andrieu, Pamela I. Causa
Luthra, Anisha
Taya, Michio
Batch, Karen
Nguyen, Huy
Rahurkar, Prachi
Gazit, Lior
Nicholas, Kevin
Fong, Christopher J.
Gangai, Natalie
Schultz, Nikolaus
Zulkernine, Farhana
Sevilimedu, Varadan
Juluru, Krishna
Simpson, Amber
Hricak, Hedvig
[J]. RADIOLOGY, 2021, 301 (01) : 115 - 122
[7] Hu Edward J., 2021, LORA LOW RANK ADAPTA
[8] Lester B, 2021, 2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), P3045
[9] Li XLS, 2021, 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1, P4582
[10] Liu X, 2022, Arxiv, DOI [arXiv:2110.07602, DOI 10.48550/ARXIV.2110.07602]

← 1 2 →