Adapting Large Language Models for Automatic Annotation of Radiology Reports for Metastases Detection

Times cited: 0
Authors
Barabadi, Maede Ashofteh [1, 2]
Chan, Wai Yip [1, 2]
Zhu, Xiaodan [1, 2]
Simpson, Amber L. [3, 4]
Do, Richard K. G. [5]
Affiliations
[1] Queens Univ, Dept Elect & Comp Engn, Kingston, ON, Canada
[2] Queens Univ, Ingenu Labs Res Inst, Kingston, ON, Canada
[3] Queens Univ, Sch Comp, Kingston, ON, Canada
[4] Queens Univ, Dept Biomed & Mol Sci, Kingston, ON, Canada
[5] Mem Sloan Kettering Canc Ctr, Dept Radiol, 1275 York Ave, New York, NY 10021 USA
Source
2024 IEEE CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, CCECE 2024 | 2024
Keywords
metastases detection; natural language processing; parameter-efficient tuning; large language models;
DOI
10.1109/CCECE59415.2024.10667245
Chinese Library Classification number
TP39 [Computer applications];
Discipline classification code
081203 ; 0835 ;
Abstract
Automatic identification of metastatic sites in cancer patients from electronic health records is a challenging yet crucial task with significant implications for diagnosis and treatment. In this study, we propose a method to detect metastases from non-structured radiology report texts by accessing only their impression section. We build models based on pre-trained large language models and parameter-efficient fine-tuning. We compare model performances between utilizing non-structured reports and reports following institutional-level templates. By incorporating patient historical data and their timeline into the model, we bridge the gap between structured and non-structured reports. Our experiments are conducted on data gathered at Memorial Sloan Kettering Cancer Center (MSKCC) which have been annotated for metastases presence in three organs: liver, lung, and adrenal glands. Our results suggest that access to previous reports significantly improves model performance, with an average improvement of 7.7 points in terms of F1-score over all datasets. Additionally, incorporating temporal information enhances the accuracy of metastasis detection by 0.4 and 1.1 points on liver and adrenal glands data, respectively. Our method shows potential for automating radiology report labeling on a large scale in an efficient manner, with the potential to deploy on low-cost hardware.
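The abstract mentions incorporating a patient's previous reports and their timeline into the model input. The record does not describe how this is done, so the following is only a minimal sketch under stated assumptions: the `Report` type, the `build_input` helper, the `[N days earlier]` / `[current]` tags, and the `[SEP]` separator are all hypothetical names chosen for illustration, not the authors' format. It shows one plausible way to join prior impression sections with the current one while encoding elapsed time.

```python
from __future__ import annotations

from dataclasses import dataclass
from datetime import date


@dataclass
class Report:
    """Hypothetical container: one radiology report, impression section only."""
    exam_date: date
    impression: str


def build_input(history: list[Report], current: Report,
                max_prior: int = 2, sep: str = " [SEP] ") -> str:
    """Join the most recent prior impressions with the current one.

    Each prior impression is tagged with the number of days between it and
    the current exam (an assumed way of encoding the timeline).
    """
    # Keep only reports dated strictly before the current exam, oldest first.
    prior = sorted(
        (r for r in history if r.exam_date < current.exam_date),
        key=lambda r: r.exam_date,
    )
    # Retain at most max_prior of the most recent ones (none if max_prior <= 0).
    prior = prior[-max_prior:] if max_prior > 0 else []

    parts = []
    for r in prior:
        days = (current.exam_date - r.exam_date).days
        parts.append(f"[{days} days earlier] {r.impression}")
    parts.append(f"[current] {current.impression}")
    return sep.join(parts)


# Made-up example texts, for illustration only.
history = [
    Report(date(2019, 6, 1), "Unremarkable study."),
    Report(date(2020, 1, 1), "No liver lesions."),
    Report(date(2020, 4, 10), "New hepatic lesion."),
]
current = Report(date(2020, 7, 9), "Hepatic lesion enlarged.")
text = build_input(history, current)
print(text)
```

The resulting string could then be tokenized and passed to a classifier tuned with a parameter-efficient method; capping the number of prior reports is one simple way to stay within a model's input length limit.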
Pages: 340-345
Page count: 6
Related papers
14 in total
  • [1] Barabadi M. A., 2023, P CAN C ART INT JAN, DOI 10.21428/594757db.8beel2fd
  • [2] Developing a Cancer Digital Twin: Supervised Metastases Detection From Consecutive Structured Radiology Reports
    Batch, Karen E.
    Yue, Jianwei
    Darcovich, Alex
    Lupton, Kaelan
    Liu, Corinne C.
    Woodlock, David P.
    El Amine, Mohammad Ali K.
    Causa-Andrieu, Pamela I.
    Gazit, Lior
    Nguyen, Gary H.
    Zulkernine, Farhana
    Do, Richard K. G.
    Simpson, Amber L.
    [J]. FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2022, 5
  • [3] Cai XR, 2018, PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P3984
  • [4] Devlin J, 2019, arXiv:1810.04805
  • [5] Parameter-efficient fine-tuning of large-scale pre-trained language models
    Ding, Ning
    Qin, Yujia
    Yang, Guang
    Wei, Fuchao
    Yang, Zonghan
    Su, Yusheng
    Hu, Shengding
    Chen, Yulin
    Chan, Chi-Min
    Chen, Weize
    Yi, Jing
    Zhao, Weilin
    Wang, Xiaozhi
    Liu, Zhiyuan
    Zheng, Hai-Tao
    Chen, Jianfei
    Liu, Yang
    Tang, Jie
    Li, Juanzi
    Sun, Maosong
    [J]. NATURE MACHINE INTELLIGENCE, 2023, 5 (03) : 220 - +
  • [6] Patterns of Metastatic Disease in Patients with Cancer Derived from Natural Language Processing of Structured CT Radiology Reports over a 10-year Period
    Do, Richard K. G.
    Lupton, Kaelan
    Andrieu, Pamela I. Causa
    Luthra, Anisha
    Taya, Michio
    Batch, Karen
    Nguyen, Huy
    Rahurkar, Prachi
    Gazit, Lior
    Nicholas, Kevin
    Fong, Christopher J.
    Gangai, Natalie
    Schultz, Nikolaus
    Zulkernine, Farhana
    Sevilimedu, Varadan
    Juluru, Krishna
    Simpson, Amber
    Hricak, Hedvig
    [J]. RADIOLOGY, 2021, 301 (01) : 115 - 122
  • [7] Hu Edward J., 2021, LORA LOW RANK ADAPTA
  • [8] Lester B, 2021, 2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), P3045
  • [9] Li XLS, 2021, 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1, P4582
  • [10] Liu X, 2022, arXiv:2110.07602, DOI 10.48550/arXiv.2110.07602