Learning to match patients to clinical trials using large language models

被引:3
作者
Rybinski, Maciej [1 ]
Kusa, Wojciech [2 ]
Karimi, Sarvnaz [1 ]
Hanbury, Allan [2 ]
机构
[1] CSIRO Data61, 26 Pembroke Rd, Marsfield, NSW 2122, Australia
[2] TU Wien, Favoritenstr 9-11, A-1040 Vienna, Austria
基金
欧盟地平线“2020”;
关键词
Clinical trials; Patient to trials matching; TCRR; TREC CT; Large language models; Information retrieval; Learning-to-rank;
D O I
10.1016/j.jbi.2024.104734
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Objective: This study investigates the use of Large Language Models (LLMs) for matching patients to clinical trials (CTs) within an information retrieval pipeline. Our objective is to enhance the process of patient-trial matching by leveraging the semantic processing capabilities of LLMs, thereby improving the effectiveness of patient recruitment for clinical trials. Methods: We employed a multi-stage retrieval pipeline integrating various methodologies, including BM25 and Transformer-based rankers, along with LLM-based methods. Our primary datasets were the TREC Clinical Trials 2021-23 track collections. We compared LLM-based approaches, focusing on methods that leverage LLMs in query formulation, filtering, relevance ranking, and re-ranking of CTs. Results: Our results indicate that LLM-based systems, particularly those involving re-ranking with a fine-tuned LLM, outperform traditional methods in terms of nDCG and Precision measures. The study demonstrates that fine-tuning LLMs enhances their ability to find eligible trials. Moreover, our LLM-based approach is competitive with state-of-the-art systems in the TREC challenges. The study shows the effectiveness of LLMs in CT matching, highlighting their potential in handling complex semantic analysis and improving patient-trial matching. However, the use of LLMs increases the computational cost and reduces efficiency. We provide a detailed analysis of effectiveness-efficiency trade-offs. Conclusion: This research demonstrates the promising role of LLMs in enhancing the patient-to-clinical trial matching process, offering a significant advancement in the automation of patient recruitment. Future work should explore optimising the balance between computational cost and retrieval effectiveness in practical applications.
引用
收藏
页数:12
相关论文
共 50 条
[41]   Using Large Language Models to Improve Sentiment Analysis in Latvian Language [J].
Purvins, Pauls ;
Urtans, Evalds ;
Caune, Vairis .
BALTIC JOURNAL OF MODERN COMPUTING, 2024, 12 (02) :165-175
[42]   Utilizing Large Language Models for Ablation Studies in Machine Learning and Deep Learning [J].
Sheikholeslami, Sina ;
Ghasemirahni, Hamid ;
Payberah, Amir H. ;
Wang, Tianze ;
Dowling, Jim ;
Vlassov, Vladimir .
PROCEEDINGS OF THE 2025 THE 5TH WORKSHOP ON MACHINE LEARNING AND SYSTEMS, EUROMLSYS 2025, 2025, :230-237
[43]   Automated Paper Screening for Clinical Reviews Using Large Language Models: Data Analysis Study [J].
Guo, Eddie ;
Gupta, Mehul ;
Deng, Jiawen ;
Park, Ye-Jean ;
Paget, Michael ;
Naugler, Christopher .
JOURNAL OF MEDICAL INTERNET RESEARCH, 2024, 26
[44]   Using Large Language Models to Retrieve Critical Data from Clinical Processes and Business Rules [J].
Yu, Yunguo ;
Gomez-Cabello, Cesar A. ;
Makarova, Svetlana ;
Parte, Yogesh ;
Borna, Sahar ;
Haider, Syed Ali ;
Genovese, Ariana ;
Prabha, Srinivasagam ;
Forte, Antonio J. .
BIOENGINEERING-BASEL, 2025, 12 (01)
[45]   Utility of large language models for creating clinical assessment items [J].
Lam, George ;
Shammoon, Yusra ;
Coulson, Anna ;
Lalloo, Felicity ;
Maini, Arti ;
Amin, Anjali ;
Brown, Celia ;
Sam, Amir H. .
MEDICAL TEACHER, 2025, 47 (05) :878-882
[46]   Generalizable clinical note section identification with large language models [J].
Zhou, Weipeng ;
Miller, Timothy A. .
JAMIA OPEN, 2024, 7 (03)
[47]   Agile Project Management Using Large Language Models [J].
Dhruva, G. ;
Shettigar, Ishaan ;
Parthasarthy, Srikrshna ;
Sapna, V. M. .
2024 5TH INTERNATIONAL CONFERENCE ON INNOVATIVE TRENDS IN INFORMATION TECHNOLOGY, ICITIIT 2024, 2024,
[48]   Using large language models to create narrative events [J].
Bartalesi, Valentina ;
Lenzi, Emanuele ;
De Martino, Claudio .
PEERJ COMPUTER SCIENCE, 2024, 10
[49]   Corporate Event Predictions Using Large Language Models [J].
Xiao, Zhaomin ;
Mai, Zhelu ;
Xu, Zhuoer ;
Cui, Yachen ;
Li, Jiancheng .
2023 10TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING & MACHINE INTELLIGENCE, ISCMI, 2023, :193-197
[50]   Using Large Language Models to Understand Telecom Standards [J].
Karapantelakis, Athanasios ;
Thakur, Mukesh ;
Nikou, Alexandros ;
Moradi, Farnaz ;
Olrog, Christian ;
Gaim, Fitsum ;
Holm, Henrik ;
Nimara, Doumitrou Daniil ;
Huang, Vincent .
2024 IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING FOR COMMUNICATION AND NETWORKING, ICMLCN 2024, 2024, :440-446