Analysis of eligibility criteria clusters based on large language models for clinical trial design

Cited by: 1

Authors:

Bornet, Alban [1]

Khlebnikov, Philipp [2]

Meer, Florian [2]

Haas, Quentin [2]

Yazdani, Anthony [1]

Zhang, Boya [1]

Amini, Poorya [2]

Teodoro, Douglas [1]

Affiliations:

[1] Univ Geneva, Dept Radiol & Med Informat, G6-N3,9 Chemin Mines,Campus Biotech, CH-1202 Geneva, Switzerland

[2] Risklick AG, CH-3013 Bern, Switzerland

Source:

Journal of the American Medical Informatics Association | 2024 / Vol. 32 / Issue 3

Keywords:

clinical trials; eligibility criteria; natural language processing (NLP); LLMs; clustering; topic modeling; RANDOMIZED CONTROLLED-TRIALS; REGRESSION; EXTRACTION; SELECTION;

DOI:

10.1093/jamia/ocae311

Chinese Library Classification number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

Objectives: Clinical trials (CTs) are essential for improving patient care by evaluating new treatments' safety and efficacy. A key component in CT protocols is the study population defined by the eligibility criteria. This study aims to evaluate the effectiveness of large language models (LLMs) in encoding eligibility criterion information to support CT-protocol design.

Materials and Methods: We extracted eligibility criterion sections, phases, conditions, and interventions from CT protocols available in the ClinicalTrials.gov registry. Eligibility sections were split into individual rules using a criterion tokenizer and embedded using LLMs. The obtained representations were clustered. The quality and relevance of the clusters for protocol design were evaluated through 3 experiments: intrinsic alignment with protocol information and human expert cluster coherence assessment, extrinsic evaluation through CT-level classification tasks, and eligibility section generation.

Results: Sentence embeddings fine-tuned using biomedical corpora produce clusters with the highest alignment to CT-level information. Human expert evaluation confirms that clusters are well structured and coherent. Despite the high information compression, clusters retain significant CT information, up to 97% of the classification performance obtained with raw embeddings. Finally, eligibility sections automatically generated using clusters achieve 95% of the ROUGE scores obtained with a generative LLM prompted with CT-protocol details, suggesting that clusters encapsulate information useful to CT-protocol design.

Discussion: Clusters derived from sentence-level LLM embeddings effectively summarize complex eligibility criterion data while retaining relevant CT-protocol details. Clustering-based approaches provide a scalable enhancement in CT design that balances information compression with accuracy.

Conclusions: Clustering eligibility criteria using LLM embeddings provides a practical and efficient method to summarize critical protocol information. We provide an interactive visualization of the pipeline here.

Pages: 447-458

Page count: 12
