Increasing the efficiency of trial-patient matching: automated clinical trial eligibility Pre-screening for pediatric oncology patients

被引:89
作者
Ni, Yizhao [1 ]
Wright, Jordan [2 ]
Perentesis, John [2 ]
Lingren, Todd [1 ]
Deleger, Louise [1 ]
Kaiser, Megan [1 ]
Kohane, Isaac [3 ]
Solti, Imre [1 ,4 ]
机构
[1] Cincinnati Childrens Hosp Med Ctr, Dept Biomed Informat, Cincinnati, OH 45229 USA
[2] Cincinnati Childrens Hosp Med Ctr, Canc & Blood Dis Inst, Cincinnati, OH 45229 USA
[3] Harvard Univ, Sch Med, Ctr Biomed Informat, Boston, MA USA
[4] Cincinnati Childrens Hosp Med Ctr, James M Anderson Ctr Hlth Syst Excellence, Cincinnati, OH 45229 USA
关键词
Automated clinical trial eligibility screening; Patient-trial matching; Natural language processing; Information extraction; RECRUITMENT; CHALLENGES; EXTRACTION; SYSTEM; TEXT;
D O I
10.1186/s12911-015-0149-3
中图分类号
R-058 [];
学科分类号
摘要
Background: Manual eligibility screening (ES) for a clinical trial typically requires a labor-intensive review of patient records that utilizes many resources. Leveraging state-of-the-art natural language processing (NLP) and information extraction (IE) technologies, we sought to improve the efficiency of physician decision-making in clinical trial enrollment. In order to markedly reduce the pool of potential candidates for staff screening, we developed an automated ES algorithm to identify patients who meet core eligibility characteristics of an oncology clinical trial. Methods: We collected narrative eligibility criteria from ClinicalTrials.gov for 55 clinical trials actively enrolling oncology patients in our institution between 12/01/2009 and 10/31/2011. In parallel, our ES algorithm extracted clinical and demographic information from the Electronic Health Record (EHR) data fields to represent profiles of all 215 oncology patients admitted to cancer treatment during the same period. The automated ES algorithm then matched the trial criteria with the patient profiles to identify potential trial-patient matches. Matching performance was validated on a reference set of 169 historical trial-patient enrollment decisions, and workload, precision, recall, negative predictive value (NPV) and specificity were calculated. Results: Without automation, an oncologist would need to review 163 patients per trial on average to replicate the historical patient enrollment for each trial. This workload is reduced by 85% to 24 patients when using automated ES (precision/recall/NPV/specificity: 12.6%/100.0%/100.0%/89.9%). Without automation, an oncologist would need to review 42 trials per patient on average to replicate the patient-trial matches that occur in the retrospective data set. With automated ES this workload is reduced by 90% to four trials (precision/recall/NPV/specificity: 35.7%/100.0%/100.0%/95.5%). Conclusion: By leveraging NLP and IE technologies, automated ES could dramatically increase the trial screening efficiency of oncologists and enable participation of small practices, which are often left out from trial enrollment. The algorithm has the potential to significantly reduce the effort to execute clinical research at a point in time when new initiatives of the cancer care community intend to greatly expand both the access to trials and the number of available trials.
引用
收藏
页数:10
相关论文
共 37 条
[1]  
[Anonymous], 2014, P 2 ASE INT C BIG DA
[2]   Supporting Patient Screening to Identify Suitable Clinical Trials [J].
Bucur, Anca ;
Van Leeuwen, Jasper ;
Chen, Njin-Zu ;
Claerhout, Brecht ;
De Schepper, Kristof ;
Perez-Rey, David ;
Alonso-Calvo, Raul ;
Pugliano, Lina ;
Saini, Kamal .
E-HEALTH - FOR CONTINUITY OF CARE, 2014, 205 :823-827
[3]  
Butte AJ, 2000, J AM MED INFORM ASSN, P111
[4]   Overcoming barriers to NLP for clinical text: the role of shared tasks and the need for additional creative solutions [J].
Chapman, Wendy W. ;
Nadkarni, Prakash M. ;
Hirschman, Lynette ;
D'Avolio, Leonard W. ;
Savova, Guergana K. ;
Uzuner, Ozlem .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2011, 18 (05) :540-543
[5]   A simple algorithm for identifying negated findings and diseases in discharge summaries [J].
Chapman, WW ;
Bridewell, W ;
Hanbury, P ;
Cooper, GF ;
Buchanan, BG .
JOURNAL OF BIOMEDICAL INFORMATICS, 2001, 34 (05) :301-310
[6]  
Crane K, 2010, NCI CANC B, V7, P8
[7]   Developing and evaluating an automated appendicitis risk stratification algorithm for pediatric patients in the emergency department [J].
Deleger, Louise ;
Brodzinski, Holly ;
Zhai, Haijun ;
Li, Qi ;
Lingren, Todd ;
Kirkendall, Eric S. ;
Alessandrini, Evaline ;
Solti, Imre .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2013, 20 (E2) :E212-E220
[8]  
Demner-Fushman D, 2011, 20 TEXT RETRIEVAL C
[9]   Key factors in the rising cost of new drug discovery and development [J].
Dickson, M ;
Gagnon, JP .
NATURE REVIEWS DRUG DISCOVERY, 2004, 3 (05) :417-429
[10]  
Ding Jing, 2008, AMIA Annu Symp Proc, P931