Machine learning reduced workload with minimal risk of missing studies: development and evaluation of a randomized controlled trial classifier for Cochrane Reviews

被引:102
作者
Thomas, James [1 ]
McDonald, Steve [2 ]
Noel-Storr, Anna [3 ,4 ]
Shemilt, Ian [1 ]
Elliott, Julian [5 ,6 ]
Mavergames, Chris [4 ]
Marshall, Iain J. [7 ]
机构
[1] UCL, EPPI Ctr, UCL Social Res Inst, London, England
[2] Monash Univ, Sch Publ Hlth & Prevent Med, Cochrane Australia, Melbourne, Vic, Australia
[3] Univ Oxford, Radcliffe Dept Med, London, England
[4] Cochrane, London, England
[5] Monash Univ, Dept Infect Dis, Melbourne, Vic, Australia
[6] Alfred Hosp, Melbourne, Vic, Australia
[7] Kings Coll London, Sch Populat Hlth & Environm Sci, London, England
基金
英国医学研究理事会;
关键词
Machine learning; Study classifiers; Searching; Information retrieval; Methods/methodology; Randomized controlled trials; Systematic reviews; Automation; Crowdsourcing; Cochrane Library; SYSTEMATIC REVIEWS;
D O I
10.1016/j.jclinepi.2020.11.003
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Objectives: This study developed, calibrated, and evaluated a machine learning classifier designed to reduce study identification workload in Cochrane for producing systematic reviews. Methods: A machine learning classifier for retrieving randomized controlled trials (RCTs) was developed (the "Cochrane RCT Classifier''), with the algorithm trained using a data set of title-abstract records from Embase, manually labeled by the Cochrane Crowd. The classifier was then calibrated using a further data set of similar records manually labeled by the Clinical Hedges team, aiming for 99% recall. Finally, the recall of the calibrated classifier was evaluated using records of RCTs included in Cochrane Reviews that had abstracts of sufficient length to allow machine classification. Results: The Cochrane RCT Classifier was trained using 280,620 records (20,454 of which reported RCTs). A classification threshold was set using 49,025 calibration records (1,587 of which reported RCTs), and our bootstrap validation found the classifier had recall of 0.99 (95% confidence interval 0.98-0.99) and precision of 0.08 (95% confidence interval 0.06-0.12) in this data set. The final, calibrated RCT classifier correctly retrieved 43,783 (99.5%) of 44,007 RCTs included in Cochrane Reviews but missed 224 (0.5%). Older records were more likely to be missed than those more recently published. Conclusions: The Cochrane RCT Classifier can reduce manual study identification workload for Cochrane Reviews, with a very low and acceptable risk of missing eligible RCTs. This classifier now forms part of the Evidence Pipeline, an integrated workflow deployed within Cochrane to help improve the efficiency of the study identification processes that support systematic review production. (C) 2020 The Authors. Published by Elsevier Inc.
引用
收藏
页码:140 / 151
页数:12
相关论文
共 22 条
[1]  
[Anonymous], 1999, PROBABILISTIC OUTPUT
[2]  
[Anonymous], 2019, Cochrane Library
[3]   Seventy-Five Trials and Eleven Systematic Reviews a Day: How Will We Ever Keep Up? [J].
Bastian, Hilda ;
Glasziou, Paul ;
Chalmers, Iain .
PLOS MEDICINE, 2010, 7 (09)
[4]  
Brier G., 1950, MON WEATHER REV
[5]  
Cochrane, 2019, Cochrane Register of Studies (CRS)
[6]  
Fffff K.F., 2010, FFFBMC MED, V8, DOI [DOI 10.1186/1741-7015-8-18FFFF, 10.1186/1741-7015-8-18ffff]
[7]  
LefebvreC, 2019, Cochrane handbook for systematic reviews of interventions, P67, DOI [DOI 10.1002/9781119536604, 10.1002/9781119536604, DOI 10.1002/9781119536604.CH4]
[8]   Biomedical research: increasing value, reducing waste [J].
Macleod, Malcolm R. ;
Michie, Susan ;
Roberts, Ian ;
Dirnagl, Ulrich ;
Chalmers, Iain ;
Ioannidis, John P. A. ;
Salman, Rustam Al-Shahi ;
Chan, An-Wen ;
Glasziou, Paul .
LANCET, 2014, 383 (9912) :101-104
[9]   Machine learning for identifying Randomized Controlled Trials: An evaluation and practitioner's guide [J].
Marshall, Iain J. ;
Noel-Storr, Anna ;
Kuiper, Joel ;
Thomas, James ;
Wallace, Byron C. .
RESEARCH SYNTHESIS METHODS, 2018, 9 (04) :602-614
[10]   Retrieving randomized controlled trials from medline: a comparison of 38 published search filters [J].
McKibbon, Kathleen Ann ;
Wilczynski, Nancy Lou ;
Haynes, Robert Brian .
HEALTH INFORMATION AND LIBRARIES JOURNAL, 2009, 26 (03) :187-202