Pinpointing needles in giant haystacks: use of text mining to reduce impractical screening workload in extremely large scoping reviews

被引:95
作者
Shemilt, Ian [1 ]
Simon, Antonia [2 ]
Hollands, Gareth J. [1 ]
Marteau, Theresa M. [1 ]
Ogilvie, David [1 ]
O'Mara-Eves, Alison [3 ]
Kelly, Michael P. [4 ]
Thomas, James [3 ]
机构
[1] Univ Cambridge, Behaviour & Hlth Res Unit, Cambridge CB2 0SR, England
[2] Inst Educ, Thomas Coram Res Unit, Dept Children & Hlth, London, England
[3] Inst Educ, Evidence Policy & Practice Informat & Coordinat C, Dept Children & Hlth, London, England
[4] Natl Inst Hlth & Care Excellence, Ctr Publ Hlth, London, England
关键词
text mining; scoping review methods; systematic review methods; study selection; SYSTEMATIC REVIEWS; CLASSIFICATION;
D O I
10.1002/jrsm.1093
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
In scoping reviews, boundaries of relevant evidence may be initially fuzzy, with refined conceptual understanding of interventions and their proposed mechanisms of action an intended output of the scoping process rather than its starting point. Electronic searches are therefore sensitive, often retrieving very large record sets that are impractical to screen in their entirety. This paper describes methods for applying and evaluating the use of text mining (TM) technologies to reduce impractical screening workload in reviews, using examples of two extremely large-scale scoping reviews of public health evidence (choice architecture (CA) and economic environment (EE)). Electronic searches retrieved >800,000 (CA) and >1 million (EE) records. TM technologies were used to prioritise records for manual screening. TM performance was measured prospectively. TM reduced manual screening workload by 90% (CA) and 88% (EE) compared with conventional screening (absolute reductions of approximate to 430 000 (CA) and approximate to 378 000 (EE) records). This study expands an emerging corpus of empirical evidence for the use of TM to expedite study selection in reviews. By reducing screening workload to manageable levels, TM made it possible to assemble and configure large, complex evidence bases that crossed research discipline boundaries. These methods are transferable to other scoping and systematic reviews incorporating conceptual development or explanatory dimensions. (C) 2013 The Authors. Research Synthesis Methods published by John Wiley & Sons, Ltd.
引用
收藏
页码:31 / 49
页数:19
相关论文
共 42 条
  • [1] Supporting Systematic Reviews Using Text Mining
    Ananiadou, Sophia
    Rea, Brian
    Okazaki, Naoaki
    Procter, Rob
    Thomas, James
    [J]. SOCIAL SCIENCE COMPUTER REVIEW, 2009, 27 (04) : 509 - 523
  • [2] [Anonymous], 2013, EFFECTS CHANGES EC E
  • [3] [Anonymous], 20 COCHR C AUCK NZ
  • [4] [Anonymous], P 28 INT C MACH LEAR
  • [5] [Anonymous], 2010, APPL BEH INS HLTH BE
  • [6] [Anonymous], IDENTIFYING REV EVID
  • [7] [Anonymous], PRIMARY HLTH CAR RES
  • [8] [Anonymous], INTRO SYSTEMATIC REV
  • [9] [Anonymous], 2012, VALUE BENEFIT TEXT M
  • [10] [Anonymous], INT J DIGITAL LIB