Semi-automated screening of biomedical citations for systematic reviews

被引:210
|
作者
Wallace, Byron C. [1 ,2 ]
Trikalinos, Thomas A. [2 ]
Lau, Joseph [2 ]
Brodley, Carla [1 ]
Schmid, Christopher H. [3 ]
机构
[1] Tufts Univ, Dept Comp Sci, Medford, MA 02155 USA
[2] Tufts Med Ctr, Ctr Clin Evidence Synth, Inst Clin Res & Hlth Policy Studies, Boston, MA USA
[3] Tufts Med Ctr, Biostat Res Ctr, Inst Clin Res & Hlth Policy Studies, Boston, MA USA
来源
BMC BIOINFORMATICS | 2010年 / 11卷
关键词
TEXT CATEGORIZATION MODELS;
D O I
10.1186/1471-2105-11-55
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Systematic reviews address a specific clinical question by unbiasedly assessing and analyzing the pertinent literature. Citation screening is a time-consuming and critical step in systematic reviews. Typically, reviewers must evaluate thousands of citations to identify articles eligible for a given review. We explore the application of machine learning techniques to semi-automate citation screening, thereby reducing the reviewers' workload. Results: We present a novel online classification strategy for citation screening to automatically discriminate "relevant" from "irrelevant" citations. We use an ensemble of Support Vector Machines (SVMs) built over different feature-spaces (e.g., abstract and title text), and trained interactively by the reviewer(s). Semi-automating the citation screening process is difficult because any such strategy must identify all citations eligible for the systematic review. This requirement is made harder still due to class imbalance; there are far fewer "relevant" than "irrelevant" citations for any given systematic review. To address these challenges we employ a custom active-learning strategy developed specifically for imbalanced datasets. Further, we introduce a novel under-sampling technique. We provide experimental results over three real-world systematic review datasets, and demonstrate that our algorithm is able to reduce the number of citations that must be screened manually by nearly half in two of these, and by around 40% in the third, without excluding any of the citations eligible for the systematic review. Conclusions: We have developed a semi-automated citation screening algorithm for systematic reviews that has the potential to substantially reduce the number of citations reviewers have to manually screen, without compromising the quality and comprehensiveness of the review.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Semi-automated screening of biomedical citations for systematic reviews
    Byron C Wallace
    Thomas A Trikalinos
    Joseph Lau
    Carla Brodley
    Christopher H Schmid
    BMC Bioinformatics, 11
  • [2] Automated and Semi-Automated Approaches for Literature Searching, Screening, and Data Extraction for Systematic Reviews in Environmental Health
    Walker, V. R.
    Rooney, A. A.
    TOXICOLOGY LETTERS, 2021, 350 : S4 - S4
  • [3] Towards a Semi-Automated Approach for Systematic Literature Reviews Completed Research
    Denzler, Tim
    Enders, Martin Robert
    Akello, Patricia
    DIGITAL INNOVATION AND ENTREPRENEURSHIP (AMCIS 2021), 2021,
  • [4] Machine learning for semi-automated scoping reviews
    Mozgai, Sharon
    Kaurloto, Cari
    Winn, Jade
    Leeds, Andrew
    Heylen, Dirk
    Hartholt, Arno
    Scherer, Stefan
    INTELLIGENT SYSTEMS WITH APPLICATIONS, 2023, 19
  • [5] PERFORMANCE OF AUTOMATED SCREENING OF CITATIONS COMPARED TO HUMAN REVIEWERS IN SYSTEMATIC LITERATURE REVIEWS: A SYSTEMATIC LITERATURE REVIEW
    Lopes, R.
    Gauthier, G.
    Akhtar, O.
    Atanasov, P.
    VALUE IN HEALTH, 2018, 21 : S367 - S367
  • [6] Semi-Automated Argumentative Analysis of Online Product Reviews
    Wyner, Adam
    Schneider, Jodi
    Atkinson, Katie
    Bench-Capon, Trevor
    COMPUTATIONAL MODELS OF ARGUMENT, 2012, 245 : 43 - +
  • [7] Effectiveness and cost effectiveness of automated and semi-automated cervical screening devices: a systematic review of the literature
    Broadstock, M
    NEW ZEALAND MEDICAL JOURNAL, 2001, 114 (1135) : 311 - 313
  • [8] Decoding semi-automated title-abstract screening: findings from a convenience sample of reviews
    Gates, Allison
    Gates, Michelle
    DaRosa, Daniel
    Elliott, Sarah A.
    Pillay, Jennifer
    Rahman, Sholeh
    Ben Vandermeer
    Hartling, Lisa
    SYSTEMATIC REVIEWS, 2020, 9 (01)
  • [9] Decoding semi-automated title-abstract screening: findings from a convenience sample of reviews
    Allison Gates
    Michelle Gates
    Daniel DaRosa
    Sarah A. Elliott
    Jennifer Pillay
    Sholeh Rahman
    Ben Vandermeer
    Lisa Hartling
    Systematic Reviews, 9
  • [10] Faster title and abstract screening? Evaluating Abstrackr, a semi-automated online screening program for systematic reviewers
    Rathbone J.
    Hoffmann T.
    Glasziou P.
    Systematic Reviews, 4 (1)