Risk of bias assessment in preclinical literature using natural language processing

被引:12
|
作者
Wang, Qianying [1 ]
Liao, Jing [1 ]
Lapata, Mirella [2 ]
Macleod, Malcolm [1 ]
机构
[1] Univ Edinburgh, Ctr Clin Brain Sci, 49 Little France Crescent, Edinburgh EH16 4SB, Midlothian, Scotland
[2] Univ Edinburgh, Sch Informat, Edinburgh, Midlothian, Scotland
关键词
automatic assessment; natural language processing; preclinical research synthesis; risk of bias;
D O I
10.1002/jrsm.1533
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
We sought to apply natural language processing to the task of automatic risk of bias assessment in preclinical literature, which could speed the process of systematic review, provide information to guide research improvement activity, and support translation from preclinical to clinical research. We use 7840 full-text publications describing animal experiments with yes/no annotations for five risk of bias items. We implement a series of models including baselines (support vector machine, logistic regression, random forest), neural models (convolutional neural network, recurrent neural network with attention, hierarchical neural network) and models using BERT with two strategies (document chunk pooling and sentence extraction). We tune hyperparameters to obtain the highest F1 scores for each risk of bias item on the validation set and compare evaluation results on the test set to our previous regular expression approach. The F1 scores of best models on test set are 82.0% for random allocation, 81.6% for blinded assessment of outcome, 82.6% for conflict of interests, 91.4% for compliance with animal welfare regulations and 46.6% for reporting animals excluded from analysis. Our models significantly outperform regular expressions for four risk of bias items. For random allocation, blinded assessment of outcome, conflict of interests and animal exclusions, neural models achieve good performance; for animal welfare regulations, BERT model with a sentence extraction strategy works better. Convolutional neural networks are the overall best models. The tool is publicly available which may contribute to the future monitoring of risk of bias reporting for research improvement activities.
引用
收藏
页码:368 / 380
页数:13
相关论文
共 50 条
  • [21] Program Synthesis and Natural Language Processing: A Systematic Literature Review
    Ramirez-Rueda, Rolando
    Benitez-Guerrero, Edgard
    Mezura-Godoy, Carmen
    Barcenas, Everardo
    2023 11TH INTERNATIONAL CONFERENCE IN SOFTWARE ENGINEERING RESEARCH AND INNOVATION, CONISOFT 2023, 2023, : 275 - 282
  • [22] Cyber threat assessment and management for securing healthcare ecosystems using natural language processing
    Silvestri, Stefano
    Islam, Shareful
    Amelin, Dmitry
    Weiler, Gabriele
    Papastergiou, Spyridon
    Ciampi, Mario
    INTERNATIONAL JOURNAL OF INFORMATION SECURITY, 2024, 23 (01) : 31 - 50
  • [23] Cyber threat assessment and management for securing healthcare ecosystems using natural language processing
    Stefano Silvestri
    Shareful Islam
    Dmitry Amelin
    Gabriele Weiler
    Spyridon Papastergiou
    Mario Ciampi
    International Journal of Information Security, 2024, 23 : 31 - 50
  • [24] Using Adversarial Examples in Natural Language Processing
    Belohlavek, Petr
    Platek, Ondrej
    Zabokrtsky, Zdenek
    Straka, Milan
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 3693 - 3700
  • [25] Objects Assessment Approach Using Natural Language Processing and Data Quality to Support Emergency Situation Assessment
    Sanches, Matheus F.
    Junior, Valdir A. P.
    Souza, Jessica O.
    Coneglian, Caio S.
    Jorge, Fabio R.
    Oliveira, Natalia P.
    Botega, Leonardo C.
    HCI INTERNATIONAL 2016 - POSTERS' EXTENDED ABSTRACTS, PT I, 2016, 617 : 238 - 244
  • [26] Emotion detection using natural language processing
    Nunez, Antonio alvarez
    Diaz, Maria del Carmen Santiago
    Vazquez, Ana Claudia Zenteno
    Marcial, Judith Perez
    Linares, Gustavo Trinidad Rubin
    INTERNATIONAL JOURNAL OF COMBINATORIAL OPTIMIZATION PROBLEMS AND INFORMATICS, 2024, 15 (05): : 108 - 114
  • [27] Using Natural Language Processing for Phishing Detection
    Jonker, Richard Adolph Aires
    Poudel, Roshan
    Pedrosa, Tiago
    Lopes, Rui Pedro
    OPTIMIZATION, LEARNING ALGORITHMS AND APPLICATIONS, OL2A 2021, 2021, 1488 : 540 - 552
  • [28] Generation of Oracles using Natural Language Processing
    Leong, Iat Tou
    Barbosa, Raul
    2021 28TH ASIA-PACIFIC SOFTWARE ENGINEERING CONFERENCE WORKSHOPS (APSECW 2021), 2021, : 25 - 31
  • [29] A survey on extremism analysis using natural language processing: definitions, literature review, trends and challenges
    Torregrosa, Javier
    Bello-Orgaz, Gema
    Martinez-Camara, Eugenio
    Del Ser, Javier
    Camacho, David
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2022, 14 (8) : 9869 - 9905
  • [30] A survey on extremism analysis using natural language processing: definitions, literature review, trends and challenges
    Javier Torregrosa
    Gema Bello-Orgaz
    Eugenio Martínez-Cámara
    Javier Del Ser
    David Camacho
    Journal of Ambient Intelligence and Humanized Computing, 2023, 14 : 9869 - 9905