Risk of bias assessment in preclinical literature using natural language processing

被引:12
作者
Wang, Qianying [1 ]
Liao, Jing [1 ]
Lapata, Mirella [2 ]
Macleod, Malcolm [1 ]
机构
[1] Univ Edinburgh, Ctr Clin Brain Sci, 49 Little France Crescent, Edinburgh EH16 4SB, Midlothian, Scotland
[2] Univ Edinburgh, Sch Informat, Edinburgh, Midlothian, Scotland
关键词
automatic assessment; natural language processing; preclinical research synthesis; risk of bias;
D O I
10.1002/jrsm.1533
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
We sought to apply natural language processing to the task of automatic risk of bias assessment in preclinical literature, which could speed the process of systematic review, provide information to guide research improvement activity, and support translation from preclinical to clinical research. We use 7840 full-text publications describing animal experiments with yes/no annotations for five risk of bias items. We implement a series of models including baselines (support vector machine, logistic regression, random forest), neural models (convolutional neural network, recurrent neural network with attention, hierarchical neural network) and models using BERT with two strategies (document chunk pooling and sentence extraction). We tune hyperparameters to obtain the highest F1 scores for each risk of bias item on the validation set and compare evaluation results on the test set to our previous regular expression approach. The F1 scores of best models on test set are 82.0% for random allocation, 81.6% for blinded assessment of outcome, 82.6% for conflict of interests, 91.4% for compliance with animal welfare regulations and 46.6% for reporting animals excluded from analysis. Our models significantly outperform regular expressions for four risk of bias items. For random allocation, blinded assessment of outcome, conflict of interests and animal exclusions, neural models achieve good performance; for animal welfare regulations, BERT model with a sentence extraction strategy works better. Convolutional neural networks are the overall best models. The tool is publicly available which may contribute to the future monitoring of risk of bias reporting for research improvement activities.
引用
收藏
页码:368 / 380
页数:13
相关论文
共 50 条
  • [41] Natural Language Processing of Social Media as Screening for Suicide Risk
    Coppersmith, Glen
    Leary, Ryan
    Crutchley, Patrick
    Fine, Alex
    BIOMEDICAL INFORMATICS INSIGHTS, 2018, 10
  • [42] Legal Judgment Prediction using Natural Language Processing and Machine Learning Methods: A Systematic Literature Review
    Dina, Nasa Zata
    Ravana, Sri Devi
    Idris, Norisma
    SAGE OPEN, 2025, 15 (02):
  • [43] Automatic summarisation of product reviews using natural language processing and machine learning methods: a literature review
    Rani, Sonia
    Walia, Tarandeep Singh
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2022, 27 (1-3) : 118 - 138
  • [44] Exploring a Language-Based Interest Assessment: Predicting Vocational Interests on Social Media Using Natural Language Processing
    Du, Yan Yi Lance
    Jain, Devansh
    Cho, Young-Min
    Hou, Daphne Xin
    Guntuku, Sharath Chandra
    Ungar, Lyle
    Tay, Louis
    JOURNAL OF CAREER ASSESSMENT, 2024,
  • [45] Identification of pancreatic cancer risk factors from clinical notes using natural language processing
    Sarwal, Dhruv
    Wang, Liwei
    Gandhi, Sonal
    Pour, Elham Sagheb Hossein
    Janssens, Laurens P.
    Delgado, Adriana M.
    Doering, Karen A.
    Mishra, Anup Kumar
    Greenwood, Jason D.
    Liu, Hongfang
    Majumder, Shounak
    PANCREATOLOGY, 2024, 24 (04) : 572 - 578
  • [46] Risk markers identification in EHR using natural language processing: hemorrhagic and ischemic stroke cases
    Grechishcheva, Sofia
    Efimov, Egor
    Metsker, Oleg
    8TH INTERNATIONAL YOUNG SCIENTISTS CONFERENCE ON COMPUTATIONAL SCIENCE, YSC2019, 2019, 156 : 142 - 149
  • [47] Automatic extraction and assessment of lifestyle exposures for Alzheimer's disease using natural language processing
    Zhou, Xin
    Wang, Yanshan
    Sohn, Sunghwan
    Therneau, Terry M.
    Liu, Hongfang
    Knopman, David S.
    INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2019, 130
  • [48] Classification of Poverty Condition Using Natural Language Processing
    Muneton-Santa, Guberney
    Escobar-Grisales, Daniel
    Orlando Lopez-Pabon, Felipe
    Perez-Toro, Paula Andrea
    Rafael Orozco-Arroyave, Juan
    SOCIAL INDICATORS RESEARCH, 2022, 162 (03) : 1413 - 1435
  • [49] Automating curation using a natural language processing pipeline
    Alex B.
    Grover C.
    Haddow B.
    Kabadjov M.
    Klein E.
    Matthews M.
    Tobin R.
    Wang X.
    Genome Biology, 9 (Suppl 2)
  • [50] Semantic Search Engine Using Natural Language Processing
    Pandiarajan, Sudhakar
    Yazhmozhi, V. M.
    Kumar, P. Praveen
    ADVANCED COMPUTER AND COMMUNICATION ENGINEERING TECHNOLOGY, 2015, 315 : 561 - 571