The FAIR database: facilitating access to public health research literature

被引:0
作者
Zhao, Zhixue [1 ]
Thomas, James [2 ]
Kell, Gregory [3 ]
Stansfield, Claire [2 ]
Clowes, Mark [4 ]
Graziosi, Sergio [2 ]
Brunton, Jeff [2 ]
Marshall, Iain James [3 ]
Stevenson, Mark [1 ]
机构
[1] Univ Sheffield, Dept Comp Sci, Sheffield S10 2TN, England
[2] UCL, UCL Social Res Inst, Inst Educ, EPPI Ctr, 10 Woburn Sq, London WC1H 0NS, England
[3] Univ Sheffield, Sch Med & Populat Hlth, Sheffield S10 2TN, England
[4] Kings Coll London, Fac Life Sci & Med, Sch Life Course & Populat Sci, Dept Populat Hlth Sci, London WC2R 2LS, England
基金
美国国家卫生研究院;
关键词
evidence synthesis; research synthesis; public health; inequalities; machine learning; automatic database curation; INTERVENTIONS;
D O I
10.1093/jamiaopen/ooae139
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Objectives In public health, access to research literature is critical to informing decision-making and to identify knowledge gaps. However, identifying relevant research is not a straightforward task since public health interventions are often complex, can have positive and negative impacts on health inequalities and are applied in diverse and rapidly evolving settings. We developed a "living" database of public health research literature to facilitate access to this information using Natural Language Processing tools.Materials and Methods Classifiers were identified to identify the study design (eg, cohort study or clinical trial) and relationship to factors that may be relevant to inequalities using the PROGRESS-Plus classification scheme. Training data were obtained from existing MEDLINE labels and from a set of systematic reviews in which studies were annotated with PROGRESS-Plus categories.Results Evaluation of the classifiers showed that the study type classifier achieved average precision and recall of 0.803 and 0.930, respectively. The PROGRESS-Plus classification proved more challenging with average precision and recall of 0.608 and 0.534. The FAIR database uses information provided by these classifiers to facilitate access to inequality-related public health literature.Discussion Previous work on automation of evidence synthesis has focused on clinical areas rather than public health, despite the need being arguably greater.Conclusion The development of the FAIR database demonstrates that it is possible to create a publicly accessible and regularly updated database of public health research literature focused on inequalities. The database is freely available from https://eppi.ioe.ac.uk/eppi-vis/Fair.NETSCC ID number NIHR133603. When people are making decisions about which services to commission, or which treatments to use, it is important to consider the research evidence. Even well-intentioned interventions can increase inequalities in health between different population groups, so understanding potential impacts here is critical given the need to reduce inequalities. Unfortunately, it is very difficult to find research about inequalities, owing to the way that research is published and stored in large databases (often containing millions of articles).To address the above problem, we built the FAIR database that locates public health research and identifies those papers that are about inequalities. It then uses machine learning to catalog the papers and makes them easy for decision-makers to find. This research paper reports the work involved in developing the database, and the research we undertook to evaluate how accurate the machine learning components are. We found that the machine learning was quite good at identifying the type of research reported but was much more variable when identifying specific types of inequalities. This was due to a lack of data for training the machine learning. We would expect better performance if more training data were available.
引用
收藏
页数:9
相关论文
共 21 条
[1]   Refining Boolean queries to identify relevant studies for systematic review updates [J].
Alharbi, Amal ;
Stevenson, Mark .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2020, 27 (11) :1658-1666
[2]  
Donnelly CA, 2018, NATURE, V558, P361, DOI 10.1038/d41586-018-05414-4
[3]   Machine learning computational tools to assist the performance of systematic reviews: A mapping review [J].
Jimenez, Ramon Cierco ;
Lee, Teresa ;
Rosillo, Nicolas ;
Cordova, Reynalda ;
Cree, Ian A. ;
Gonzalez, Angel ;
Ruiz, Blanca Iciar Indave .
BMC MEDICAL RESEARCH METHODOLOGY, 2022, 22 (01)
[4]   Automating data extraction in systematic reviews: A systematic review [J].
Jonnalagadda S.R. ;
Goyal P. ;
Huffman M.D. .
Systematic Reviews, 4 (1)
[5]   ExaCT: automatic extraction of clinical trial characteristics from journal publications [J].
Kiritchenko, Svetlana ;
de Bruijn, Berry ;
Carini, Simona ;
Martin, Joel ;
Sim, Ida .
BMC MEDICAL INFORMATICS AND DECISION MAKING, 2010, 10
[6]   What types of interventions generate inequalities? Evidence from systematic reviews [J].
Lorenc, Theo ;
Petticrew, Mark ;
Welch, Vivian ;
Tugwell, Peter .
JOURNAL OF EPIDEMIOLOGY AND COMMUNITY HEALTH, 2013, 67 (02) :190-193
[7]  
Marshall I., 2015, J Am Med Informatics Assoc, V23, P1
[8]   Trialstreamer: A living, automatically updated database of clinical trial reports [J].
Marshall, Iain J. ;
Nye, Benjamin ;
Kuiper, Joel ;
Noel-Storr, Anna ;
Marshall, Rachel ;
Maclean, Rory ;
Soboczenski, Frank ;
Nenkova, Ani ;
Thomas, James ;
Wallace, Byron C. .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2020, 27 (12) :1903-1912
[9]   Evaluation of public health interventions from a complex systems perspective: A research methods review [J].
McGill, Elizabeth ;
Er, Vanessa ;
Penney, Tarra ;
Egan, Matt ;
White, Martin ;
Meier, Petra ;
Whitehead, Margaret ;
Lock, Karen ;
de Cuevas, Rachel Anderson ;
Smith, Richard ;
Savona, Natalie ;
Rutter, Harry ;
Marks, Dalya ;
de Vocht, Frank ;
Cummins, Steven ;
Popay, Jennie ;
Petticrew, Mark .
SOCIAL SCIENCE & MEDICINE, 2021, 272
[10]   Using text mining for study identification in systematic reviews: A systematic review of current approaches [J].
O'Mara-Eves A. ;
Thomas J. ;
McNaught J. ;
Miwa M. ;
Ananiadou S. .
Systematic Reviews, 4 (1)