Day-to-day discovery of preprint-publication links

被引:26
作者
Cabanac, Guillaume [1 ]
Oikonomidi, Theodora [2 ]
Boutron, Isabelle [2 ,3 ,4 ]
机构
[1] Univ Toulouse, Comp Sci Dept, IRIT, UMR 5505,CNRS, 118 Route Narbonne, F-31062 Toulouse 9, France
[2] Univ Paris, Ctr Res Epidemiol & Stat CRESS, INSERM, F-75004 Paris, France
[3] Hop Hotel Dieu, AP HP, Ctr Epidemiol Clin, F-75004 Paris, France
[4] Cochrane France, F-75004 Paris, France
关键词
Data linking; Preprint; Publication; Living systematic review; COVID-19; E-PRINTS;
D O I
10.1007/s11192-021-03900-7
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Preprints promote the open and fast communication of non-peer reviewed work. Once a preprint is published in a peer-reviewed venue, the preprint server updates its web page: a prominent hyperlink leading to the newly published work is added. Linking preprints to publications is of utmost importance as it provides readers with the latest version of a now certified work. Yet leading preprint servers fail to identify all existing preprint-publication links. This limitation calls for a more thorough approach to this critical information retrieval task: overlooking published evidence translates into partial and even inaccurate systematic reviews on health-related issues, for instance. We designed an algorithm leveraging the Crossref public and free source of bibliographic metadata to comb the literature for preprint-publication links. We tested it on a reference preprint set identified and curated for a living systematic review on interventions for preventing and treating COVID-19 performed by international collaboration: the COVID-NMA initiative (covid-nma.com). The reference set comprised 343 preprints, 121 of which appeared as a publication in a peer-reviewed journal. While the preprint servers identified 39.7% of the preprint-publication links, our linker identified 90.9% of the expected links with no clues taken from the preprint servers. The accuracy of the proposed linker is 91.5% on this reference set, with 90.9% sensitivity and 91.9% specificity. This is a 16.26% increase in accuracy compared to that of preprint servers. We release this software as supplementary material to foster its integration into preprint servers' workflows and enhance a daily preprint-publication chase that is useful to all readers, including systematic reviewers. This preprint-publication linker currently provides day-to-day updates to the biomedical experts of the COVID-NMA initiative.
引用
收藏
页码:5285 / 5304
页数:20
相关论文
共 33 条
[1]   International authorship and collaboration across bioRxiv preprints [J].
Abdill, Richard J. ;
Adamowicz, Elizabeth M. ;
Blekhman, Ran .
ELIFE, 2020, 9 :1-17
[2]   Tracking the popularity and outcomes of all bioRxiv preprints [J].
Abdill, Richard J. ;
Blekhman, Ran .
ELIFE, 2019, 8
[3]   bioRxiv: Trends and analysis of five years of preprints [J].
Anderson, Kent R. .
LEARNED PUBLISHING, 2020, 33 (02) :104-109
[4]  
[Anonymous], 2020, MEDRXIV
[5]   Where have all the working papers gone? Evidence from four major economics working paper series [J].
Baumann, Alexandra ;
Wohlrabe, Klaus .
SCIENTOMETRICS, 2020, 124 (03) :2433-2441
[6]  
Boutron I, 2020, Cochrane Database Syst Rev, DOI DOI 10.1002/14651858.CD013769
[7]   The COVID-NMA Project: Building an Evidence Ecosystem for the COVID-19 Pandemic [J].
Boutron, Isabelle ;
Chaimani, Anna ;
Meerpohl, Joerg J. ;
Hrobjartsson, Asbjorn ;
Devane, Declan ;
Rada, Gabriel ;
Tovey, David ;
Grasselli, Giacomo ;
Ravaud, Philippe .
ANNALS OF INTERNAL MEDICINE, 2020, 173 (12) :1015-+
[8]   New tools aim to tame pandemic paper tsunami [J].
Brainard, Jeffrey .
SCIENCE, 2020, 368 (6494) :924-925
[9]   Living Systematic Reviews: An Emerging Opportunity to Narrow the Evidence-Practice Gap [J].
Elliott, Julian H. ;
Turner, Tari ;
Clavisi, Ornella ;
Thomas, James ;
Higgins, Julian P. T. ;
Mavergames, Chris ;
Gruen, Russell L. .
PLOS MEDICINE, 2014, 11 (02)
[10]  
Fraser N, 2020, PREPRINTING PANDEMIC, DOI [10.1101/2020.05.22.111294., DOI 10.1101/2020.05.22.111294, 10.1101/2020.05.22.111294]