Data availability, reusability, and analytic reproducibility: evaluating the impact of a mandatory open data policy at the journal Cognition

Cited by: 194
Authors
Hardwicke, Tom E. [1]
Mathur, Maya B. [2, 4]
MacDonald, Kyle [3]
Nilsonne, Gustav [3, 5, 6]
Banks, George C. [7]
Kidwell, Mallory C. [8]
Mohr, Alicia Hofelich [9]
Clayton, Elizabeth [10]
Yoon, Erica J. [3]
Tessler, Michael Henry [3]
Lenne, Richie L. [11]
Altman, Sara [3]
Long, Bria [3]
Frank, Michael C. [3]
Affiliations
[1] Stanford Univ, Meta Res Innovat Ctr Stanford METRICS, Palo Alto, CA 94304 USA
[2] Stanford Univ, Quantitat Sci Unit, Palo Alto, CA 94304 USA
[3] Stanford Univ, Dept Psychol, Palo Alto, CA 94304 USA
[4] Harvard Univ, Harvard Biostat, Cambridge, MA 02138 USA
[5] Stockholm Univ, Stress Res Inst, Stockholm, Sweden
[6] Karolinska Inst, Dept Clin Neurosci, Stockholm, Sweden
[7] Univ N Carolina, Belk Coll Business, Charlotte, NC USA
[8] Univ Utah, Dept Psychol, Salt Lake City, UT 84112 USA
[9] Univ Minnesota, LATIS, Minneapolis, MN USA
[10] Univ N Carolina, Org Sci Program, Charlotte, NC USA
[11] Univ Minnesota, Dept Psychol, Minneapolis, MN USA
Funding
National Science Foundation (USA)
Keywords
open data; reproducibility; open science; meta-science; interrupted time series; journal policy; ODDS RATIO; PSYCHOLOGY; LESSONS; SCIENCE
DOI
10.1098/rsos.180448
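Chinese Library Classification number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]
Discipline classification code
07; 0710; 09
Abstract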
Access to data is a critical feature of an efficient, progressive and ultimately self-correcting scientific ecosystem. But the extent to which in-principle benefits of data sharing are realized in practice is unclear. Crucially, it is largely unknown whether published findings can be reproduced by repeating reported analyses upon shared data ('analytic reproducibility'). To investigate this, we conducted an observational evaluation of a mandatory open data policy introduced at the journal Cognition. Interrupted time-series analyses indicated a substantial post-policy increase in data available statements (104/417, 25% pre-policy to 136/174, 78% post-policy), although not all data appeared reusable (23/104, 22% pre-policy to 85/136, 62% post-policy). For 35 of the articles determined to have reusable data, we attempted to reproduce 1324 target values. Ultimately, 64 values could not be reproduced within a 10% margin of error. For 22 articles all target values were reproduced, but 11 of these required author assistance. For 13 articles at least one value could not be reproduced despite author assistance. Importantly, there were no clear indications that original conclusions were seriously impacted. Mandatory open data policies can increase the frequency and quality of data sharing. However, suboptimal data curation, unclear analysis specification and reporting errors can impede analytic reproducibility, undermining the utility of data sharing and the credibility of scientific findings.
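The proportions quoted in the abstract, and the idea of checking a reproduced value against a 10% margin, can be illustrated with a short sketch. The counts below are taken from the abstract. The function names, the exact definition of percentage error (absolute deviation relative to the reported value), the strict inequality at the threshold, and the example values passed to `within_margin` are assumptions made for illustration; the abstract does not specify them.

```python
import math


def percent(count, total):
    """Return count/total expressed as a percentage."""
    return 100.0 * count / total


def percentage_error(reported, reproduced):
    """Deviation of a reproduced value from the reported one, in percent.

    Assumed definition: |reproduced - reported| / |reported| * 100.
    """
    if reported == 0:
        return 0.0 if reproduced == 0 else math.inf
    return 100.0 * abs(reproduced - reported) / abs(reported)


def within_margin(reported, reproduced, margin=10.0):
    """True if the reproduced value deviates by less than `margin` percent."""
    return percentage_error(reported, reproduced) < margin


# Counts quoted in the abstract: (numerator, denominator).
reported_counts = {
    "data available statements, pre-policy": (104, 417),
    "data available statements, post-policy": (136, 174),
    "reusable data, pre-policy": (23, 104),
    "reusable data, post-policy": (85, 136),
    "target values not reproduced": (64, 1324),
}

for label, (count, total) in reported_counts.items():
    print(f"{label}: {count}/{total} = {percent(count, total):.1f}%")

# Hypothetical reported/reproduced pairs, not taken from the study.
print(within_margin(2.50, 2.61))  # deviation of about 4.4%
print(within_margin(2.50, 2.80))  # deviation of 12%
```

The article counts in the abstract are also internally consistent: 22 articles with all values reproduced plus 13 with at least one unreproduced value gives the 35 articles examined.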
Pages: 18