A Statistical Model to Investigate the Reproducibility Rate Based on Replication Experiments

被引:2
作者
Pauli, Francesco [1 ]
机构
[1] Univ Trieste, DEAMS, Ple Europa 1, I-34127 Trieste, Italy
关键词
p-value; false discovery rate; reproducibility crisis; mixture model; P VALUES; REPLICABILITY;
D O I
10.1111/insr.12273
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The reproducibility crisis, that is, the fact that many scientific results are difficult to replicate, pointing to their unreliability or falsehood, is a hot topic in the recent scientific literature, and statistical methodologies, testing procedures and p-values, in particular, are at the centre of the debate. Assessment of the extent of the problem-the reproducibility rate or the false discovery rate-and the role of contributing factors are still an open problem. Replication experiments, that is, systematic replications of existing results, may offer relevant information on these issues. We propose a statistical model to deal with such information, in particular to estimate the reproducibility rate and the effect of some study characteristics on its reliability. We analyse data from a recent replication experiment in psychology finding a reproducibility rate broadly coherent with other assessments from the same experiment. Our results also confirm the expected role of some contributing factor (unexpectedness of the result and room for bias) while they suggest that the similarity between original study and the replica is not so relevant, thus mitigating some criticism directed to replication experiments.
引用
收藏
页码:68 / 79
页数:12
相关论文
共 51 条
  • [1] Estimating the reproducibility of psychological science
    Aarts, Alexander A.
    Anderson, Joanna E.
    Anderson, Christopher J.
    Attridge, Peter R.
    Attwood, Angela
    Axt, Jordan
    Babel, Molly
    Bahnik, Stepan
    Baranski, Erica
    Barnett-Cowan, Michael
    Bartmess, Elizabeth
    Beer, Jennifer
    Bell, Raoul
    Bentley, Heather
    Beyan, Leah
    Binion, Grace
    Borsboom, Denny
    Bosch, Annick
    Bosco, Frank A.
    Bowman, Sara D.
    Brandt, Mark J.
    Braswell, Erin
    Brohmer, Hilmar
    Brown, Benjamin T.
    Brown, Kristina
    Bruening, Jovita
    Calhoun-Sauls, Ann
    Callahan, Shannon P.
    Chagnon, Elizabeth
    Chandler, Jesse
    Chartier, Christopher R.
    Cheung, Felix
    Christopherson, Cody D.
    Cillessen, Linda
    Clay, Russ
    Cleary, Hayley
    Cloud, Mark D.
    Cohn, Michael
    Cohoon, Johanna
    Columbus, Simon
    Cordes, Andreas
    Costantini, Giulio
    Alvarez, Leslie D. Cramblet
    Cremata, Ed
    Crusius, Jan
    DeCoster, Jamie
    DeGaetano, Michelle A.
    Della Penna, Nicolas
    den Bezemer, Bobby
    Deserno, Marie K.
    [J]. SCIENCE, 2015, 349 (6251)
  • [2] Response to Comment on "Estimating the reproducibility of psychological science"
    Anderson, Christopher J.
    Bahnik, Stepan
    Barnett-Cowan, Michael
    Bosco, Frank A.
    Chandler, Jesse
    Chartier, Christopher R.
    Cheung, Felix
    Christopherson, Cody D.
    Cordes, Andreas
    Cremata, Edward J.
    Della Penna, Nicolas
    Estel, Vivien
    Fedor, Anna
    Fitneva, Stanka A.
    Frank, Michael C.
    Grange, James A.
    Hartshorne, Joshua K.
    Hasselman, Fred
    Henninger, Felix
    van der Hulst, Marije
    Jonas, Kai J.
    Lai, Calvin K.
    Levitan, Carmel A.
    Miller, Jeremy K.
    Moore, Katherine S.
    Meixner, Johannes M.
    Munafo, Marcus R.
    Neijenhuijs, Koen I.
    Nilsonne, Gustav
    Nosek, Brian A.
    Plessow, Franziska
    Prenoveau, Jason M.
    Ricker, Ashley A.
    Schmidt, Kathleen
    Spies, Jeffrey R.
    Stieger, Stefan
    Strohminger, Nina
    Sullivan, Gavin B.
    van Aert, Robbie C. M.
    van Assen, Marcel A. L. M.
    Vanpaemel, Wolf
    Vianello, Michelangelo
    Voracek, Martin
    Zuni, Kellylynn
    [J]. SCIENCE, 2016, 351 (6277)
  • [3] [Anonymous], STAT CHALL ASS FOST
  • [4] [Anonymous], 2017, RSTAN R INTERFACE ST
  • [5] Baker M, 2016, NATURE, V533, P452, DOI 10.1038/533452a
  • [6] Raise standards for preclinical cancer research
    Begley, C. Glenn
    Ellis, Lee M.
    [J]. NATURE, 2012, 483 (7391) : 531 - 533
  • [7] Could Fisher, Jeffreys and Neyman have agreed on testing?
    Berger, JO
    [J]. STATISTICAL SCIENCE, 2003, 18 (01) : 1 - 12
  • [8] P-Value Precision and Reproducibility
    Boos, Dennis D.
    Stefanski, Leonard A.
    [J]. AMERICAN STATISTICIAN, 2011, 65 (04) : 213 - 221
  • [9] Star Wars: The Empirics Strike Back
    Brodeur, Abel
    Le, Mathias
    Sangnier, Marc
    Zylberberg, Yanos
    [J]. AMERICAN ECONOMIC JOURNAL-APPLIED ECONOMICS, 2016, 8 (01) : 1 - 32
  • [10] New concerns raised over value of genome-wide disease studies
    Callaway, Ewen
    [J]. NATURE, 2017, 546 (7659) : 463 - 463