Evaluating the replicability of social science experiments in Nature and Science between 2010 and 2015

Cited by: 665
Authors
Camerer, Colin F. [1 ]
Dreber, Anna [2 ]
Holzmeister, Felix [3 ]
Ho, Teck-Hua [4 ]
Huber, Juergen [3 ]
Johannesson, Magnus [2 ]
Kirchler, Michael [3 ,5 ]
Nave, Gideon [6 ]
Nosek, Brian A. [7 ,8 ]
Pfeiffer, Thomas [9 ]
Altmejd, Adam [2 ]
Buttrick, Nick [7 ,8 ]
Chan, Taizan [10 ]
Chen, Yiling [11 ]
Forsell, Eskil [12 ]
Gampa, Anup [7 ,8 ]
Heikensten, Emma [2 ]
Hummer, Lily [8 ]
Imai, Taisuke [13 ]
Isaksson, Siri [2 ]
Manfredi, Dylan [6 ]
Rose, Julia [3 ]
Wagenmakers, Eric-Jan [14 ]
Wu, Hang [15 ]
Affiliations
[1] CALTECH, Pasadena, CA 91125 USA
[2] Stockholm Sch Econ, Dept Econ, Stockholm, Sweden
[3] Univ Innsbruck, Dept Banking & Finance, Innsbruck, Austria
[4] Natl Univ Singapore, NUS Business Sch, Singapore, Singapore
[5] Univ Goteborg, Dept Econ, Ctr Finance, Gothenburg, Sweden
[6] Univ Penn, Wharton Sch, Philadelphia, PA 19104 USA
[7] Univ Virginia, Dept Psychol, Gilmer Hall, Charlottesville, VA 22903 USA
[8] Ctr Open Sci, Charlottesville, VA 22903 USA
[9] New Zealand Inst Adv Study, Auckland, New Zealand
[10] Natl Univ Singapore, Off Senior Deputy President & Provost, Singapore, Singapore
[11] Harvard Univ, John A Paulson Sch Engn & Appl Sci, Cambridge, MA 02138 USA
[12] Spotify Sweden AB, Stockholm, Sweden
[13] Ludwig Maximilians Univ Munchen, Dept Econ, Munich, Germany
[14] Univ Amsterdam, Dept Psychol, Amsterdam, Netherlands
[15] Harbin Inst Technol, Sch Management, Harbin, Heilongjiang, Peoples R China
Funding
Austrian Science Fund; National Research Foundation, Singapore;
Keywords
REPLICATION; REPRODUCIBILITY; CONSEQUENCES; CANCER; SIZE;
DOI
10.1038/s41562-018-0399-z
Chinese Library Classification number
B84 [Psychology];
Discipline classification code
04 ; 0402 ;
Abstract
Being able to replicate scientific findings is crucial for scientific progress [1-15]. We replicate 21 systematically selected experimental studies in the social sciences published in Nature and Science between 2010 and 2015 [16-36]. The replications follow analysis plans reviewed by the original authors and pre-registered prior to the replications. The replications are high powered, with sample sizes on average about five times higher than in the original studies. We find a significant effect in the same direction as the original study for 13 (62%) studies, and the effect size of the replications is on average about 50% of the original effect size. Replicability varies between 12 (57%) and 14 (67%) studies for complementary replicability indicators. Consistent with these results, the estimated true-positive rate is 67% in a Bayesian analysis. The relative effect size of true positives is estimated to be 71%, suggesting that both false positives and inflated effect sizes of true positives contribute to imperfect reproducibility. Furthermore, we find that peer beliefs of replicability are strongly related to replicability, suggesting that the research community could predict which results would replicate and that failures to replicate were not the result of chance alone.
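As a quick check on the arithmetic in the abstract, the reported percentages follow from the study counts out of 21. The sketch below is only illustrative: the helper name `replication_rate` is hypothetical, and rounding to the nearest whole percent is an assumption about how the abstract's figures were rounded.

```python
# Counts (12, 13, 14 out of 21) are taken from the abstract.
# The helper name and the whole-percent rounding are assumptions.
TOTAL_STUDIES = 21


def replication_rate(replicated: int, total: int = TOTAL_STUDIES) -> int:
    """Return the share of replicated studies as a whole-number percentage."""
    return round(100 * replicated / total)


# Lower bound, main indicator, and upper bound reported in the abstract.
rates = {count: replication_rate(count) for count in (12, 13, 14)}
print(rates)
```

Under these assumptions, the three counts map to the 57%, 62%, and 67% figures quoted in the abstract.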
Pages: 637-644
Page count: 8
Related papers
56 entries in total
[1]   Estimating the reproducibility of psychological science [J].
Aarts, Alexander A. ;
Anderson, Joanna E. ;
Anderson, Christopher J. ;
Attridge, Peter R. ;
Attwood, Angela ;
Axt, Jordan ;
Babel, Molly ;
Bahnik, Stepan ;
Baranski, Erica ;
Barnett-Cowan, Michael ;
Bartmess, Elizabeth ;
Beer, Jennifer ;
Bell, Raoul ;
Bentley, Heather ;
Beyan, Leah ;
Binion, Grace ;
Borsboom, Denny ;
Bosch, Annick ;
Bosco, Frank A. ;
Bowman, Sara D. ;
Brandt, Mark J. ;
Braswell, Erin ;
Brohmer, Hilmar ;
Brown, Benjamin T. ;
Brown, Kristina ;
Bruening, Jovita ;
Calhoun-Sauls, Ann ;
Callahan, Shannon P. ;
Chagnon, Elizabeth ;
Chandler, Jesse ;
Chartier, Christopher R. ;
Cheung, Felix ;
Christopherson, Cody D. ;
Cillessen, Linda ;
Clay, Russ ;
Cleary, Hayley ;
Cloud, Mark D. ;
Cohn, Michael ;
Cohoon, Johanna ;
Columbus, Simon ;
Cordes, Andreas ;
Costantini, Giulio ;
Alvarez, Leslie D. Cramblet ;
Cremata, Ed ;
Crusius, Jan ;
DeCoster, Jamie ;
DeGaetano, Michelle A. ;
Della Penna, Nicolas ;
den Bezemer, Bobby ;
Deserno, Marie K. .
SCIENCE, 2015, 349 (6251)
[2]   Incidental Haptic Sensations Influence Social Judgments and Decisions [J].
Ackerman, Joshua M. ;
Nocera, Christopher C. ;
Bargh, John A. .
SCIENCE, 2010, 328 (5986) :1712-1715
[3]   Response to Comment on "Estimating the reproducibility of psychological science" [J].
Anderson, Christopher J. ;
Bahnik, Stepan ;
Barnett-Cowan, Michael ;
Bosco, Frank A. ;
Chandler, Jesse ;
Chartier, Christopher R. ;
Cheung, Felix ;
Christopherson, Cody D. ;
Cordes, Andreas ;
Cremata, Edward J. ;
Della Penna, Nicolas ;
Estel, Vivien ;
Fedor, Anna ;
Fitneva, Stanka A. ;
Frank, Michael C. ;
Grange, James A. ;
Hartshorne, Joshua K. ;
Hasselman, Fred ;
Henninger, Felix ;
van der Hulst, Marije ;
Jonas, Kai J. ;
Lai, Calvin K. ;
Levitan, Carmel A. ;
Miller, Jeremy K. ;
Moore, Katherine S. ;
Meixner, Johannes M. ;
Munafo, Marcus R. ;
Neijenhuijs, Koen I. ;
Nilsonne, Gustav ;
Nosek, Brian A. ;
Plessow, Franziska ;
Prenoveau, Jason M. ;
Ricker, Ashley A. ;
Schmidt, Kathleen ;
Spies, Jeffrey R. ;
Stieger, Stefan ;
Strohminger, Nina ;
Sullivan, Gavin B. ;
van Aert, Robbie C. M. ;
van Assen, Marcel A. L. M. ;
Vanpaemel, Wolf ;
Vianello, Michelangelo ;
Voracek, Martin ;
Zuni, Kellylynn .
SCIENCE, 2016, 351 (6277)
[4]   Economics - The promise of prediction markets [J].
Arrow, Kenneth J. ;
Forsythe, Robert ;
Gorham, Michael ;
Hahn, Robert ;
Hanson, Robin ;
Ledyard, John O. ;
Levmore, Saul ;
Litan, Robert ;
Milgrom, Paul ;
Nelson, Forrest D. ;
Neumann, George R. ;
Ottaviani, Marco ;
Schelling, Thomas C. ;
Shiller, Robert J. ;
Smith, Vernon L. ;
Snowberg, Erik ;
Sunstein, Cass R. ;
Tetlock, Paul C. ;
Tetlock, Philip E. ;
Varian, Hal R. ;
Wolfers, Justin ;
Zitzewitz, Eric .
SCIENCE, 2008, 320 (5878) :877-878
[5]   Body Cues, Not Facial Expressions, Discriminate Between Intense Positive and Negative Emotions [J].
Aviezer, Hillel ;
Trope, Yaacov ;
Todorov, Alexander .
SCIENCE, 2012, 338 (6111) :1225-1229
[6]   Baker, M. .
NATURE, 2016, 533 :452, DOI 10.1038/533452a
[7]   Affirmative Action Policies Promote Women and Do Not Harm Efficiency in the Laboratory [J].
Balafoutas, Loukas ;
Sutter, Matthias .
SCIENCE, 2012, 335 (6068) :579-582
[8]   Raise standards for preclinical cancer research [J].
Begley, C. Glenn ;
Ellis, Lee M. .
NATURE, 2012, 483 (7391) :531-533
[9]   Redefine statistical significance [J].
Benjamin, Daniel J. ;
Berger, James O. ;
Johannesson, Magnus ;
Nosek, Brian A. ;
Wagenmakers, E. -J. ;
Berk, Richard ;
Bollen, Kenneth A. ;
Brembs, Bjoern ;
Brown, Lawrence ;
Camerer, Colin ;
Cesarini, David ;
Chambers, Christopher D. ;
Clyde, Merlise ;
Cook, Thomas D. ;
De Boeck, Paul ;
Dienes, Zoltan ;
Dreber, Anna ;
Easwaran, Kenny ;
Efferson, Charles ;
Fehr, Ernst ;
Fidler, Fiona ;
Field, Andy P. ;
Forster, Malcolm ;
George, Edward I. ;
Gonzalez, Richard ;
Goodman, Steven ;
Green, Edwin ;
Green, Donald P. ;
Greenwald, Anthony ;
Hadfield, Jarrod D. ;
Hedges, Larry V. ;
Held, Leonhard ;
Ho, Teck Hua ;
Hoijtink, Herbert ;
Hruschka, Daniel J. ;
Imai, Kosuke ;
Imbens, Guido ;
Ioannidis, John P. A. ;
Jeon, Minjeong ;
Jones, James Holland ;
Kirchler, Michael ;
Laibson, David ;
List, John ;
Little, Roderick ;
Lupia, Arthur ;
Machery, Edouard ;
Maxwell, Scott E. ;
McCarthy, Michael ;
Moore, Don ;
Morgan, Stephen L. .
NATURE HUMAN BEHAVIOUR, 2018, 2 (01) :6-10
[10]   Replication effort provokes praise-and 'bullying' charges [J].
Bohannon, John .
SCIENCE, 2014, 344 (6186) :788-789