Testing Significance Testing

被引:5
作者
Krueger, Joachim, I [1 ]
Heck, Patrick R. [2 ]
机构
[1] Brown Univ, Dept Cognit Linguist & Psychol Sci, Providence, RI 02912 USA
[2] Geisinger Hlth Syst, Danville, PA 17822 USA
关键词
statistical significance testing; null hypotheses; Bayes' Theorem; NHST; p values; P-VALUES; STATISTICAL SIGNIFICANCE; CONFIDENCE-INTERVALS; PSYCHOLOGY; REPLICATION; SCIENCE; FUTURE; POWER; WILL;
D O I
10.1525/collabra.108
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
The practice of Significance Testing (ST) remains widespread in psychological science despite continual criticism of its flaws and abuses. Using simulation experiments, we address four concerns about ST and for two of these we compare ST's performance with prominent alternatives. We find the following: First, the p values delivered by ST predict the posterior probability of the tested hypothesis well under many research conditions. Second, low p values support inductive inferences because they are most likely to occur when the tested hypothesis is false. Third, p values track likelihood ratios without raising the uncertainties of relative inference. Fourth, p values predict the replicability of research findings better than confidence intervals do. Given these results, we conclude that p values may be used judiciously as a heuristic tool for inductive inference. Yet, p values cannot bear the full burden of inference. We encourage researchers to be flexible in their selection and use of statistical methods.
引用
收藏
页数:13
相关论文
共 70 条
[1]   Estimating the reproducibility of psychological science [J].
Aarts, Alexander A. ;
Anderson, Joanna E. ;
Anderson, Christopher J. ;
Attridge, Peter R. ;
Attwood, Angela ;
Axt, Jordan ;
Babel, Molly ;
Bahnik, Stepan ;
Baranski, Erica ;
Barnett-Cowan, Michael ;
Bartmess, Elizabeth ;
Beer, Jennifer ;
Bell, Raoul ;
Bentley, Heather ;
Beyan, Leah ;
Binion, Grace ;
Borsboom, Denny ;
Bosch, Annick ;
Bosco, Frank A. ;
Bowman, Sara D. ;
Brandt, Mark J. ;
Braswell, Erin ;
Brohmer, Hilmar ;
Brown, Benjamin T. ;
Brown, Kristina ;
Bruening, Jovita ;
Calhoun-Sauls, Ann ;
Callahan, Shannon P. ;
Chagnon, Elizabeth ;
Chandler, Jesse ;
Chartier, Christopher R. ;
Cheung, Felix ;
Christopherson, Cody D. ;
Cillessen, Linda ;
Clay, Russ ;
Cleary, Hayley ;
Cloud, Mark D. ;
Cohn, Michael ;
Cohoon, Johanna ;
Columbus, Simon ;
Cordes, Andreas ;
Costantini, Giulio ;
Alvarez, Leslie D. Cramblet ;
Cremata, Ed ;
Crusius, Jan ;
DeCoster, Jamie ;
DeGaetano, Michelle A. ;
Della Penna, Nicolas ;
den Bezemer, Bobby ;
Deserno, Marie K. .
SCIENCE, 2015, 349 (6251)
[2]  
Abelson R.P., 1995, Statistics as principled argument, V1st, DOI 10.4324/9781410601155
[3]   The earth is flat (p > 0.05): significance thresholds and the crisis of unreplicable research [J].
Amrhein, Valentin ;
Korner-Nievergelt, Franzi ;
Roth, Tobias .
PEERJ, 2017, 5
[4]  
[Anonymous], ARXIV170907588
[5]  
[Anonymous], 2017, PSYCHOL SCI SCRUTINY
[6]   Rejection odds and rejection ratios: A proposal for statistical practice in testing hypotheses [J].
Bayarri, M. J. ;
Benjamin, Daniel J. ;
Berger, James O. ;
Sellke, Thomas M. .
JOURNAL OF MATHEMATICAL PSYCHOLOGY, 2016, 72 :90-103
[7]   Feeling the Future: Experimental Evidence for Anomalous Retroactive Influences on Cognition and Affect [J].
Bem, Daryl J. .
JOURNAL OF PERSONALITY AND SOCIAL PSYCHOLOGY, 2011, 100 (03) :407-425
[8]   Redefine statistical significance [J].
Benjamin, Daniel J. ;
Berger, James O. ;
Johannesson, Magnus ;
Nosek, Brian A. ;
Wagenmakers, E. -J. ;
Berk, Richard ;
Bollen, Kenneth A. ;
Brembs, Bjoern ;
Brown, Lawrence ;
Camerer, Colin ;
Cesarini, David ;
Chambers, Christopher D. ;
Clyde, Merlise ;
Cook, Thomas D. ;
De Boeck, Paul ;
Dienes, Zoltan ;
Dreber, Anna ;
Easwaran, Kenny ;
Efferson, Charles ;
Fehr, Ernst ;
Fidler, Fiona ;
Field, Andy P. ;
Forster, Malcolm ;
George, Edward I. ;
Gonzalez, Richard ;
Goodman, Steven ;
Green, Edwin ;
Green, Donald P. ;
Greenwald, Anthony ;
Hadfield, Jarrod D. ;
Hedges, Larry V. ;
Held, Leonhard ;
Ho, Teck Hua ;
Hoijtink, Herbert ;
Hruschka, Daniel J. ;
Imai, Kosuke ;
Imbens, Guido ;
Ioannidis, John P. A. ;
Jeon, Minjeong ;
Jones, James Holland ;
Kirchler, Michael ;
Laibson, David ;
List, John ;
Little, Roderick ;
Lupia, Arthur ;
Machery, Edouard ;
Maxwell, Scott E. ;
McCarthy, Michael ;
Moore, Don ;
Morgan, Stephen L. .
NATURE HUMAN BEHAVIOUR, 2018, 2 (01) :6-10
[9]   EXPOSITION OF A NEW THEORY ON THE MEASUREMENT OF RISK [J].
Bernoulli, Daniel .
ECONOMETRICA, 1954, 22 (01) :23-36
[10]  
Brewer M.B., 2007, Social psychology: Handbook of basic principles, P695