Robust misinterpretation of confidence intervals

被引：0

作者：

Rink Hoekstra

Richard D. Morey

Jeffrey N. Rouder

Eric-Jan Wagenmakers

机构：

[1] University of Groningen,

[2] University of Missouri,undefined

来源：

Psychonomic Bulletin & Review | 2014年 / 21卷

关键词：

Confidence intervals; Significance testing; Inference;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Null hypothesis significance testing (NHST) is undoubtedly the most common inferential technique used to justify claims in the social sciences. However, even staunch defenders of NHST agree that its outcomes are often misinterpreted. Confidence intervals (CIs) have frequently been proposed as a more useful alternative to NHST, and their use is strongly encouraged in the APA Manual. Nevertheless, little is known about how researchers interpret CIs. In this study, 120 researchers and 442 students—all in the field of psychology—were asked to assess the truth value of six particular statements involving different interpretations of a CI. Although all six statements were false, both researchers and students endorsed, on average, more than three statements, indicating a gross misunderstanding of CIs. Self-declared experience with statistics was not related to researchers’ performance, and, even more surprisingly, researchers hardly outperformed the students, even though the students had not received any education on statistical inference whatsoever. Our findings suggest that many researchers do not know the correct interpretation of a CI. The misunderstandings surrounding p-values and CIs are particularly unfortunate because they constitute the main tools by which psychologists draw conclusions from data.

引用

页码：1157 / 1164

页数：7

共 72 条

[1]

Belia S(2005)Researchers misunderstand confidence intervals and standard error bars Psychological Methods 10 389-396

[2]

Fidler F(2006)The case for objective Bayesian analysis Bayesian Analysis 1 385-402

[3]

Williams J(1942)Tests of significance considered as evidence Journal of the American Statistical Association 37 325-335

[4]

Cumming G(2000)Paradoxes and improvements in interval estimation The American Statistician 54 242-247

[5]

Berger JO(1998)A précis of “Statistical significance: Rationale, validity and utility Behavioral and Brain Sciences 21 169-194

[6]

Berkson J(1994)The earth is round (p <.05) American Psychologist 49 997-1003

[7]

Blaker H(1997)On the logic and purpose of significance testing Psychological Methods 2 161-172

[8]

Spjøtvoll E(2001)A primer on the understanding, use, and calculation of confidence intervals that are based on central and non-central distributions Educational and Psychological Measurement 61 532-574

[9]

Chow SL(2000)Multiple comparisons: Philosophies and illustrations American Journal of Physiology - Regulatory, Integrative and Comparative Physiology 279 R1-R8

[10]

Cohen J(2011)Bayesian versus orthodox statistics: Which side are you on? Perspectives on Psychological Science 6 274-290

← 1 2 3 4 5 6 7 8 →