P Values and Statistical Practice

被引:142
作者
Gelman, Andrew [1 ,2 ]
机构
[1] Columbia Univ, Dept Stat, New York, NY 10027 USA
[2] Columbia Univ, Dept Polit Sci, New York, NY 10027 USA
关键词
D O I
10.1097/EDE.0b013e31827886f7
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Like many Bayesians, I have often represented classical confidence intervals as posterior probability intervals and interpreted one-sided P values as the posterior probability of a positive effect. These are valid conditional on the assumed noninformative prior but typically do not make sense as unconditional probability statements. As Sander Greenland has discussed in much of his work over the years, epidemiologists and applied scientists in general have knowledge of the sizes of plausible effects and biases. I believe that a direct interpretation of P values as posterior probabilities can be a useful start-if we recognize that such summaries systematically overestimate the strength of claims from any particular dataset. In this way, I am in agreement with Greenland and Poole's interpretation of the one-sided P value as a lower bound of a posterior probability, although I am less convinced of the practical utility of this bound, given that the closeness of the bound depends on a combination of sample size and prior distribution.The default conclusion from a noninformative prior analysis will almost invariably put too much probability on extreme values. A vague prior distribution assigns much of its probability on values that are never going to be plausible, and this disturbs the posterior probabilities more than we tend to expect-something that we probably do not think about enough in our routine applications of standard statistical methods. Greenland and Poole1 perform a valuable service by opening up these calculations and placing them in an applied context. © 2012 by Lippincott William & Wilkins.
引用
收藏
页码:69 / 72
页数:4
相关论文
共 13 条
[1]  
[Anonymous], 1908, BIOMETRIKA, V6, P1
[2]   Feeling the Future: Experimental Evidence for Anomalous Retroactive Influences on Cognition and Affect [J].
Bem, Daryl J. .
JOURNAL OF PERSONALITY AND SOCIAL PSYCHOLOGY, 2011, 100 (03) :407-425
[3]   RECONCILING BAYESIAN AND FREQUENTIST EVIDENCE IN THE ONE-SIDED TESTING PROBLEM [J].
CASELLA, G ;
BERGER, RL .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1987, 82 (397) :106-111
[4]   ENHANCING DEMOCRACY THROUGH LEGISLATIVE REDISTRICTING [J].
GELMAN, A ;
KING, G .
AMERICAN POLITICAL SCIENCE REVIEW, 1994, 88 (03) :541-559
[5]  
Gelman A., 2004, CHANCE, V17, P28
[6]   The difference between "significant" and "not significant" is not itself statistically significant [J].
Gelman, Andrew ;
Stern, Hal .
AMERICAN STATISTICIAN, 2006, 60 (04) :328-331
[7]   Of Beauty, Sex and Power [J].
Gelman, Andrew ;
Weakliem, David .
AMERICAN SCIENTIST, 2009, 97 (04) :310-316
[8]   Living with P Values Resurrecting a Bayesian Perspective on Frequentist Statistics [J].
Greenland, Sander ;
Poole, Charles .
EPIDEMIOLOGY, 2013, 24 (01) :62-68
[9]   Null misinterpretation in statistical testing and its impact on health risk assessment [J].
Greenland, Sander .
PREVENTIVE MEDICINE, 2011, 53 (4-5) :225-228
[10]   Beautiful parents have more daughters: A further implication of the generalized Trivers-Willard hypothesis (gTWH) [J].
Kanazawa, Satoshi .
JOURNAL OF THEORETICAL BIOLOGY, 2007, 244 (01) :133-140