The p-Value You Can't Buy

被引:57
作者
Demidenko, Eugene [1 ,2 ]
机构
[1] Dartmouth Coll, Dept Biomed Data Sci, Hanover, NH 03755 USA
[2] Dartmouth Coll, Dept Math, Hanover, NH 03755 USA
关键词
Discrimination error; Effect size; ROC curve; Significance testing; EFFECT SIZE; CONFIDENCE-INTERVALS; AREA;
D O I
10.1080/00031305.2015.1069760
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
There is growing frustration with the concept of the p-value. Besides having an ambiguous interpretation, the p-value can be made as small as desired by increasing the sample size, n. The p-value is outdated and does not make sense with big data: Everything becomes statistically significant. The root of the problem with the p-value is in the mean comparison. We argue that statistical uncertainty should be measured on the individual, not the group, level. Consequently, standard deviation (SD), not standard error (SE), error bars should be used to graphically present the data on two groups. We introduce a new measure based on the discrimination of individuals/objects from two groups, and call it the D-value. The D-value can be viewed as the n-of-1 p-value because it is computed in the same way as p while letting n equal 1. We show how the D-value is related to discrimination probability and the area above the receiver operating characteristic (ROC) curve. The D-value has a clear interpretation as the proportion of patients who get worse after the treatment, and as such facilitates to weigh up the likelihood of events under different scenarios.
引用
收藏
页码:33 / 38
页数:6
相关论文
共 32 条
[1]  
Adler J., 2014, REFORMATION
[2]  
Akinboro O, 2014, ANN INTERN MED, V161, P531, DOI [10.7326/L14-5019, 10.7326/M13-1921, 10.7326/L14-5019-2]
[3]   Rescuing US biomedical research from its systemic flaws [J].
Alberts, Bruce ;
Kirschner, Marc W. ;
Tilghman, Shirley ;
Varmus, Harold .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2014, 111 (16) :5773-5777
[4]   Cellular Heterogeneity: Do Differences Make a Difference? [J].
Altschuler, Steven J. ;
Wu, Lani F. .
CELL, 2010, 141 (04) :559-563
[5]  
[Anonymous], 2010, Science News, DOI [DOI 10.1002/SCIN.5591770721, 10.1002/scin.5591770721, DOI 10.1002/scin.5591770721, 10.1002/scin.5591770721C, DOI 10.1002/SCIN.5591770721C]
[6]   AREA ABOVE ORDINAL DOMINANCE GRAPH AND AREA BELOW RECEIVER OPERATING CHARACTERISTIC GRAPH [J].
BAMBER, D .
JOURNAL OF MATHEMATICAL PSYCHOLOGY, 1975, 12 (04) :387-415
[7]   The t-Test p Value and Its Relationship to the Effect Size and P(X > Y) [J].
Browne, Richard H. .
AMERICAN STATISTICIAN, 2010, 64 (01) :30-33
[8]  
COHEN J, 1994, AM PSYCHOL, V49, P997, DOI 10.1037/0003-066X.50.12.1103
[9]  
Demidenko E., 2013, Mixed models: Theory and Applications with R
[10]   Confidence intervals and bands for the binormal ROC curve revisited [J].
Demidenko, Eugene .
JOURNAL OF APPLIED STATISTICS, 2012, 39 (01) :67-79