A probability-based measure of effect size: Robustness to base rates and other factors

被引:287
作者
Ruscio, John [1 ]
机构
[1] Coll New Jersey, Dept Psychiat, Ewing, NJ 08628 USA
关键词
effect size; nonparametric statistics; base rates; homogeneity of variance; independent groups;
D O I
10.1037/1082-989X.13.1.19
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
Calculating and reporting appropriate measures of effect size are becoming standard practice in psychological research. One of the most common scenarios encountered involves the comparison of 2 groups, which includes research designs that are experimental (e.g., random assignment to treatment vs. placebo conditions) and nonexperimental (e.g., testing for gender differences). Familiar measures such as the standardized mean difference (d) or the point-biserial correlation (r(pb)) characterize the magnitude of the difference between groups, but these effect size measures are sensitive to a number of additional influences. For example, R. E. McGrath and G. J. Meyer (2006) showed that r(pb) is sensitive to sample base rates, and extending their analysis to situations of unequal variances reveals that d is, too. The probability-based measure A, the nonparametric generalization of what K. O. McGraw and S. P. Wong (1992) called the common language effect size statistic, is insensitive to base rates and more robust to several other factors (e.g., extreme scores, nonlinear transformations). In addition to its excellent generalizability across contexts, A is easy to understand and can be obtained from standard computer output or through simple hand calculations.
引用
收藏
页码:19 / 30
页数:12
相关论文
共 30 条
[1]   An alternative to Cohen's standardized mean difference effect size: A robust parameter and confidence interval in the two independent groups case [J].
Algina, J ;
Keselman, HJ ;
Penfield, RD .
PSYCHOLOGICAL METHODS, 2005, 10 (03) :317-328
[2]  
APA, 2001, Publication manual of the American Psychological Association, V5th, DOI DOI 10.1037/0000165-000
[3]   Screening for obsessive and compulsive symptoms: Validation of the Clark-Beck Obsessive-Compulsive Inventory [J].
Clark, DA ;
Antony, MM ;
Beck, AT ;
Swinson, RP ;
Steer, RA .
PSYCHOLOGICAL ASSESSMENT, 2005, 17 (02) :132-143
[4]   DOMINANCE STATISTICS - ORDINAL ANALYSES TO ANSWER ORDINAL QUESTIONS [J].
CLIFF, N .
PSYCHOLOGICAL BULLETIN, 1993, 114 (03) :494-509
[5]   THE COST OF DICHOTOMIZATION [J].
COHEN, J .
APPLIED PSYCHOLOGICAL MEASUREMENT, 1983, 7 (03) :249-253
[6]  
Cohen J., 1988, POWERSTATISTICALSCIE, DOI 10.4324/9780203771587
[7]   Comparing several robust tests of stochastic equality with ordinally scaled variables and small to moderate sized samples [J].
Delaney, HD ;
Vargha, A .
PSYCHOLOGICAL METHODS, 2002, 7 (04) :485-503
[8]   ROBUST RANK PROCEDURES FOR THE BEHRENS-FISHER PROBLEM [J].
FLIGNER, MA ;
POLICELLO, GE .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1981, 76 (373) :162-168
[9]  
FREEDMAN D, 1993, STATISTICS
[10]   Review of assumptions and problems in the appropriate conceptualization of effect size [J].
Grissom, RJ ;
Kim, JJ .
PSYCHOLOGICAL METHODS, 2001, 6 (02) :135-146