When Your Permutation Test is Doomed to Fail

被引:8
作者
Christensen, William F. [1 ]
Zabriskie, Brinley N. [1 ]
机构
[1] Brigham Young Univ, Dept Stat, Provo, UT 84602 USA
关键词
Asymmetric permutation distribution; Distribution free; Nonparametric; Power; Randomization test; Skewness; Test for location; Two-sided p-value; Unbalanced data;
D O I
10.1080/00031305.2021.1902856
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
A two-tailed test comparing the means of two independent populations is perhaps the most commonly used hypothesis test in quantitative research, featured centrally in medical research, A/B testing, and throughout the sciences. When data are skewed, the standard two-tailed t test is not appropriate and the permutation test comparing the two means (or medians) has been a widely recommended alternative, with statistical authors and statistical software packages touting the permutation test's utility, particularly for small samples. In this presentation, we illustrate that when the two samples are skewed and the sample sizes are unequal, the two-tailed permutation test (as traditionally implemented) can in some cases have power equal to zero, even when the k highest values in the combined data are all found in the group with k observations. Further, in many cases the standard permutation test exhibits decreasing power as the total sample size increases! We illustrate the causes of these perverse properties via both simulation and real-world examples, and we recommend approaches for ameliorating or avoiding these potential problems.
引用
收藏
页码:53 / 63
页数:11
相关论文
共 52 条
[1]   A permutation approach for ranking of multivariate populations [J].
Arboretti, Rosa ;
Bonnini, Stefano ;
Corain, Livio ;
Salmaso, Luigi .
JOURNAL OF MULTIVARIATE ANALYSIS, 2014, 132 :39-57
[2]  
Basso D., 2009, Permutation Tests for Stochastic Ordering and ANOVA
[3]   A permutation test for umbrella alternatives [J].
Basso, Dario ;
Salmaso, Luigi .
STATISTICS AND COMPUTING, 2011, 21 (01) :45-54
[4]   AN ANALYSIS OF TRANSFORMATIONS [J].
BOX, GEP ;
COX, DR .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1964, 26 (02) :211-252
[5]  
Chihara L., 2011, Mathematical Statistics with Resampling and R, DOI DOI 10.1002/9781119505969
[6]  
de Winter J. C. F., 2013, Practical Assessment, Research Evaluation, V18, DOI [https://doi.org/10.7275/E4R6-DJ05, DOI 10.7275/E4R6-DJ05, 10.7275/E4R6-DJ05]
[7]  
Dibble W.J., 2009, Draugh Surveys: A guide to good practice
[8]  
Dubey S.D., 1991, J BIOPHARM STAT, V1, DOI 10.1080/10543409108835011
[9]  
Experian, Data as a Force for Good
[10]  
Fisher L.D., 1991, J BIOPHARM STAT, V1, DOI 10.1080/10543409108835012