Performance of five two-sample location tests for skewed distributions with unequal variances

被引:142
作者
Fagerland, Morten W. [1 ]
Sandvik, Leiv [1 ]
机构
[1] Oslo Univ Hosp, Ulleval Dept Res Adm, N-0407 Oslo, Norway
关键词
Two-sample location problem; T test; Welch test; Wilcoxon-Mann-Whitney test; Yuen-Welch test; Brunner-Munzel test; Robustness; Skewness; Heteroscedasticity; T-TEST; ROBUSTNESS; STATISTICS;
D O I
10.1016/j.cct.2009.06.007
中图分类号
R-3 [医学研究方法]; R3 [基础医学];
学科分类号
1001 ;
摘要
Tests for comparing the locations of two independent populations are associated with different null hypotheses, but results are often interpreted as evidence for or against equality of means or medians. We examine the appropriateness of this practice by investigating the performance of five frequently used tests: the two-sample T test, the Welch U test, the Yuen-Welch test, the Wilcoxon-Mann-Whitney test, and the Brunner-Munzel test. Under combined violations of normality and variance homogeneity, the true significance level and power of the tests depend on a complex interplay of several factors. In a wide ranging simulation study, we consider scenarios differing in skewness, skewness heterogeneity, variance heterogeneity, sample size, and sample size ratio. We find that small differences in distribution properties can alter test performance markedly, thus confounding the effort to present simple test recommendations. Instead, we provide detailed recommendations in Appendix A. The Welch U test is recommended most frequently, but cannot be considered an omnibus test for this problem. (C) 2009 Elsevier Inc. All rights reserved.
引用
收藏
页码:490 / 496
页数:7
相关论文
共 28 条
[1]  
[Anonymous], R Project for Statistical Computing (Version 3.0.2)
[2]   WELCH APPROXIMATE SOLUTION FOR THE BEHRENS-FISHER PROBLEM [J].
BEST, DJ ;
RAYNER, JCW .
TECHNOMETRICS, 1987, 29 (02) :205-210
[3]   ROBUSTNESS [J].
BRADLEY, JV .
BRITISH JOURNAL OF MATHEMATICAL & STATISTICAL PSYCHOLOGY, 1978, 31 (NOV) :144-152
[4]   Increasing physicians' awareness of the impact of statistics on research outcomes: Comparative power of the t-test and Wilcoxon rank-sum test in small samples applied research [J].
Bridge, PD ;
Sawilowsky, SS .
JOURNAL OF CLINICAL EPIDEMIOLOGY, 1999, 52 (03) :229-235
[5]  
Brunner E, 2000, BIOMETRICAL J, V42, P17, DOI 10.1002/(SICI)1521-4036(200001)42:1<17::AID-BIMJ17>3.0.CO
[6]  
2-U
[7]   DOMINANCE STATISTICS - ORDINAL ANALYSES TO ANSWER ORDINAL QUESTIONS [J].
CLIFF, N .
PSYCHOLOGICAL BULLETIN, 1993, 114 (03) :494-509
[8]   HOW TO USE THE 2 SAMPLE TERT-TEST [J].
CRESSIE, NAC ;
WHITFORD, HJ .
BIOMETRICAL JOURNAL, 1986, 28 (02) :131-148
[9]   Conventional-dose hormone therapy (HT) and tibolone, but not low-dose HT and raloxifene, increase markers of activated coagulation [J].
Eilertsen, A. L. ;
Qvigstad, E. ;
Andersen, T. O. ;
Sandvik, L. ;
Sandset, P. M. .
MATURITAS, 2006, 55 (03) :278-287
[10]  
Evans M., 2000, STAT DISTRIBUTIONS