Confidence intervals for the Mann-Whitney test

Cited by: 37

Authors:

Perme, Maja Pohar [1]

Manevski, Damjan [1]

Affiliations:

[1] Univ Ljubljana, Fac Med, Inst Biostat & Med Informat, Vrazov Trg 2, Ljubljana 1000, Slovenia

Source:

STATISTICAL METHODS IN MEDICAL RESEARCH | 2019 / Vol. 28 / Issue 12

Keywords:

Mann-Whitney; confidence interval; area under ROC curve; effect size; small sample size; probabilistic index; ROC CURVE; AREA; INFERENCE;

DOI:

10.1177/0962280218814556

Chinese Library Classification (CLC) number:

R19 [Health care organization and services (health services administration)];

Subject classification number:

Abstract:

The Mann-Whitney test is a commonly used non-parametric alternative to the two-sample t-test. Despite its frequent use, it is only rarely accompanied by a confidence interval for an effect size. If reported, the effect size is usually measured by the difference in medians or the shift between the two distribution locations. Neither of these two measures directly coincides with the test statistic of the Mann-Whitney test, so the interpretation of the test results and of the confidence intervals may differ substantially. In this paper, we focus on the probability that random variable X is lower than random variable Y. This measure is often referred to as the degree of overlap or the probabilistic index; it has a one-to-one relationship with the Mann-Whitney test statistic. The measure equals the area under the ROC curve. Several methods have been proposed for constructing a confidence interval for this measure, and we review the most promising ones and explain their ideas. We study the properties of different variance estimators and the small-sample problems of confidence interval construction. We identify scenarios in which the existing approaches yield inadequate coverage probabilities. We conclude that the DeLong variance estimator is a reliable option regardless of the scenario, but confidence intervals should be constructed on the logit scale to avoid values above 1 or below 0 and the poor coverage probability that follows. A correction is needed for the case when all values from one sample are smaller than the values of the other. We propose a method that improves the coverage probability in these cases as well.
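
To make the quantities named in the abstract concrete, the following is a minimal Python sketch of the probabilistic index, a DeLong-type variance estimate, and a logit-scale confidence interval. It is an illustration under stated assumptions, not the authors' implementation: the function name `probabilistic_index_ci` and its `level` parameter are hypothetical, ties are counted as one half (a common convention the abstract does not spell out), the variance uses the standard placement-value form of the DeLong estimator, and the correction the paper proposes for completely separated samples is not reproduced here (the sketch simply raises an error in that case).

```python
import math
from statistics import NormalDist


def probabilistic_index_ci(x, y, level=0.95):
    """Estimate theta = P(X < Y) + 0.5 * P(X = Y) with a logit-scale CI.

    Hypothetical helper for illustration only. Returns (theta, variance, (lower, upper)).
    """
    m, n = len(x), len(y)
    if m < 2 or n < 2:
        raise ValueError("each sample needs at least two observations")

    def psi(a, b):
        # Pairwise kernel: 1 if a < b, 0.5 for a tie (assumed convention), else 0.
        return 1.0 if a < b else 0.5 if a == b else 0.0

    # Placement values: per-observation averages of the kernel.
    v10 = [sum(psi(a, b) for b in y) / n for a in x]
    v01 = [sum(psi(a, b) for a in x) / m for b in y]

    # Point estimate; equals the Mann-Whitney U statistic divided by m * n.
    theta = sum(v10) / m

    def sample_var(values):
        mean = sum(values) / len(values)
        return sum((v - mean) ** 2 for v in values) / (len(values) - 1)

    # DeLong-type variance estimate built from the two sets of placement values.
    variance = sample_var(v10) / m + sample_var(v01) / n

    if theta in (0.0, 1.0):
        # Complete separation: the logit is undefined. The paper proposes a
        # correction for this case; it is NOT implemented in this sketch.
        raise ValueError("samples are completely separated; logit CI undefined")

    # Delta method: se(logit(theta)) = se(theta) / (theta * (1 - theta)).
    z = NormalDist().inv_cdf(0.5 + level / 2)
    centre = math.log(theta / (1 - theta))
    half_width = z * math.sqrt(variance) / (theta * (1 - theta))

    def expit(t):
        return 1.0 / (1.0 + math.exp(-t))

    # Back-transforming keeps both limits strictly inside (0, 1).
    return theta, variance, (expit(centre - half_width), expit(centre + half_width))
```

For example, `probabilistic_index_ci([1, 2, 3, 4], [2, 3, 5, 6])` gives a point estimate of 0.75, with interval limits that stay strictly between 0 and 1 because they are back-transformed from the logit scale.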

Pages: 3755-3768

Number of pages: 14
