A simulation study of a class of nonparametric test statistics: a close look of empirical distribution function-based tests

Cited by: 3
Authors
Zheng, Wenjun [1 ]
Lai, Dejian [1 ]
Gould, K. Lance [2 ]
Affiliations
[1] Univ Texas Hlth Sci Ctr Houston, Sch Publ Hlth, Dept Biostat & Data Sci, 1200 Pressler St, Houston, TX 77030 USA
[2] Univ Texas Hlth Sci Ctr Houston, Weatherhead PET Imaging Ctr, McGovern Med Sch, Houston, TX 77030 USA
Keywords
Nonparametric; Kolmogorov-Smirnov; Discontinuous; Empirical distribution function; Correlation
DOI
10.1080/03610918.2021.1874987
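Chinese Library Classification (CLC) number
O21 [Probability theory and mathematical statistics]; C8 [Statistics];
Discipline classification codes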
020208 ; 070103 ; 0714 ;
Abstract
The Kolmogorov-Smirnov (KS) statistic is a nonparametric statistic based on the empirical distribution function (EDF). In the one-sample case, it is the supremum distance between an EDF and a pre-specified cumulative distribution function (CDF). In the two-sample case, it is the maximum distance between two EDFs. The KS test, like other EDF-based tests such as the Anderson-Darling (AD) test and the Cramér-von Mises (CvM) test, has been widely used in statistical analysis. To compare the performance of these test statistics, we conducted a simulation study of the type I error and power of the KS test, the CvM test, the AD test, and the Chi-squared test. Our study covers both one-sample and two-sample tests, for both independent and correlated samples. It showed that EDF-based tests perform better when no prior information about the tested distributions is available. However, when such prior information is available, the densities of the two distributions are bell-shaped, and differences in variance or sparseness are expected, the Chi-squared test may be preferable. When the tested samples are correlated, adjusting the informative sample size is important and required.
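The one-sample and two-sample KS statistics defined in the abstract can be sketched in a few lines of Python. This is a minimal illustration, not the authors' simulation code: the function names `ks_one_sample` and `ks_two_sample` are hypothetical, and the one-sample formula assumes the reference CDF is continuous, so it does not cover the discontinuous case the keywords mention.

```python
import numpy as np


def ks_one_sample(sample, cdf):
    """Supremum distance between the sample EDF and a reference CDF.

    Assumes `cdf` is continuous, so the supremum is attained at a
    sample point, either just before or just after the EDF jump.
    """
    x = np.sort(np.asarray(sample, dtype=float))
    n = x.size
    f = np.array([cdf(v) for v in x])       # reference CDF at each order statistic
    i = np.arange(1, n + 1)
    d_plus = np.max(i / n - f)              # EDF above the CDF (after the jump)
    d_minus = np.max(f - (i - 1) / n)       # CDF above the EDF (before the jump)
    return float(max(d_plus, d_minus))


def ks_two_sample(a, b):
    """Maximum distance between the EDFs of two samples."""
    a = np.sort(np.asarray(a, dtype=float))
    b = np.sort(np.asarray(b, dtype=float))
    pooled = np.concatenate([a, b])
    # The EDF at t is the fraction of observations <= t; both EDFs only
    # change at pooled sample points, so checking those points suffices.
    edf_a = np.searchsorted(a, pooled, side="right") / a.size
    edf_b = np.searchsorted(b, pooled, side="right") / b.size
    return float(np.max(np.abs(edf_a - edf_b)))
```

For example, `ks_one_sample(data, lambda t: t)` compares `data` against the Uniform(0, 1) CDF on that interval. Turning either statistic into a test requires a null distribution for it; an established routine such as `scipy.stats.kstest` or `scipy.stats.ks_2samp` handles that step.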
Pages: 1133 / +
Number of pages: 17