Joint test for homogeneity of high-dimensional means and covariance matrices using maximum-type statistics

被引:0
作者
Miao, Runsheng [1 ]
Xu, Kai [2 ]
机构
[1] Univ Shanghai Sci & Technol, Coll Commun & Art Design, Shanghai, Peoples R China
[2] Anhui Normal Univ, Sch Math & Stat, Wuhu, Peoples R China
基金
中国国家自然科学基金;
关键词
Extreme value distribution; High dimension; Maximum-type statistic; Multiplier bootstrap; Simultaneous test; 2-SAMPLE TEST; CLASSIFICATION; BOOTSTRAP; VECTOR;
D O I
10.1080/03610918.2022.2037641
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Several tests for simultaneously checking homogeneity of means and covariance matrices, based on the classical likelihood ratio and sum-of-squares type test statistics, have been proposed in the literature. Despite their usefulness, they tend to have unsatisfactory size performance for either nonnormal high-dimensional data or strongly spiked eigenvalue models. This article proposes a novel high-dimensional nonparametric test using the maximum-type statistics. The limiting null distribution of the proposed test statistic is derived to provide p-values. It turns out that under appropriate conditions, the test is particularly powerful against sparse alternatives. Since the unknown correlations among the data sometimes pose a great challenge toward the accurate p-value calculation of the test, we further use a parametric bootstrap technique to achieve size accuracy. We also prove that the proposed simulation-based testing procedure is asymptotically valid in terms of size and power even if the dimension of the data is much larger than the sample size. We demonstrate the effectiveness of our methods through extensive simulation studies and a real data application.
引用
收藏
页码:972 / 992
页数:21
相关论文
共 24 条
[1]  
Anderson TW., 2003, An Introduction to Multivariate Statistical Analysis, V3
[2]  
[Anonymous], 1983, Extremes and related properties of random sequences and processes
[3]   TWO-SAMPLE TESTS FOR HIGH-DIMENSION, STRONGLY SPIKED EIGENVALUE MODELS [J].
Aoshima, Makoto ;
Yata, Kazuyoshi .
STATISTICA SINICA, 2018, 28 (01) :43-62
[4]   A nonparametric two-sample test applicable to high dimensional data [J].
Biswas, Munmun ;
Ghosh, Anil K. .
JOURNAL OF MULTIVARIATE ANALYSIS, 2014, 123 :160-171
[5]   Two-sample test of high dimensional means under dependence [J].
Cai, T. Tony ;
Liu, Weidong ;
Xia, Yin .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2014, 76 (02) :349-372
[6]   Two-Sample Covariance Matrix Testing and Support Recovery in High-Dimensional and Sparse Settings [J].
Cai, Tony ;
Liu, Weidong ;
Xia, Yin .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2013, 108 (501) :265-277
[7]   Simulation-based hypothesis testing of high dimensional means under covariance heterogeneity [J].
Chang, Jinyuan ;
Zheng, Chao ;
Zhou, Wen-Xin ;
Zhou, Wen .
BIOMETRICS, 2017, 73 (04) :1300-1310
[8]   Comparing Large Covariance Matrices under Weak Conditions on the Dependence Structure and Its Application to Gene Clustering [J].
Chang, Jinyuan ;
Zhou, Wen ;
Zhou, Wen-Xin ;
Wang, Lan .
BIOMETRICS, 2017, 73 (01) :31-41
[9]   A TWO-SAMPLE TEST FOR HIGH-DIMENSIONAL DATA WITH APPLICATIONS TO GENE-SET TESTING [J].
Chen, Song Xi ;
Qin, Ying-Li .
ANNALS OF STATISTICS, 2010, 38 (02) :808-835
[10]   CENTRAL LIMIT THEOREMS AND BOOTSTRAP IN HIGH DIMENSIONS [J].
Chernozhukov, Victor ;
Chetverikov, Denis ;
Kato, Kengo .
ANNALS OF PROBABILITY, 2017, 45 (04) :2309-2352