A PAIRWISE HOTELLING METHOD FOR TESTING HIGH-DIMENSIONAL MEAN VECTORS

被引:0
作者
Hu, Zongliang [1 ]
Tong, Tiejun [2 ]
Genton, Marc G. [3 ]
机构
[1] Shenzhen Univ, Sch Math Sci, Shenzhen, Peoples R China
[2] Hong Kong Baptist Univ, Dept Math, Hong Kong, Peoples R China
[3] King Abdullah Univ Sci & Technol, Stat Program, Thuwal, Saudi Arabia
基金
中国国家自然科学基金;
关键词
High-dimensional data; Hotelling's test; pairwise correla- tion; screening; statistical power; type-I error rate; SMALL SAMPLE-SIZE; 2-SAMPLE TEST; T-2; TEST; FEWER OBSERVATIONS; EIGENVALUE; EQUALITY;
D O I
10.5705/ss.202021.0369
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
For high-dimensional data with a small sample size, we cannot use Hotelling's T2 test to test the mean vectors because of the singularity problem in the sample covariance matrix. To overcome this problem, there are three main approaches but each has limitations and only works well in certain situations. Inspired by this, we propose a pairwise Hotelling method for testing highdimensional mean vectors that provides a good balance between the existing approaches. To use the correlation information efficiently, we construct the new test statistics as the sum of Hotelling's test statistics for the covariate pairs with strong correlations and the squared t-statistics for the individual covariates that have little correlation with others. We further derive the asymptotic null distributions and power functions for the proposed tests under some regularity conditions. Numerical results show that our tests are able to control the type-I error rates and achieve a higher statistical power than that of existing methods, especially when the covariates are highly correlated. Two real-data examples are used to demonstrate the efficacy of our pairwise Hotelling's tests.
引用
收藏
页码:229 / 256
页数:28
相关论文
共 44 条
[1]  
Ahmad MR, 2014, ANN I STAT MATH, V66, P33, DOI 10.1007/s10463-013-0404-2
[2]  
Anderson T., 2003, An introduction to multivariate statistical analysis
[3]   TWO-SAMPLE TESTS FOR HIGH-DIMENSION, STRONGLY SPIKED EIGENVALUE MODELS [J].
Aoshima, Makoto ;
Yata, Kazuyoshi .
STATISTICA SINICA, 2018, 28 (01) :43-62
[4]   Mean vector testing for high-dimensional dependent observations [J].
Ayyala, Deepak Nag ;
Park, Junyong ;
Roy, Anindya .
JOURNAL OF MULTIVARIATE ANALYSIS, 2017, 153 :136-155
[5]  
Bai ZD, 1996, STAT SINICA, V6, P311
[6]   Some theory for Fisher's linear discriminant function, 'naive Bayes', and some alternatives when there are many more variables than observations [J].
Bickel, PJ ;
Levina, E .
BERNOULLI, 2004, 10 (06) :989-1010
[7]   Two-sample test of high dimensional means under dependence [J].
Cai, T. Tony ;
Liu, Weidong ;
Xia, Yin .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2014, 76 (02) :349-372
[8]   Two-sample tests of high-dimensional means for compositional data [J].
Cao, Yuanpei ;
Lin, Wei ;
Li, Hongzhe .
BIOMETRIKA, 2018, 105 (01) :115-132
[9]   A Regularized Hotelling's T2 Test for Pathway Analysis in Proteomic Studies [J].
Chen, Lin S. ;
Paul, Debashis ;
Prentice, Ross L. ;
Wang, Pei .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2011, 106 (496) :1345-1360
[10]   TWO-SAMPLE AND ANOVA TESTS FOR HIGH DIMENSIONAL MEANS [J].
Chen, Song Xi ;
Li, Jun ;
Zhong, Ping-Shou .
ANNALS OF STATISTICS, 2019, 47 (03) :1443-1474