HYPOTHESIS TESTING FOR THE COVARIANCE MATRIX IN HIGH-DIMENSIONAL TRANSPOSABLE DATA WITH KRONECKER PRODUCT DEPENDENCE STRUCTURE

被引:2
作者
Touloumis, Anestis [1 ]
Marioni, John C. [2 ,3 ]
Tavare, Simon [4 ]
机构
[1] Univ Brighton, Ctr Secure Intelligent & Usable Syst, Brighton BN2 4GJ, E Sussex, England
[2] Univ Cambridge, Cambridge CB2 0RE, England
[3] EMBL EBI Wellcome Genome Campus, Hinxton CB10 1SD, England
[4] Columbia Univ, New York, NY 10027 USA
关键词
Covariance matrix; high-dimensional settings; hypothesis testing; matrix-valued random variables; transposable data; MODELS; SET;
D O I
10.5705/ss.202018.0268
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The matrix-variate normal distribution is a popular model for high-dimensional transposable data because it decomposes the dependence structure of the random matrix into the Kronecker product of two covariance matrices, one for each of the row and column variables. However, few hypothesis testing procedures exist for these covariance matrices in high-dimensional settings. Therefore, we propose tests that assess the sphericity, identity, and diagonality hypotheses for the row (column) covariance matrix in a high-dimensional setting, while treating the column (row) dependence structure as a "nuisance" parameter. The proposed tests are robust to normality departures, provided that the Kronecker product dependence structure holds. In simulations, the proposed tests appear to maintain the nominal level, and tend to be powerful against the alternative hypotheses tested. The utility of the proposed tests is demonstrated by analyzing a microarray and an electroencephalogram study. The proposed testing methodology is implemented in the R package HDTD.
引用
收藏
页码:1309 / 1329
页数:21
相关论文
共 34 条
[1]   Tests of Covariance Matrices for High Dimensional Multivariate Data Under Non Normality [J].
Ahmad, M. Rauf ;
Von Rosen, Dietrich .
COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2015, 44 (07) :1387-1398
[2]  
Allen GI, 2012, J ROY STAT SOC B, V74, P721, DOI 10.1111/j.1467-9868.2011.01027.x
[3]   TRANSPOSABLE REGULARIZED COVARIANCE MODELS WITH AN APPLICATION TO MISSING DATA IMPUTATION [J].
Allen, Genevera I. ;
Tibshirani, Robert .
ANNALS OF APPLIED STATISTICS, 2010, 4 (02) :764-790
[4]  
[Anonymous], 2010, LECT NOTES STAT, DOI DOI 10.1007/978-1-4419-0468-3
[5]   TESTS FOR SEPARABILITY IN NONPARAMETRIC COVARIANCE OPERATORS OF RANDOM SURFACES [J].
Aston, John A. D. ;
Pigoli, Davide ;
Tavakoli, Shahin .
ANNALS OF STATISTICS, 2017, 45 (04) :1431-1461
[6]   Dynamic Matrix-Variate Graphical Models [J].
Carvalho, Carlos M. ;
West, Mike .
BAYESIAN ANALYSIS, 2007, 2 (01) :69-97
[7]   Tests for High-Dimensional Covariance Matrices [J].
Chen, Song Xi ;
Zhang, Li-Xin ;
Zhong, Ping-Shou .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2010, 105 (490) :810-819
[8]   A TWO-SAMPLE TEST FOR HIGH-DIMENSIONAL DATA WITH APPLICATIONS TO GENE-SET TESTING [J].
Chen, Song Xi ;
Qin, Ying-Li .
ANNALS OF STATISTICS, 2010, 38 (02) :808-835
[9]  
DAWID AP, 1981, BIOMETRIKA, V68, P265, DOI 10.1093/biomet/68.1.265
[10]   ARE A SET OF MICROARRAYS INDEPENDENT OF EACH OTHER? [J].
Efron, Bradley .
ANNALS OF APPLIED STATISTICS, 2009, 3 (03) :922-942