Testing the Mean Matrix in High-Dimensional Transposable Data

被引:10
作者
Touloumis, Anestis [1 ]
Tavare, Simon [1 ]
Marioni, John C. [2 ]
机构
[1] Univ Cambridge, Canc Res UK Cambridge Inst, Cambridge CB2 0RE, England
[2] EMBL European Bioinformat Inst, Hinxton CB10 1SD, England
关键词
High-dimensional transposable data; Hypothesis testing; Mean matrix; Nonparametric test; COVARIANCE; SELECTION; MODELS; SET;
D O I
10.1111/biom.12257
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The structural information in high-dimensional transposable data allows us to write the data recorded for each subject in a matrix such that both the rows and the columns correspond to variables of interest. One important problem is to test the null hypothesis that the mean matrix has a particular structure without ignoring the dependence structure among and/or between the row and column variables. To address this, we develop a generic and computationally inexpensive nonparametric testing procedure to assess the hypothesis that, in each predefined subset of columns (rows), the column (row) mean vector remains constant. In simulation studies, the proposed testing procedure seems to have good performance and, unlike simple practical approaches, it preserves the nominal size and remains powerful even if the row and/or column variables are not independent. Finally, we illustrate the use of the proposed methodology via two empirical examples from gene expression microarrays.
引用
收藏
页码:157 / 166
页数:10
相关论文
共 27 条
[1]  
Allen GI, 2012, J ROY STAT SOC B, V74, P721, DOI 10.1111/j.1467-9868.2011.01027.x
[2]   TRANSPOSABLE REGULARIZED COVARIANCE MODELS WITH AN APPLICATION TO MISSING DATA IMPUTATION [J].
Allen, Genevera I. ;
Tibshirani, Robert .
ANNALS OF APPLIED STATISTICS, 2010, 4 (02) :764-790
[3]  
Bai ZD, 1996, STAT SINICA, V6, P311
[4]   Dynamic Matrix-Variate Graphical Models [J].
Carvalho, Carlos M. ;
West, Mike .
BAYESIAN ANALYSIS, 2007, 2 (01) :69-97
[5]   Tests for High-Dimensional Covariance Matrices [J].
Chen, Song Xi ;
Zhang, Li-Xin ;
Zhong, Ping-Shou .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2010, 105 (490) :810-819
[6]   A TWO-SAMPLE TEST FOR HIGH-DIMENSIONAL DATA WITH APPLICATIONS TO GENE-SET TESTING [J].
Chen, Song Xi ;
Qin, Ying-Li .
ANNALS OF STATISTICS, 2010, 38 (02) :808-835
[7]   Molecular Pathways: Current Role and Future Directions of the Retinoic Acid Pathway in Cancer Prevention and Treatment [J].
Connolly, Roisin M. ;
Nguyen, Nguyen K. ;
Sukumar, Saraswati .
CLINICAL CANCER RESEARCH, 2013, 19 (07) :1651-1659
[8]  
DAWID AP, 1981, BIOMETRIKA, V68, P265, DOI 10.1093/biomet/68.1.265
[9]   Evidence Based Selection of Housekeeping Genes [J].
de Jonge, Hendrik J. M. ;
Fehrmann, Rudolf S. N. ;
de Bont, Eveline S. J. M. ;
Hofstra, Robert M. W. ;
Gerbens, Frans ;
Kamps, Willem A. ;
de Vries, Elisabeth G. E. ;
van der Zee, Ate G. J. ;
Meerman, Gerard J. te ;
ter Elst, Arja .
PLOS ONE, 2007, 2 (09)
[10]   ARE A SET OF MICROARRAYS INDEPENDENT OF EACH OTHER? [J].
Efron, Bradley .
ANNALS OF APPLIED STATISTICS, 2009, 3 (03) :922-942