A new test for sphericity of the covariance matrix for high dimensional data

被引:47
作者
Fisher, Thomas J. [1 ]
Sun, Xiaogian [2 ]
Gallagher, Colin M. [2 ]
机构
[1] Univ Missouri, Dept Math & Stat, Kansas City, MO 64110 USA
[2] Clemson Univ, Dept Math Sci, Clemson, SC 29634 USA
关键词
Covariance matrix; Hypothesis testing; High-dimensional data analysis; GENE-EXPRESSION DATA; CLASSIFICATION; DISCRIMINATION; CONVERGENCE; EIGENVALUES; EQUALITY; CRITERIA; TUMOR;
D O I
10.1016/j.jmva.2010.07.004
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In this paper we propose a new test procedure for sphericity of the covariance matrix when the dimensionality, p, exceeds that of the sample size, N = n + 1. Under the assumptions that (A) 0 < tr Sigma(i)/p < infinity as p -> infinity for i = 1,..., 16 and (B) p/n -> c < infinity known as the concentration, a new statistic is developed utilizing the ratio of the fourth and second arithmetic means of the eigenvalues of the sample covariance matrix. The newly defined test has many desirable general asymptotic properties, such as normality and consistency when (n, p) -> infinity. Our simulation results show that the new test is comparable to, and in some cases more powerful than, the tests for sphericity in the current literature. (c) 2010 Elsevier Inc. All rights reserved.
引用
收藏
页码:2554 / 2570
页数:17
相关论文
共 24 条
[1]   Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays [J].
Alon, U ;
Barkai, N ;
Notterman, DA ;
Gish, K ;
Ybarra, S ;
Mack, D ;
Levine, AJ .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1999, 96 (12) :6745-6750
[2]  
[Anonymous], 1996, GRADUATE TEXTS MATH
[3]  
Arharov L. V., 1971, Dokl. Akad. Nauk SSSR, V199, P994
[4]   ON RATES OF CONVERGENCE OF EFFICIENT DETECTION CRITERIA IN SIGNAL-PROCESSING WITH WHITE NOISE [J].
BAI, ZD ;
KRISHNAIAH, PR ;
ZHAO, LC .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1989, 35 (02) :380-388
[6]   Boosting for tumor classification with gene expression data [J].
Dettling, M ;
Bühlmann, P .
BIOINFORMATICS, 2003, 19 (09) :1061-1069
[7]   Comparison of discrimination methods for the classification of tumors using gene expression data [J].
Dudoit, S ;
Fridlyand, J ;
Speed, TP .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2002, 97 (457) :77-87
[8]  
Durrett R., 1996, PROBABILITY THEORY E
[9]  
FISHER TJ, 2009, THESIS CLEMSON U
[10]   Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring [J].
Golub, TR ;
Slonim, DK ;
Tamayo, P ;
Huard, C ;
Gaasenbeek, M ;
Mesirov, JP ;
Coller, H ;
Loh, ML ;
Downing, JR ;
Caligiuri, MA ;
Bloomfield, CD ;
Lander, ES .
SCIENCE, 1999, 286 (5439) :531-537