LARGEST ENTRIES OF SAMPLE CORRELATION MATRICES FROM EQUI-CORRELATED NORMAL POPULATIONS
被引:16
|
作者:
Fan, Jianqing
论文数: 0引用数: 0
h-index: 0
机构:
Princeton Univ, Dept Operat Res & Financial Engn, 205 Sherred Hall, Princeton, NJ 08544 USAPrinceton Univ, Dept Operat Res & Financial Engn, 205 Sherred Hall, Princeton, NJ 08544 USA
Fan, Jianqing
[1
]
Jiang, Tiefeng
论文数: 0引用数: 0
h-index: 0
机构:
Univ Minnesota, Sch Stat, 313 Ford Hall,224 Church St SE, Minneapolis, MN 55455 USAPrinceton Univ, Dept Operat Res & Financial Engn, 205 Sherred Hall, Princeton, NJ 08544 USA
Jiang, Tiefeng
[2
]
机构:
[1] Princeton Univ, Dept Operat Res & Financial Engn, 205 Sherred Hall, Princeton, NJ 08544 USA
[2] Univ Minnesota, Sch Stat, 313 Ford Hall,224 Church St SE, Minneapolis, MN 55455 USA
Maximum sample correlation;
phase transition;
multivariate normal distribution;
Gumbel distribution;
Chen-Stein Poisson approximation;
ASYMPTOTIC-DISTRIBUTION;
LIMIT-THEOREMS;
DISTRIBUTIONS;
COHERENCE;
APPROXIMATIONS;
D O I:
10.1214/19-AOP1341
中图分类号:
O21 [概率论与数理统计];
C8 [统计学];
学科分类号:
020208 ;
070103 ;
0714 ;
摘要:
The paper studies the limiting distribution of the largest off-diagonal entry of the sample correlation matrices of high-dimensional Gaussian populations with equi-correlation structure. Assume the entries of the population distribution have a common correlation coefficient rho > 0 and both the population dimension p and the sample size n tend to infinity with log p = o(n(1/3)). As 0 < rho < 1, we prove that the largest off-diagonal entry of the sample correlation matrix converges to a Gaussian distribution, and the same is true for the sample covariance matrix as 0 < rho < 1/2. This differs substantially from a well-known result for the independent case where rho = 0, in which the above limiting distribution is an extreme-value distribution. We then study the phase transition between these two limiting distributions and identify the regime of rho where the transition occurs. If. is less than, larger than or is equal to the threshold, the corresponding limiting distribution is the extreme-value distribution, the Gaussian distribution and a convolution of the two distributions, respectively. The proofs rely on a subtle use of the Chen-Stein Poisson approximation method, conditioning, a coupling to create independence and a special property of sample correlation matrices. An application is given for a statistical testing problem.