Sample canonical correlation coefficients of high-dimensional random vectors with finite rank correlations

被引:2
作者
Ma, Zongming [1 ]
Yang, Fan [1 ]
机构
[1] Univ Penn, Dept Stat & Data Sci, Philadelphia, PA 19104 USA
关键词
Canonical correlation analysis; BBP transition; Tracy-Widom law; edge eigenvalues; CENTRAL LIMIT-THEOREMS; LARGEST EIGENVALUE; MULTIVARIATE-ANALYSIS; PRINCIPAL COMPONENTS; COVARIANCE MATRICES; DISTRIBUTIONS; DEFORMATION; SPECTRUM; OUTLIERS; CCA;
D O I
10.3150/22-BEJ1525
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Consider two random vectors (similar to)x = Az + C-1/2 (1) x is an element of R-p and (similar to)y = Bz + C-1/2 (2) y is an element of R-q, where x is an element of R-p, y is an element of R-q and z is an element of R-r are independent random vectors with i.i.d. entries of zero mean and unit variance, C-1 and C-2 are p x p and q x q deterministic population covariance matrices, and A and B are p x r and q x r deterministic factor loading matrices. With n independent observations of (similar to)x and (similar to)y, we study the sample canonical correlations between them. Under the sharp fourth moment condition on the entries of x, y and z, we prove the BBP transition for the sample canonical correlation coefficients (CCCs). More precisely, if a population CCC is below a threshold, then the corresponding sample CCC converges to the right edge of the bulk eigenvalue spectrum of the sample canonical correlation matrix and satisfies the famous Tracy-Widom law; if a population CCC is above the threshold, then the corresponding sample CCC converges to an outlier that is detached from the bulk eigenvalue spectrum. We prove our results in full generality, in the sense that they also hold for near-degenerate population CCCs and population CCCs that are close to the threshold.
引用
收藏
页码:1905 / 1932
页数:28
相关论文
共 50 条
  • [31] TWO SAMPLE TESTS FOR HIGH-DIMENSIONAL COVARIANCE MATRICES
    Li, Jun
    Chen, Song Xi
    ANNALS OF STATISTICS, 2012, 40 (02) : 908 - 940
  • [32] A New Statistic for Testing Covariance Equality in High-Dimensional Gaussian Low-Rank Models
    Beisson, Remi
    Vallet, Pascal
    Giremus, Audrey
    Ginolhac, Guillaume
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2024, 72 : 1797 - 1812
  • [33] High-dimensional consistency of rank estimation criteria in multivariate linear model
    Fujikoshi, Yasunori
    Sakurai, Tetsuro
    JOURNAL OF MULTIVARIATE ANALYSIS, 2016, 149 : 199 - 212
  • [34] Multi-sample test for high-dimensional covariance matrices
    Zhang, Chao
    Bai, Zhidong
    Hu, Jiang
    Wang, Chen
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2018, 47 (13) : 3161 - 3177
  • [35] RANDOM MATRIX-OPTIMIZED HIGH-DIMENSIONAL MVDR BEAMFORMING
    Yang, Liusha
    McKay, Matthew
    Couillet, Romain
    2018 IEEE STATISTICAL SIGNAL PROCESSING WORKSHOP (SSP), 2018, : 473 - 477
  • [36] Large-sample approximations for variance-covariance matrices of high-dimensional time series
    Steland, Ansgar
    Von Sachs, Rainer
    BERNOULLI, 2017, 23 (4A) : 2299 - 2329
  • [37] Spectral analysis of high-dimensional sample covariance matrices with missing observations
    Jurczak, Kamil
    Rohde, Angelika
    BERNOULLI, 2017, 23 (4A) : 2466 - 2532
  • [38] Large deviations for high-dimensional random projections of lpn-balls
    Alonso-Gutierrez, David
    Prochno, Joscha
    Thaele, Christoph
    ADVANCES IN APPLIED MATHEMATICS, 2018, 99 : 1 - 35
  • [39] Computational Information Geometry for Binary Classification of High-Dimensional Random Tensors
    Pham, Gia-Thuy
    Boyer, Remy
    Nielsen, Frank
    ENTROPY, 2018, 20 (03)
  • [40] Simple and reliable estimators of coefficients of interest in a model with high-dimensional confounding effects
    Galbraith, John W.
    Zinde-Walsh, Victoria
    JOURNAL OF ECONOMETRICS, 2020, 218 (02) : 609 - 632