Sample canonical correlation coefficients of high-dimensional random vectors with finite rank correlations

Cited by: 2
Authors
Ma, Zongming [1]
Yang, Fan [1]
Affiliations
[1] Univ Penn, Dept Stat & Data Sci, Philadelphia, PA 19104 USA
Keywords
Canonical correlation analysis; BBP transition; Tracy-Widom law; edge eigenvalues; CENTRAL LIMIT-THEOREMS; LARGEST EIGENVALUE; MULTIVARIATE-ANALYSIS; PRINCIPAL COMPONENTS; COVARIANCE MATRICES; DISTRIBUTIONS; DEFORMATION; SPECTRUM; OUTLIERS; CCA;
DOI
10.3150/22-BEJ1525
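Chinese Library Classification (CLC) number
O21 [Probability Theory and Mathematical Statistics]; C8 [Statistics];
Discipline classification code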
020208 ; 070103 ; 0714 ;
Abstract
Consider two random vectors $\widetilde{x} = Az + C_1^{1/2} x \in \mathbb{R}^p$ and $\widetilde{y} = Bz + C_2^{1/2} y \in \mathbb{R}^q$, where $x \in \mathbb{R}^p$, $y \in \mathbb{R}^q$ and $z \in \mathbb{R}^r$ are independent random vectors with i.i.d. entries of zero mean and unit variance, $C_1$ and $C_2$ are $p \times p$ and $q \times q$ deterministic population covariance matrices, and $A$ and $B$ are $p \times r$ and $q \times r$ deterministic factor loading matrices. With $n$ independent observations of $\widetilde{x}$ and $\widetilde{y}$, we study the sample canonical correlations between them. Under the sharp fourth moment condition on the entries of $x$, $y$ and $z$, we prove the BBP transition for the sample canonical correlation coefficients (CCCs). More precisely, if a population CCC is below a threshold, then the corresponding sample CCC converges to the right edge of the bulk eigenvalue spectrum of the sample canonical correlation matrix and satisfies the famous Tracy-Widom law; if a population CCC is above the threshold, then the corresponding sample CCC converges to an outlier that is detached from the bulk eigenvalue spectrum. We prove our results in full generality, in the sense that they also hold for near-degenerate population CCCs and population CCCs that are close to the threshold.
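The model in the abstract can be explored numerically. The following is a minimal sketch, not the authors' code, under assumptions that are not in the source: Gaussian entries, identity covariances $C_1 = C_2 = I$, rank $r = 1$, and loadings $A = a e_1$, $B = b e_1$. The function names (`simulate`, `sample_cccs`, `population_ccc`) and all parameter values are hypothetical. It computes the sample CCCs as the singular values of $Q_x^\top Q_y$, where $Q_x$ and $Q_y$ are orthonormal bases obtained from QR factorizations of the data matrices.

```python
import numpy as np


def simulate(n, p, q, a, b, rng):
    """Draw n observations of x~ = A z + x and y~ = B z + y.

    Assumptions for illustration only: r = 1, C1 = C2 = identity,
    A = a * e1, B = b * e1, and standard normal entries.
    Returns data matrices of shape (p, n) and (q, n).
    """
    z = rng.standard_normal((1, n))
    x = rng.standard_normal((p, n))
    y = rng.standard_normal((q, n))
    A = np.zeros((p, 1))
    A[0, 0] = a
    B = np.zeros((q, 1))
    B[0, 0] = b
    return A @ z + x, B @ z + y


def sample_cccs(X, Y):
    """Sample canonical correlation coefficients, in descending order.

    The entries have zero mean by construction, so no centering is applied.
    The singular values of Qx^T Qy equal the square roots of the eigenvalues
    of the sample canonical correlation matrix.
    """
    Qx, _ = np.linalg.qr(X.T)  # (n, p), orthonormal columns
    Qy, _ = np.linalg.qr(Y.T)  # (n, q), orthonormal columns
    s = np.linalg.svd(Qx.T @ Qy, compute_uv=False)
    return np.clip(s, 0.0, 1.0)


def population_ccc(a, b):
    """Nonzero population CCC for the rank-one, identity-covariance setup."""
    return a * b / np.sqrt((1.0 + a**2) * (1.0 + b**2))


rng = np.random.default_rng(0)
n, p, q = 1000, 50, 40

# No shared factor: every population CCC is zero.
X0, Y0 = simulate(n, p, q, a=0.0, b=0.0, rng=rng)
null_cccs = sample_cccs(X0, Y0)

# Strong shared factor: one population CCC equal to 0.9.
X1, Y1 = simulate(n, p, q, a=3.0, b=3.0, rng=rng)
spiked_cccs = sample_cccs(X1, Y1)

print("population CCC (spiked case):", population_ccc(3.0, 3.0))
print("largest sample CCC, null case:", null_cccs[0])
print("two largest sample CCCs, spiked case:", spiked_cccs[:2])
```

In this setup the largest sample CCC in the spiked case should sit well apart from the second one, which stays near the largest value seen in the null case. This only illustrates the outlier regime described in the abstract; it does not locate the threshold or check the Tracy-Widom fluctuations.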
Pages: 1905 - 1932
Number of pages: 28
Related papers
50 in total
  • [41] On simultaneous confidence interval estimation for the difference of paired mean vectors in high-dimensional settings
    Hyodo, Masashi
    Watanabe, Hiroki
    Seo, Takashi
    JOURNAL OF MULTIVARIATE ANALYSIS, 2018, 168 : 160 - 173
  • [42] An RIHT statistic for testing the equality of several high-dimensional mean vectors under homoskedasticity
    Zhang, Qiuyan
    Wang, Chen
    Zhang, Baoxue
    Yang, Hu
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2024, 190
  • [43] HYPOTHESIS TESTING FOR BLOCK-STRUCTURED CORRELATION FOR HIGH-DIMENSIONAL VARIABLES
    Zheng, Shurong
    He, Xuming
    Guo, Jianhua
    STATISTICA SINICA, 2022, 32 (02) : 719 - 735
  • [44] Reduced rank regression with matrix projections for high-dimensional multivariate linear regression model
    Guo, Wenxing
    Balakrishnan, Narayanaswamy
    Bian, Mengjie
    ELECTRONIC JOURNAL OF STATISTICS, 2021, 15 (02) : 4167 - 4191
  • [45] Bayesian Optimal Two-Sample Tests for High-Dimensional Gaussian Populations
    Lee, Kyoungjae
    You, Kisung
    Lin, Lizhen
    BAYESIAN ANALYSIS, 2024, 19 (03) : 869 - 893
  • [46] EIGENVALUE DISTRIBUTIONS OF VARIANCE COMPONENTS ESTIMATORS IN HIGH-DIMENSIONAL RANDOM EFFECTS MODELS
    Fan, Zhou
    Johnstone, Iain M.
    ANNALS OF STATISTICS, 2019, 47 (05) : 2855 - 2886
  • [47] Limiting Behavior of Largest Entry of Random Tensor Constructed by High-Dimensional Data
    Jiang, Tiefeng
    Xie, Junshan
    JOURNAL OF THEORETICAL PROBABILITY, 2020, 33 (04) : 2380 - 2400
  • [48] Regularized Estimation of Information via Canonical Correlation Analysis on a Finite-Dimensional Feature Space
    De Cabrera, Ferran
    Riba, Jaume
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2023, 69 (08) : 5135 - 5150
  • [49] Largest magnitude for off-diagonal auto-correlation coefficients in high dimensional framework
    Boucher, Maxime
    Chauveau, Didier
    Zani, Marguerite
    STATISTICAL PAPERS, 2025, 66 (04)
  • [50] On high-dimensional tests for mutual independence based on Pearson's correlation coefficient
    Mao, Guangyu
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2020, 49 (14) : 3572 - 3584