Sample canonical correlation coefficients of high-dimensional random vectors with finite rank correlations

被引:2
作者
Ma, Zongming [1 ]
Yang, Fan [1 ]
机构
[1] Univ Penn, Dept Stat & Data Sci, Philadelphia, PA 19104 USA
关键词
Canonical correlation analysis; BBP transition; Tracy-Widom law; edge eigenvalues; CENTRAL LIMIT-THEOREMS; LARGEST EIGENVALUE; MULTIVARIATE-ANALYSIS; PRINCIPAL COMPONENTS; COVARIANCE MATRICES; DISTRIBUTIONS; DEFORMATION; SPECTRUM; OUTLIERS; CCA;
D O I
10.3150/22-BEJ1525
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Consider two random vectors (similar to)x = Az + C-1/2 (1) x is an element of R-p and (similar to)y = Bz + C-1/2 (2) y is an element of R-q, where x is an element of R-p, y is an element of R-q and z is an element of R-r are independent random vectors with i.i.d. entries of zero mean and unit variance, C-1 and C-2 are p x p and q x q deterministic population covariance matrices, and A and B are p x r and q x r deterministic factor loading matrices. With n independent observations of (similar to)x and (similar to)y, we study the sample canonical correlations between them. Under the sharp fourth moment condition on the entries of x, y and z, we prove the BBP transition for the sample canonical correlation coefficients (CCCs). More precisely, if a population CCC is below a threshold, then the corresponding sample CCC converges to the right edge of the bulk eigenvalue spectrum of the sample canonical correlation matrix and satisfies the famous Tracy-Widom law; if a population CCC is above the threshold, then the corresponding sample CCC converges to an outlier that is detached from the bulk eigenvalue spectrum. We prove our results in full generality, in the sense that they also hold for near-degenerate population CCCs and population CCCs that are close to the threshold.
引用
收藏
页码:1905 / 1932
页数:28
相关论文
共 50 条
  • [21] Two sample tests for high-dimensional autocovariances
    Baek, Changryong
    Gates, Katheleen M.
    Leinwand, Benjamin
    Pipiras, Vladas
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2021, 153
  • [22] Strong Consistency of Log-Likelihood-Based Information Criterion in High-Dimensional Canonical Correlation Analysis
    Oda, Ryoya
    Yanagihara, Hirokazu
    Fujikoshi, Yasunori
    SANKHYA-SERIES A-MATHEMATICAL STATISTICS AND PROBABILITY, 2021, 83 (01): : 109 - 127
  • [23] High-Dimensional Clustering via Random Projections
    Anderlucci, Laura
    Fortunato, Francesca
    Montanari, Angela
    JOURNAL OF CLASSIFICATION, 2022, 39 (01) : 191 - 216
  • [24] Testing for High-Dimensional Geometry in Random Graphs
    Bubeck, Sebastien
    Ding, Jian
    Eldan, Ronen
    Racz, Miklos Z.
    RANDOM STRUCTURES & ALGORITHMS, 2016, 49 (03) : 503 - 532
  • [25] COMPUTABLE ERROR BOUNDS FOR HIGH-DIMENSIONAL APPROXIMATIONS OF AN LR STATISTIC FOR ADDITIONAL INFORMATION IN CANONICAL CORRELATION ANALYSIS
    Wakaki, H.
    Fujikoshi, Ya.
    THEORY OF PROBABILITY AND ITS APPLICATIONS, 2018, 62 (01) : 157 - 172
  • [26] Strong Consistency of Log-Likelihood-Based Information Criterion in High-Dimensional Canonical Correlation Analysis
    Ryoya Oda
    Hirokazu Yanagihara
    Yasunori Fujikoshi
    Sankhya A, 2021, 83 : 109 - 127
  • [27] Random or Nonrandom Signal in High-Dimensional Regimes
    Liu, Yiming
    Liang, Ying-Chang
    Pan, Guangming
    Zhang, Zhixiang
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2023, 69 (01) : 298 - 315
  • [28] DISTRIBUTION OF LEVELS IN HIGH-DIMENSIONAL RANDOM LANDSCAPES
    Kabluchko, Zakhar
    ANNALS OF APPLIED PROBABILITY, 2012, 22 (01) : 337 - 362
  • [29] Inference for high-dimensional differential correlation matrices
    Cai, T. Tony
    Zhang, Anru
    JOURNAL OF MULTIVARIATE ANALYSIS, 2016, 143 : 107 - 126
  • [30] Test on the linear combinations of mean vectors in high-dimensional data
    Li, Huiqin
    Hu, Jiang
    Bai, Zhidong
    Yin, Yanqing
    Zou, Kexin
    TEST, 2017, 26 (01) : 188 - 208