Sampling without replacement from a high-dimensional finite population

被引:1
作者
Hu, Jiang [1 ,2 ]
Wang, Shaochen [3 ]
Zhang, Yangchun [4 ]
Zhou, Wang [5 ]
机构
[1] Northeast Normal Univ, KLASMOE, Changchun 130024, Peoples R China
[2] Northeast Normal Univ, Sch Math & Stat, Changchun 130024, Peoples R China
[3] South China Univ Technol, Sch Math, Guangzhou 510640, Peoples R China
[4] Shanghai Univ, Dept Met, Shanghai 200444, Peoples R China
[5] Natl Univ Singapore, Dept Stat & Data Sci, Singapore 117546, Singapore
关键词
Largest eigenvalue; Tracy-Widom law; sample covariance matrix; finite population model; parallel analysis; TRACY-WIDOM LIMIT; COVARIANCE MATRICES; LARGEST EIGENVALUE; PRINCIPAL COMPONENTS; UNIVERSALITY; FLUCTUATIONS;
D O I
10.3150/22-BEJ1580
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
It is well known that most of the existing theoretical results in statistics are based on the assumption that the sample is generated with replacement from an infinite population. However, in practice, available samples are almost always collected without replacement. If the population is a finite set of real numbers, whether we can still safely use the results from samples drawn without replacement becomes an important problem. In this paper, we focus on the eigenvalues of high-dimensional sample covariance matrices generated without replacement from finite populations. Specifically, we derive the Tracy-Widom laws for their largest eigenvalues and apply these results to parallel analysis. We provide new insight into the permutation methods proposed by Buja and Eyuboglu in [Multivar Behav Res. 27(4) (1992) 509-540]. Simulation and real data studies are conducted to demonstrate our results.
引用
收藏
页码:3198 / 3220
页数:23
相关论文
共 50 条
  • [21] High-Dimensional Linear Models: A Random Matrix Perspective
    Namdari, Jamshid
    Paul, Debashis
    Wang, Lili
    SANKHYA-SERIES A-MATHEMATICAL STATISTICS AND PROBABILITY, 2021, 83 (02): : 645 - 695
  • [22] Global Testing and Large-Scale Multiple Testing for High-Dimensional Covariance Structures
    Cai, T. Tony
    ANNUAL REVIEW OF STATISTICS AND ITS APPLICATION, VOL 4, 2017, 4 : 423 - 446
  • [23] CLT FOR LARGEST EIGENVALUES AND UNIT ROOT TESTING FOR HIGH-DIMENSIONAL NONSTATIONARY TIME SERIES
    Zhang, Bo
    Pan, Guangming
    Gao, Jiti
    ANNALS OF STATISTICS, 2018, 46 (05) : 2186 - 2215
  • [24] TWO SAMPLE TESTS FOR HIGH-DIMENSIONAL COVARIANCE MATRICES
    Li, Jun
    Chen, Song Xi
    ANNALS OF STATISTICS, 2012, 40 (02) : 908 - 940
  • [25] A New Statistic for Testing Covariance Equality in High-Dimensional Gaussian Low-Rank Models
    Beisson, Remi
    Vallet, Pascal
    Giremus, Audrey
    Ginolhac, Guillaume
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2024, 72 : 1797 - 1812
  • [26] Estimation of the number of spikes, possibly equal, in the high-dimensional case
    Passemier, Damien
    Yao, Jianfeng
    JOURNAL OF MULTIVARIATE ANALYSIS, 2014, 127 : 173 - 183
  • [27] Strong Replica Symmetry in High-Dimensional Optimal Bayesian Inference
    Barbier, Jean
    Panchenko, Dmitry
    COMMUNICATIONS IN MATHEMATICAL PHYSICS, 2022, 393 (3) : 1199 - 1239
  • [28] CONTIGUITY UNDER HIGH-DIMENSIONAL GAUSSIANITY WITH APPLICATIONS TO COVARIANCE TESTING
    Han, Qiyang
    Jiang, Tiefeng
    Shen, Yandi
    ANNALS OF APPLIED PROBABILITY, 2023, 33 (6A) : 4272 - 4321
  • [29] TEST FOR BANDEDNESS OF HIGH-DIMENSIONAL COVARIANCE MATRICES AND BANDWIDTH ESTIMATION
    Qiu, Yumou
    Chen, Song Xi
    ANNALS OF STATISTICS, 2012, 40 (03) : 1285 - 1314
  • [30] Survival analysis with high-dimensional covariates
    Witten, Daniela M.
    Tibshirani, Robert
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2010, 19 (01) : 29 - 51