SPECTRAL ANALYSIS OF GRAM MATRICES WITH MISSING AT RANDOM OBSERVATIONS: CONVERGENCE, CENTRAL LIMIT THEOREMS, AND APPLICATIONS IN STATISTICAL INFERENCE

被引:0
作者
Li, Huiqin [1 ]
Pan, Guangming [2 ]
Yin, Yanqing [1 ]
Zhou, Wang [3 ]
机构
[1] Chongqing Univ, Sch Math & Stat, Chongqing, Peoples R China
[2] Nanyang Technol Univ, Div Math Sci, Singapore, Singapore
[3] Natl Univ Singapore, Dept Stat & Data Sci, Singapore, Singapore
关键词
Sample covariance matrix; missing observations; high-dimensionality; limiting spec- tral distribution; central limit theorem; random matrix theory; SAMPLE COVARIANCE MATRICES; 2-SAMPLE TEST; EIGENVALUES;
D O I
10.1214/24-AOS2392
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Motivated by the statistical inference using the Gram matrix in the context of missing at random observations, this paper investigates the spectral resents a Hadamard random matrix with entries determined by independent Bernoulli variables D. Operating within the high-dimensional framework, we establish the convergence of the empirical spectral distribution of Sn to a well-defined limiting distribution. In addition, we explore the impact of the missing mechanism on the second-order properties of the spectral distribution of the Gram matrix Sn. We establish the central limit theorem for the linear spectral statistics of Sn, shedding light on their fluctuations. Surprisingly, our analysis reveals that even in the ideal Gaussian distribution scenario, the fluctuations of statistics generated by eigenvalues are influenced by the eigenvectors of the population covariance matrix in the missing-at-random case. This discovery uncovers a remarkable phenomenon that starkly contrasts with the classical case. Subsequently, we demonstrate the practical application of our central limit theorem in hypothesis testing for the population covariance matrix.
引用
收藏
页码:1254 / 1275
页数:22
相关论文
共 19 条
[1]  
Bai ZD, 2004, ANN PROBAB, V32, P553
[2]  
Bai ZD, 1998, ANN PROBAB, V26, P316
[3]  
Bai ZD, 2008, STAT SINICA, V18, P425
[4]   High dimensional linear discriminant analysis: optimality, adaptive algorithm and missing data [J].
Cai, T. Tony ;
Zhang, Linjun .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2019, 81 (04) :675-705
[5]   Two-sample test of high dimensional means under dependence [J].
Cai, T. Tony ;
Liu, Weidong ;
Xia, Yin .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2014, 76 (02) :349-372
[6]   A TWO-SAMPLE TEST FOR HIGH-DIMENSIONAL DATA WITH APPLICATIONS TO GENE-SET TESTING [J].
Chen, Song Xi ;
Qin, Ying-Li .
ANNALS OF STATISTICS, 2010, 38 (02) :808-835
[7]  
Chen YX, 2021, ANN STAT, V49, P2948, DOI [10.1214/21-AOS2066, 10.1214/21-aos2066]
[8]   CONCENTRATION OF MEASURE AND SPECTRA OF RANDOM MATRICES: APPLICATIONS TO CORRELATION MATRICES, ELLIPTICAL DISTRIBUTIONS AND BEYOND [J].
El Karoui, Noureddine .
ANNALS OF APPLIED PROBABILITY, 2009, 19 (06) :2362-2405
[9]   Estimating Number of Factors by Adjusted Eigenvalues Thresholding [J].
Fan, Jianqing ;
Guo, Jianhua ;
Zheng, Shurong .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2022, 117 (538) :852-861
[10]   HIGH-DIMENSIONAL COVARIANCE MATRICES IN ELLIPTICAL DISTRIBUTIONS WITH APPLICATION TO SPHERICAL TEST [J].
Hu, Jiang ;
Li, Weiming ;
Liu, Zhi ;
Zhou, Wang .
ANNALS OF STATISTICS, 2019, 47 (01) :527-555