INDEPENDENCE TEST FOR HIGH DIMENSIONAL DATA BASED ON REGULARIZED CANONICAL CORRELATION COEFFICIENTS

被引:38
作者
Yang, Yanrong [1 ]
Pan, Guangming [2 ]
机构
[1] Monash Univ, Dept Econometr & Business Stat, Caulfield, Vic 3145, Australia
[2] Nanyang Technol Univ, Sch Math & Phys Sci, Singapore 639798, Singapore
关键词
Canonical correlation coefficients; central limit theorem; large dimensional random matrix theory; independence test; linear spectral statistics; EMPIRICAL DISTRIBUTION; COVARIANCE MATRICES; EIGENVALUES; STATISTICS;
D O I
10.1214/14-AOS1284
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
This paper proposes a new statistic to test independence between two high dimensional random vectors X: p(1) x 1 and Y : p(2) x 1. The proposed statistic is based on the sum of regularized sample canonical correlation coefficients of X and Y. The asymptotic distribution of the statistic under the null hypothesis is established as a corollary of general central limit theorems (CLT) for the linear statistics of classical and regularized sample canonical correlation coefficients when p(1) and p(2) are both comparable to the sample size n. As applications of the developed independence test, various types of dependent structures, such as factor models, ARCH models and a general uncorrelated but dependent case, etc., are investigated by simulations. As an empirical application, cross-sectional dependence of daily stock returns of companies between different sections in the New York Stock Exchange (NYSE) is detected by the proposed test.
引用
收藏
页码:467 / 500
页数:34
相关论文
共 23 条
[1]  
Anderson T. W., 1984, INTRO MULTIVARIATE S
[2]   Determining the number of factors in approximate factor models [J].
Bai, JS ;
Ng, S .
ECONOMETRICA, 2002, 70 (01) :191-221
[3]  
Bai ZD, 1996, STAT SINICA, V6, P311
[4]   ON ESTIMATION OF THE POPULATION SPECTRAL DISTRIBUTION FROM A HIGH-DIMENSIONAL SAMPLE COVARIANCE MATRIX [J].
Bai, Zhidong ;
Chen, Jiaqi ;
Yao, Jianfeng .
AUSTRALIAN & NEW ZEALAND JOURNAL OF STATISTICS, 2010, 52 (04) :423-437
[5]  
Bai ZD, 2010, SPRINGER SER STAT, P433, DOI 10.1007/978-1-4419-0661-8_12
[6]   Regularized estimation of large covariance matrices [J].
Bickel, Peter J. ;
Levina, Elizaveta .
ANNALS OF STATISTICS, 2008, 36 (01) :199-227
[7]   A note on testing the covariance matrix for large dimension [J].
Birke, M ;
Dette, H .
STATISTICS & PROBABILITY LETTERS, 2005, 74 (03) :281-289
[8]   SPECTRUM ESTIMATION FOR LARGE DIMENSIONAL COVARIANCE MATRICES USING RANDOM MATRIX THEORY [J].
El Karoui, Noureddine .
ANNALS OF STATISTICS, 2008, 36 (06) :2757-2790
[9]  
Fan JQ, 2012, J ROY STAT SOC B, V74, P37, DOI 10.1111/j.1467-9868.2011.01005.x
[10]   HIGH-DIMENSIONAL CLASSIFICATION USING FEATURES ANNEALED INDEPENDENCE RULES [J].
Fan, Jianqing ;
Fan, Yingying .
ANNALS OF STATISTICS, 2008, 36 (06) :2605-2637