High-dimensional two-sample mean vectors test and support recovery with factor adjustment

被引:1
作者
He, Yong [1 ]
Zhang, Mingjuan [2 ]
Zhang, Xinsheng [3 ]
Zhou, Wang [4 ]
机构
[1] Shandong Univ, Inst Financial Studies, Jinan 250100, Peoples R China
[2] Shanghai Lixin Univ Accounting & Finance, Sch Stat & Math, Shanghai 201209, Peoples R China
[3] Fudan Univ, Sch Management, Shanghai 200433, Peoples R China
[4] Natl Univ Singapore, Dept Stat & Appl Probabil, Singapore 117546, Singapore
基金
美国国家科学基金会;
关键词
False discovery rate; High-dimensional; Multiple testing; Multiplier bootstrap; Two-sample test; FALSE DISCOVERY RATE; LARGE COVARIANCE ESTIMATION; PRINCIPAL-COMPONENTS; FEWER OBSERVATIONS; FACTOR MODELS; NUMBER; MATRIX; PROPORTION; BOOTSTRAP; RETURNS;
D O I
10.1016/j.csda.2020.107004
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Testing the equality of two mean vectors is a classical problem in multivariate analysis. In this article, we consider the test in the high-dimensional setting. Existing tests often assume that the covariance matrix (or its inverse) of the underlying variables is sparse, which is rarely true in social science due to the existence of latent common factors. In the article, we introduce a maximum-type test statistic based on the factor-adjusted data. The factor-adjustment step increases the signal-to-noise ratio and thus results in more powerful test. We obtain the limiting null distribution of the maximum-type test statistic, which is the extreme value distribution of type I. To overcome the well-known slow convergence rate of the test statistic's distribution to the limiting extreme value distribution, we also propose a multiplier bootstrap method to improve the finite-sample performance. In addition, a multiple testing procedure with false discovery rate (FDR) control is proposed for identifying specific locations that differ significantly between the two groups. Thorough numerical studies are conducted to show the superiority of the test over other state-of-the-art tests. The performance of the test is also assessed through a real stock market dataset. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页数:19
相关论文
共 56 条
[1]   Eigenvalue Ratio Test for the Number of Factors [J].
Ahn, Seung C. ;
Horenstein, Alex R. .
ECONOMETRICA, 2013, 81 (03) :1203-1227
[2]  
Anderson TW., 2003, An introduction to multivariate statistical analysis, V3
[3]  
[Anonymous], ARXIV14102848
[4]  
[Anonymous], NBER WORKING PAPER
[5]  
[Anonymous], 2011, J AM STAT ASS
[6]   Inferential theory for factor models of large dimensions. [J].
Bai, J .
ECONOMETRICA, 2003, 71 (01) :135-171
[7]   Determining the number of factors in approximate factor models [J].
Bai, JS ;
Ng, S .
ECONOMETRICA, 2002, 70 (01) :191-221
[8]   STATISTICAL ANALYSIS OF FACTOR MODELS OF HIGH DIMENSION [J].
Bai, Jushan ;
Li, Kunpeng .
ANNALS OF STATISTICS, 2012, 40 (01) :436-465
[9]  
Bai ZD, 1996, STAT SINICA, V6, P311
[10]  
Benjamini Y, 2001, ANN STAT, V29, P1165