LIKELIHOOD RATIO TEST IN MULTIVARIATE LINEAR REGRESSION: FROM LOW TO HIGH DIMENSION

被引:10
作者
He, Yinqiu [1 ]
Jiang, Tiefeng [2 ]
Wen, Jiyang [3 ]
Xu, Gongjun [1 ]
机构
[1] Univ Michigan, Dept Stat, Ann Arbor, MI 48109 USA
[2] Univ Minnesota, Sch Stat, Minneapolis, MN 55455 USA
[3] Johns Hopkins Bloomberg Sch Publ Hlth, Dept Biostat, Baltimore, MD 21205 USA
基金
美国国家科学基金会;
关键词
High dimension; likelihood ratio test; multivariate linear regression; CENTRAL LIMIT-THEOREMS;
D O I
10.5705/ss.202019.0056
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Multivariate linear regressions are widely used to model the associations between multiple related responses and a set of predictors. To infer such associations, researchers often test the structure of the regression coefficients matrix, usually using a likelihood ratio test (LRT). Despite their popularity, classical chi(2) approximations for LRTs are known to fail in high-dimensional settings, where the dimensions of the responses and the predictors (m, p) are allowed to grow with the sample size n. Although various corrected LRTs and other test statistics have been proposed, few studies have examined the important question of when the classic LRT starts to fail. An answer to this would provide insights for practitioners, especially when analyzing data in which m/n and p/n are small, but not negligible. Moreover, the power of the LRT in high-dimensional data analyses remains under-researched. To address these issues, the first part of this work determines the asymptotic boundary at which the classical LRT fails, and develops a corrected limiting distribution for the LRT with a general asymptotic regime. The second part of this work examines the power of the LRT in high-dimensional settings. In addition to advancing the current understanding of the asymptotic behavior of the LRT under an alternative hypothesis, these results motivate the development of a more powerful LRT. The third part of this work considers the setting in which p > n, where the LRT is not well defined. We propose a two-step testing procedure. First, we perform a dimension reduction, and then we apply the proposed LRT. Theoretical properties are developed to ensure the validity of the proposed method, and simulations demonstrate that the method performs well.
引用
收藏
页码:1215 / 1238
页数:24
相关论文
共 39 条
[1]  
Anderson T. W., 1962, An Introduction to Multivariate Statistical Analysis
[2]  
[Anonymous], 2005, Muirhead
[3]  
Bai ZD, 1996, STAT SINICA, V6, P311
[4]   Testing linear hypotheses in high-dimensional regressions [J].
Bai, Zhidong ;
Jiang, Dandan ;
Yao, Jian-feng ;
Zheng, Shurong .
STATISTICS, 2013, 47 (06) :1207-1223
[5]   CORRECTIONS TO LRT ON LARGE-DIMENSIONAL COVARIANCE MATRIX BY RMT [J].
Bai, Zhidong ;
Jiang, Dandan ;
Yao, Jian-Feng ;
Zheng, Shurong .
ANNALS OF STATISTICS, 2009, 37 (6B) :3822-3840
[6]   Conditional Sure Independence Screening [J].
Barut, Emre ;
Fan, Jianqing ;
Verhasselt, Anneleen .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2016, 111 (515) :1266-1277
[7]   VALID POST-SELECTION INFERENCE [J].
Berk, Richard ;
Brown, Lawrence ;
Buja, Andreas ;
Zhang, Kai ;
Zhao, Linda .
ANNALS OF STATISTICS, 2013, 41 (02) :802-837
[8]   REMARKS ON PARALLEL ANALYSIS [J].
BUJA, A ;
EYUBOGLU, N .
MULTIVARIATE BEHAVIORAL RESEARCH, 1992, 27 (04) :509-540
[9]   High-dimensional sparse MANOVA [J].
Cai, T. Tony ;
Xia, Yin .
JOURNAL OF MULTIVARIATE ANALYSIS, 2014, 131 :174-196
[10]   Reduced rank regression via adaptive nuclear norm penalization [J].
Chen, Kun ;
Dong, Hongbo ;
Chan, Kung-Sik .
BIOMETRIKA, 2013, 100 (04) :901-920