Inference in High-Dimensional Multivariate Response Regression with Hidden Variables

被引:2
|
作者
Bing, Xin [1 ]
Cheng, Wei [2 ]
Feng, Huijie [3 ]
Ning, Yang [4 ]
机构
[1] Univ Toronto, Dept Stat Sci, Toronto, ON, Canada
[2] Brown Univ, Ctr Computat Mol Biol, Providence, RI 02912 USA
[3] Microsoft, Bellevue, WA USA
[4] Cornell Univ, Dept Stat & Data Sci, Ithaca, NY 14850 USA
关键词
Confidence intervals; Confounding; Hidden variables; High-dimensional regression; Hypothesis testing; Multivariate response regression; Surrogate variable analysis; CONFIDENCE-REGIONS; NUMBER; INTERVALS; SELECTION; MODELS; TESTS;
D O I
10.1080/01621459.2023.2241701
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
This article studies the inference of the regression coefficient matrix under multivariate response linear regressions in the presence of hidden variables. A novel procedure for constructing confidence intervals of entries of the coefficient matrix is proposed. Our method first uses the multivariate nature of the responses by estimating and adjusting the hidden effect to construct an initial estimator of the coefficient matrix. By further deploying a low-dimensional projection procedure to reduce the bias introduced by the regularization in the previous step, a refined estimator is proposed and shown to be asymptotically normal. The asymptotic variance of the resulting estimator is derived with closed-form expression and can be consistently estimated. In addition, we propose a testing procedure for the existence of hidden effects and provide its theoretical justification. Both our procedures and their analyses are valid even when the feature dimension and the number of responses exceed the sample size. Our results are further backed up via extensive simulations and a real data analysis. Supplementary materials for this article are available online.
引用
收藏
页码:2066 / 2077
页数:12
相关论文
共 50 条
  • [1] Inference for high-dimensional instrumental variables regression
    Gold, David
    Lederer, Johannes
    Tao, Jing
    JOURNAL OF ECONOMETRICS, 2020, 217 (01) : 79 - 111
  • [2] Response best-subset selector for multivariate regression with high-dimensional response variables
    Hu, Jianhua
    Huang, Jian
    Liu, Xiaoqian
    Liu, Xu
    BIOMETRIKA, 2023, 110 (01) : 205 - 223
  • [3] On inference in high-dimensional regression
    Battey, Heather S.
    Reid, Nancy
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2023, 85 (01) : 149 - 175
  • [4] Multivariate linear regression of high-dimensional fMRI data with multiple target variables
    Valente, Giancarlo
    Castellanos, Agustin Lage
    Vanacore, Gianluca
    Formisano, Elia
    HUMAN BRAIN MAPPING, 2014, 35 (05) : 2163 - 2177
  • [5] ADAPTIVE ESTIMATION IN MULTIVARIATE RESPONSE REGRESSION WITH HIDDEN VARIABLES
    Bing, Xin
    Ning, Yang
    Xu, Yaosheng
    ANNALS OF STATISTICS, 2022, 50 (02): : 640 - 672
  • [6] Inference for High-Dimensional Censored Quantile Regression
    Fei, Zhe
    Zheng, Qi
    Hong, Hyokyoung G.
    Li, Yi
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2023, 118 (542) : 898 - 912
  • [7] Variational Inference in high-dimensional linear regression
    Mukherjee, Sumit
    Sen, Subhabrata
    JOURNAL OF MACHINE LEARNING RESEARCH, 2022, 23
  • [8] Variational Inference in high-dimensional linear regression
    Mukherjee, Sumit
    Sen, Subhabrata
    Journal of Machine Learning Research, 2022, 23
  • [9] Markov Neighborhood Regression for High-Dimensional Inference
    Liang, Faming
    Xue, Jingnan
    Jia, Bochao
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2022, 117 (539) : 1200 - 1214
  • [10] Bayesian Inference on High-Dimensional Multivariate Binary Responses
    Chakraborty, Antik
    Ou, Rihui
    Dunson, David B.
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2023,