TWO-SAMPLE TESTING OF HIGH-DIMENSIONAL LINEAR REGRESSION COEFFICIENTS VIA COMPLEMENTARY SKETCHING

Cited by: 0
Authors
Gao, Fengnan [1]
Wang, Tengyao [2]
Affiliations
[1] Fudan University, Shanghai Center for Mathematical Sciences, School of Data Science, Shanghai, People's Republic of China
[2] London School of Economics, Department of Statistics, London, England
Funding
Engineering and Physical Sciences Research Council (UK)
Keywords
Two-sample hypotheses testing; high-dimensional data; linear model; sparsity; minimax detection; ANOVA
DOI
10.1214/22-AOS2216
Chinese Library Classification
O21 [Probability theory and mathematical statistics]; C8 [Statistics]
Discipline classification codes
020208; 070103; 0714
Abstract
We introduce a new method for two-sample testing of high-dimensional linear regression coefficients without assuming that those coefficients are individually estimable. The procedure works by first projecting the matrices of covariates and response vectors along directions that are complementary in sign in a subset of the coordinates, a process which we call "complementary sketching." The resulting projected covariates and responses are aggregated to form two test statistics, which are shown to have essentially optimal asymptotic power under a Gaussian design when the difference between the two regression coefficients is sparse and dense respectively. Simulations confirm that our methods perform well in a broad class of settings and an application to a large single-cell RNA sequencing dataset demonstrates its utility in the real world.
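The projection step described in the abstract can be illustrated with a minimal sketch. It assumes one particular reading that this record does not spell out: the two design matrices are stacked, and the sketching directions are taken as an orthonormal basis of the orthogonal complement of the stacked design's column space, which requires the total sample size to exceed the dimension and the stacked design to have full column rank. The function name `complementary_sketch` and the symbols `A`, `W`, `Z` are hypothetical labels chosen for illustration. The aggregation into the two test statistics is not shown, because the abstract gives no formula for them.

```python
import numpy as np

def complementary_sketch(X1, Y1, X2, Y2):
    """Illustrative projection step (assumed construction, not the paper's code).

    Finds A = [A1; A2] with orthonormal columns such that
    A1.T @ X1 + A2.T @ X2 = 0, then returns the sketched pair (W, Z).
    """
    n1, p = X1.shape
    X = np.vstack([X1, X2])          # stacked design, shape (n1 + n2, p)
    n = X.shape[0]
    if n <= p:
        raise ValueError("this sketch assumes n1 + n2 > p")

    # Complete QR: the first p columns of Q span the column space of X
    # (assuming full column rank), so the remaining n - p columns are
    # orthogonal to every column of X.
    Q, _ = np.linalg.qr(X, mode="complete")
    A = Q[:, p:]                     # shape (n, n - p)
    A1, A2 = A[:n1], A[n1:]

    W = A1.T @ X1                    # equals -(A2.T @ X2) by construction
    Z = A1.T @ Y1 + A2.T @ Y2        # projected responses
    return W, Z
```

Under this construction, noise-free responses `Y1 = X1 @ beta1` and `Y2 = X2 @ beta2` give `Z = W @ (beta1 - beta2)`, so the sketched data depend on the two coefficient vectors only through their difference. That is consistent with the abstract's claim that neither vector needs to be individually estimable.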
Pages: 2950-2972
Number of pages: 23