TWO-SAMPLE TESTING OF HIGH-DIMENSIONAL LINEAR REGRESSION COEFFICIENTS VIA COMPLEMENTARY SKETCHING

Cited by: 0
Authors
Gao, Fengnan [1 ]
Wang, Tengyao [2 ]
Affiliations
[1] Fudan Univ, Shanghai Ctr Math Sci, Sch Data Sci, Shanghai, Peoples R China
[2] London Sch Econ, Dept Stat, London, England
Funding
Engineering and Physical Sciences Research Council (UK);
Keywords
Two-sample hypotheses testing; high-dimensional data; linear model; sparsity; minimax detection; ANOVA;
DOI
10.1214/22-AOS2216
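Chinese Library Classification number
O21 [Probability theory and mathematical statistics]; C8 [Statistics];
Subject classification codes
020208 ; 070103 ; 0714 ;
Abstract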
We introduce a new method for two-sample testing of high-dimensional linear regression coefficients without assuming that those coefficients are individually estimable. The procedure works by first projecting the matrices of covariates and response vectors along directions that are complementary in sign in a subset of the coordinates, a process which we call "complementary sketching." The resulting projected covariates and responses are aggregated to form two test statistics, which are shown to have essentially optimal asymptotic power under a Gaussian design when the difference between the two regression coefficients is sparse and dense respectively. Simulations confirm that our methods perform well in a broad class of settings and an application to a large single-cell RNA sequencing dataset demonstrates its utility in the real world.
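The abstract does not spell out how the sign-complementary projection directions are built. The following is a minimal sketch of one construction consistent with the description, assuming that the total sample size n1 + n2 exceeds the dimension p and that the directions are taken as an orthonormal basis of the orthogonal complement of the column space of the stacked design matrix. The function name `complementary_sketch` and all variable names are hypothetical, and the aggregation into the two test statistics is not shown.

```python
import numpy as np


def complementary_sketch(X1, Y1, X2, Y2):
    """Project two regression samples along sign-complementary directions.

    Assumed construction (not stated in the abstract): the columns of A
    form an orthonormal basis of the orthogonal complement of col(X),
    where X stacks X1 on top of X2. Splitting A row-wise into A1 and A2
    gives A1.T @ X1 = -(A2.T @ X2).
    """
    n1, p = X1.shape
    X = np.vstack([X1, X2])
    n = X.shape[0]
    if n <= p:
        raise ValueError("this sketch needs n1 + n2 > p")

    # Complete QR: the last n - p columns of Q are orthogonal to col(X).
    Q, _ = np.linalg.qr(X, mode="complete")
    A = Q[:, p:]
    A1, A2 = A[:n1], A[n1:]

    W = A1.T @ X1              # projected covariates, shape (n - p, p)
    Z = A1.T @ Y1 + A2.T @ Y2  # projected responses, shape (n - p,)
    return W, Z
```

Under this construction, with Y1 = X1 beta1 + noise and Y2 = X2 beta2 + noise, the projected response satisfies Z = W (beta1 - beta2) + projected noise, so the two-sample problem reduces to a one-sample problem in the difference of the coefficients, and neither beta1 nor beta2 has to be estimated individually.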
Pages: 2950 - 2972
Number of pages: 23
Related papers
50 in total
  • [31] Group Transfer Learning for High-Dimensional Linear Regression
    Chen, Chen
    Xu, Dawei
    Ding, Juan
    Zhang, Junjian
    Xiong, Wenjun
    STAT, 2025, 14 (01):
  • [32] Empirical likelihood for high-dimensional linear regression models
    Guo, Hong
    Zou, Changliang
    Wang, Zhaojun
    Chen, Bin
    METRIKA, 2014, 77 (07) : 921 - 945
  • [33] HYPOTHESIS TESTING FOR HIGH-DIMENSIONAL SPARSE BINARY REGRESSION
    Mukherjee, Rajarshi
    Pillai, Natesh S.
    Lin, Xihong
    ANNALS OF STATISTICS, 2015, 43 (01) : 352 - 381
  • [34] Empirical likelihood for high-dimensional linear regression models
    Hong Guo
    Changliang Zou
    Zhaojun Wang
    Bin Chen
    Metrika, 2014, 77 : 921 - 945
  • [35] Testing High-Dimensional Linear Asset Pricing Models
    Lan, Wei
    Feng, Long
    Luo, Ronghua
    JOURNAL OF FINANCIAL ECONOMETRICS, 2018, 16 (02) : 191 - 210
  • [36] An approximate randomization test for the high-dimensional two-sample Behrens-Fisher problem under arbitrary covariances
    Wang, Rui
    Xu, Wangli
    BIOMETRIKA, 2022, 109 (04) : 1117 - 1132
  • [37] Variational Bayes for High-Dimensional Linear Regression With Sparse Priors
    Ray, Kolyan
    Szabo, Botond
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2022, 117 (539) : 1270 - 1281
  • [38] Shrinkage Ridge Regression Estimators in High-Dimensional Linear Models
    Yuzbasi, Bahadir
    Ahmed, S. Ejaz
    PROCEEDINGS OF THE NINTH INTERNATIONAL CONFERENCE ON MANAGEMENT SCIENCE AND ENGINEERING MANAGEMENT, 2015, 362 : 793 - 807
  • [39] The likelihood ratio test for high-dimensional linear regression model
    Xie, Junshan
    Xiao, Nannan
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2017, 46 (17) : 8479 - 8492
  • [40] A Further Study on Chen–Qin’s Test for Two-Sample Behrens–Fisher Problems for High-Dimensional Data
    Jin-Ting Zhang
    Tianming Zhu
    Journal of Statistical Theory and Practice, 2022, 16