TWO-SAMPLE TESTING OF HIGH-DIMENSIONAL LINEAR REGRESSION COEFFICIENTS VIA COMPLEMENTARY SKETCHING

Cited by: 0
Authors
Gao, Fengnan [1 ]
Wang, Tengyao [2 ]
Affiliations
[1] Fudan Univ, Shanghai Ctr Math Sci, Sch Data Sci, Shanghai, Peoples R China
[2] London Sch Econ, Dept Stat, London, England
Funding
Engineering and Physical Sciences Research Council (EPSRC), UK;
Keywords
Two-sample hypotheses testing; high-dimensional data; linear model; sparsity; minimax detection; ANOVA;
DOI
10.1214/22-AOS2216
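Chinese Library Classification number
O21 [Probability Theory and Mathematical Statistics]; C8 [Statistics];
Subject classification codes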
020208 ; 070103 ; 0714 ;
Abstract
We introduce a new method for two-sample testing of high-dimensional linear regression coefficients without assuming that those coefficients are individually estimable. The procedure works by first projecting the matrices of covariates and response vectors along directions that are complementary in sign in a subset of the coordinates, a process which we call "complementary sketching." The resulting projected covariates and responses are aggregated to form two test statistics, which are shown to have essentially optimal asymptotic power under a Gaussian design when the difference between the two regression coefficients is sparse and dense respectively. Simulations confirm that our methods perform well in a broad class of settings, and an application to a large single-cell RNA sequencing dataset demonstrates their utility in the real world.
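The projection step described in the abstract can be illustrated with a minimal sketch. It rests on assumptions not stated in this record: the two samples follow Y1 = X1 β1 + noise and Y2 = X2 β2 + noise with the same p covariates, the combined sample size n1 + n2 exceeds p, and the projection matrix A is taken as an orthonormal basis of the orthogonal complement of the column space of the stacked design. The function name `complementary_sketch` and all variable names are hypothetical. The aggregation of the sketched data into the sparse and dense test statistics is not reproduced here.

```python
import numpy as np

def complementary_sketch(X1, Y1, X2, Y2):
    """Illustrative sketching step (hypothetical helper, not the authors' code).

    Returns (W, Z) such that, in the assumed model, Z = W (beta1 - beta2) + noise.
    """
    n1, p = X1.shape
    X = np.vstack([X1, X2])              # stacked design, shape (n1 + n2, p)
    n = X.shape[0]
    if n <= p:
        raise ValueError("this sketch assumes n1 + n2 > p")
    # Complete QR: the last n - p columns of Q are orthogonal to col(X).
    Q, _ = np.linalg.qr(X, mode="complete")
    A = Q[:, p:]                         # shape (n, n - p), orthonormal columns
    A1, A2 = A[:n1], A[n1:]              # split rows to match the two samples
    # Since A.T @ X = 0, we get A1.T @ X1 = -(A2.T @ X2): complementary in sign.
    W = A1.T @ X1                        # sketched covariates
    Z = A1.T @ Y1 + A2.T @ Y2            # sketched responses
    return W, Z
```

Under these assumptions, testing β1 = β2 reduces to testing whether the coefficient vector in the single regression of Z on W is zero, which is possible even when n1 and n2 are each smaller than p.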
Pages: 2950-2972
Page count: 23
Related papers
50 in total
  • [1] TWO-SAMPLE TESTS FOR HIGH-DIMENSIONAL LINEAR REGRESSION WITH AN APPLICATION TO DETECTING INTERACTIONS
    Xia, Yin
    Cai, Tianxi
    Cai, T. Tony
    STATISTICA SINICA, 2018, 28 (01) : 63 - 92
  • [2] Two-sample high-dimensional empirical likelihood
    Fang, Jianglin
    Liu, Wanrong
    Lu, Xuewen
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2017, 46 (13) : 6323 - 6335
  • [3] Order test for high-dimensional two-sample means
    Lee, Sang H.
    Lim, Johan
    Li, Erning
    Vannucci, Marina
    Petkova, Eva
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2012, 142 (09) : 2719 - 2725
  • [4] A Robust High-Dimensional Test for Two-Sample Comparisons
    Bulut, Hasan
    Iftikhar, Soofia
    Faiz, Nosheen
    Albalawi, Olayan
    AXIOMS, 2024, 13 (09)
  • [5] Two-Sample Covariance Matrix Testing and Support Recovery in High-Dimensional and Sparse Settings
    Cai, Tony
    Liu, Weidong
    Xia, Yin
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2013, 108 (501) : 265 - 277
  • [6] Empirical likelihood test for high-dimensional two-sample model
    Ciuperca, Gabriela
    Salloum, Zahraa
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2016, 178 : 37 - 60
  • [7] TWO-SAMPLE BEHRENS-FISHER PROBLEM FOR HIGH-DIMENSIONAL DATA
    Feng, Long
    Zou, Changliang
    Wang, Zhaojun
    Zhu, Lixing
    STATISTICA SINICA, 2015, 25 (04) : 1297 - 1312
  • [8] A high-dimensional two-sample test for the mean using random subspaces
    Thulin, Mans
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2014, 74 : 26 - 38
  • [9] Two-sample mean vector projection test in high-dimensional data
    Huang, Caizhu
    Cui, Xia
    Pagui, Euloge Clovis Kenne
    COMPUTATIONAL STATISTICS, 2024, 39 (03) : 1061 - 1091