Group inference of high-dimensional single-index models

被引:0
作者
Han, Dongxiao [1 ,2 ]
Han, Miao [3 ]
Hao, Meiling [4 ]
Sun, Liuquan [5 ,6 ]
Wang, Siyang [7 ]
机构
[1] Nankai Univ, Sch Stat & Data Sci, KLMDASR, LEBPS, Tianjin, Peoples R China
[2] Nankai Univ, LPMC, Tianjin, Peoples R China
[3] Shanghai Univ Finance & Econ, Sch Stat & Management, Shanghai, Peoples R China
[4] Univ Int Business & Econ, Sch Stat, Beijing, Peoples R China
[5] Chinese Acad Sci, Acad Math & Syst Sci, Beijing, Peoples R China
[6] Univ Chinese Acad Sci, Sch Math Sci, Beijing, Peoples R China
[7] China Minsheng Bank Corp Ltd, Dept Asset Management, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Group inference; high dimension; hypothesis testing; single-index models; semi-supervised learning; CONFIDENCE-INTERVALS; REGRESSION; EFFICIENT;
D O I
10.1080/10485252.2024.2371524
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
For the supervised and semi-supervised settings, a group inference method is proposed for regression parameters in high-dimensional semi-parametric single-index models with an unknown random link function. The inference procedure is based on least squares, which can be extended to other general convex loss functions. The proposed test statistics are weighted quadratic forms of the regression parameter estimates, in which the weight could be a non-random matrix or the sample covariance matrix of the covariates. The proposed method could detect dense but weak signals and deal with high correlation of covariates inside the group. A 'contaminated test statistic' is established in the semi-supervised regime to decrease the variance. The asymptotic properties of the resulting estimators are established. The finite-sample behaviour of the proposed method is evaluated through extensive simulation studies. Applications to two genomic datasets are provided.
引用
收藏
页数:20
相关论文
共 40 条
  • [1] Semisupervised inference for explained variance in high dimensional linear regression and its applications
    Cai, T. Tony
    Guo, Zijian
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2020, 82 (02) : 391 - 419
  • [2] A Constrained l1 Minimization Approach to Sparse Precision Matrix Estimation
    Cai, Tony
    Liu, Weidong
    Luo, Xi
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2011, 106 (494) : 594 - 607
  • [3] ON THE THEORY OF ELLIPTICALLY CONTOURED DISTRIBUTIONS
    CAMBANIS, S
    HUANG, S
    SIMONS, G
    [J]. JOURNAL OF MULTIVARIATE ANALYSIS, 1981, 11 (03) : 368 - 385
  • [4] Generalized partially linear single-index models
    Carroll, RJ
    Fan, JQ
    Gijbels, I
    Wand, MP
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1997, 92 (438) : 477 - 489
  • [5] EFFICIENT AND ADAPTIVE LINEAR REGRESSION IN SEMI-SUPERVISED SETTINGS
    Chakrabortty, Abhishek
    Cai, Tianxi
    [J]. ANNALS OF STATISTICS, 2018, 46 (04) : 1541 - 1572
  • [6] Chen S, 2017, PR MACH LEARN RES, V70
  • [7] CENTRAL LIMIT THEOREMS AND BOOTSTRAP IN HIGH DIMENSIONS
    Chernozhukov, Victor
    Chetverikov, Denis
    Kato, Kengo
    [J]. ANNALS OF PROBABILITY, 2017, 45 (04) : 2309 - 2352
  • [8] High-dimensional simultaneous inference with the bootstrap
    Dezeure, Ruben
    Buhlmann, Peter
    Zhang, Cun-Hui
    [J]. TEST, 2017, 26 (04) : 685 - 719
  • [9] Eftekhari H, 2021, J MACH LEARN RES, V22
  • [10] FarmTest: Factor-Adjusted Robust Multiple Testing With Approximate False Discovery Control
    Fan, Jianqing
    Ke, Yuan
    Sun, Qiang
    Zhou, Wen-Xin
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2019, 114 (528) : 1880 - 1893