Statistical Inference for High-Dimensional Generalized Linear Models With Binary Outcomes

被引:12
|
作者
Cai, T. Tony [1 ]
Guo, Zijian [2 ]
Ma, Rong [3 ]
机构
[1] Univ Penn, Wharton Sch, Dept Stat & Data Sci, Philadelphia, PA 19104 USA
[2] Rutgers State Univ, Dept Stat, Piscataway, NJ 08854 USA
[3] Stanford Univ, Dept Stat, Stanford, CA 94305 USA
关键词
Adaptivity; Confidence interval; Hypothesis testing; Link functions; Optimality; Weighting; CONFIDENCE-INTERVALS; REGRESSION; SELECTION; REGIONS; TESTS;
D O I
10.1080/01621459.2021.1990769
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
This article develops a unified statistical inference framework for high-dimensional binary generalized linear models (GLMs) with general link functions. Both unknown and known design distribution settings are considered. A two-step weighted bias-correction method is proposed for constructing confidence intervals (CIs) and simultaneous hypothesis tests for individual components of the regression vector. Minimax lower bound for the expected length is established and the proposed CIs are shown to be rate-optimal up to a logarithmic factor. The numerical performance of the proposed procedure is demonstrated through simulation studies and an analysis of a single cell RNA-seq dataset, which yields interesting biological insights that integrate well into the current literature on the cellular immune response mechanisms as characterized by single-cell transcriptomics. The theoretical analysis provides important insights on the adaptivity of optimal CIs with respect to the sparsity of the regression vector. New lower bound techniques are introduced and they can be of independent interest to solve other inference problems in high-dimensional binary GLMs.
引用
收藏
页码:1319 / 1332
页数:14
相关论文
共 50 条
  • [1] Markov neighborhood regression for statistical inference of high-dimensional generalized linear models
    Sun, Lizhe
    Liang, Faming
    STATISTICS IN MEDICINE, 2022, 41 (20) : 4057 - 4078
  • [2] High-Dimensional Inference for Generalized Linear Models with Hidden Confounding
    Ouyang, Jing
    Tan, Kean Ming
    Xu, Gongjun
    JOURNAL OF MACHINE LEARNING RESEARCH, 2023, 24
  • [3] Empirical Bayes inference in sparse high-dimensional generalized linear models
    Tang, Yiqi
    Martin, Ryan
    ELECTRONIC JOURNAL OF STATISTICS, 2024, 18 (02): : 3212 - 3246
  • [4] Online inference in high-dimensional generalized linear models with streaming data
    Luo, Lan
    Han, Ruijian
    Lin, Yuanyuan
    Huang, Jian
    ELECTRONIC JOURNAL OF STATISTICS, 2023, 17 (02): : 3443 - 3471
  • [5] Estimation and Inference for High-Dimensional Generalized Linear Models with Knowledge Transfer
    Li, Sai
    Zhang, Linjun
    Cai, T. Tony
    Li, Hongzhe
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2024, 119 (546) : 1274 - 1285
  • [6] Bias-Corrected Inference of High-Dimensional Generalized Linear Models
    Tang, Shengfei
    Shi, Yanmei
    Zhang, Qi
    MATHEMATICS, 2023, 11 (04)
  • [7] Simultaneous Inference for High-Dimensional Linear Models
    Zhang, Xianyang
    Cheng, Guang
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2017, 112 (518) : 757 - 768
  • [8] High-dimensional inference in misspecified linear models
    Buehlmann, Peter
    van de Geer, Sara
    ELECTRONIC JOURNAL OF STATISTICS, 2015, 9 (01): : 1449 - 1473
  • [9] AN ADAPTIVELY RESIZED PARAMETRIC BOOTSTRAP FOR INFERENCE IN HIGH-DIMENSIONAL GENERALIZED LINEAR MODELS
    Zhao, Qian
    Candes, Emmanuel J.
    STATISTICA SINICA, 2025, 35 (01) : 91 - 110
  • [10] STATISTICAL INFERENCE IN SPARSE HIGH-DIMENSIONAL ADDITIVE MODELS
    Gregory, Karl
    Mammen, Enno
    Wahl, Martin
    ANNALS OF STATISTICS, 2021, 49 (03): : 1514 - 1536