Post-selection inference in regression models for group testing data

被引:0
|
作者
Shen, Qinyan [1 ]
Gregory, Karl [1 ]
Huang, Xianzheng [1 ]
机构
[1] Univ South Carolina, Dept Stat, 219 LeConte,1523 Greene St, Columbia, SC 29208 USA
关键词
confidence intervals; EM algorithm; individual testing; LASSO; variable selection; VALID CONFIDENCE-INTERVALS;
D O I
10.1093/biomtc/ujae101
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
We develop a methodology for valid inference after variable selection in logistic regression when the responses are partially observed, that is, when one observes a set of error-prone testing outcomes instead of the true values of the responses. Aiming at selecting important covariates while accounting for missing information in the response data, we apply the expectation-maximization algorithm to compute maximum likelihood estimators subject to LASSO penalization. Subsequent to variable selection, we make inferences on the selected covariate effects by extending post-selection inference methodology based on the polyhedral lemma. Empirical evidence from our extensive simulation study suggests that our post-selection inference results are more reliable than those from naive inference methods that use the same data to perform variable selection and inference without adjusting for variable selection.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] Modal regression statistical inference for longitudinal data semivarying coefficient models: Generalized estimating equations, empirical likelihood and variable selection
    Wang, Kangning
    Li, Shaomin
    Sun, Xiaofei
    Lin, Lu
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2019, 133 : 257 - 276
  • [42] Group inference in high dimensions with applications to hierarchical testing
    Guo, Zijian
    Renaux, Claude
    Buehlmann, Peter
    Cai, Tony
    ELECTRONIC JOURNAL OF STATISTICS, 2021, 15 (02): : 6633 - 6676
  • [43] Variable selection in finite mixture of semi-parametric regression models
    Ormoz, Ehsan
    Eskandari, Farzad
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2016, 45 (03) : 695 - 711
  • [44] Variable selection in functional regression models: A review
    Aneiros, German
    Novo, Silvia
    Vieu, Philippe
    JOURNAL OF MULTIVARIATE ANALYSIS, 2022, 188
  • [45] Variable selection in finite mixture of regression models
    Khalili, Abbas
    Chen, Jiahua
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2007, 102 (479) : 1025 - 1038
  • [46] Bayesian group selection in logistic regression with application to MRI data analysis
    Lee, Kyoungjae
    Cao Xuan
    BIOMETRICS, 2021, 77 (02) : 391 - 400
  • [47] Nonparametric Additive Regression for High-Dimensional Group Testing Data
    Zuo, Xinlei
    Ding, Juan
    Zhang, Junjian
    Xiong, Wenjun
    MATHEMATICS, 2024, 12 (05)
  • [48] Targeted Inference Involving High-Dimensional Data Using Nuisance Penalized Regression
    Sun, Qiang
    Zhang, Heping
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2021, 116 (535) : 1472 - 1486
  • [49] A numerical study on group quantile regression models
    Kim, Doyoen
    Jung, Yoonsuh
    COMMUNICATIONS FOR STATISTICAL APPLICATIONS AND METHODS, 2019, 26 (04) : 359 - 370
  • [50] Group inference for high-dimensional mediation models
    Yu, Ke
    Guo, Xu
    Luo, Shan
    STATISTICS AND COMPUTING, 2025, 35 (03)