Analysis of Testing-Based Forward Model Selection

被引:7
|
作者
Kozbur, Damian [1 ]
机构
[1] Univ Zurich, Dept Econ, Zurich, Switzerland
关键词
Model selection; forward regression; sparsity; hypothesis testing; VARIABLE SELECTION; CONFIDENCE-INTERVALS; LEAST-SQUARES; REGRESSION; HETEROSKEDASTICITY; INFERENCE; LASSO; TIME;
D O I
10.3982/ECTA16273
中图分类号
F [经济];
学科分类号
02 ;
摘要
This paper analyzes a procedure called Testing-Based Forward Model Selection (TBFMS) in linear regression problems. This procedure inductively selects covariates that add predictive power into a working statistical model before estimating a final regression. The criterion for deciding which covariate to include next and when to stop including covariates is derived from a profile of traditional statistical hypothesis tests. This paper proves probabilistic bounds, which depend on the quality of the tests, for prediction error and the number of selected covariates. As an example, the bounds are then specialized to a case with heteroscedastic data, with tests constructed with the help of Huber-Eicker-White standard errors. Under the assumed regularity conditions, these tests lead to estimation convergence rates matching other common high-dimensional estimators including Lasso.
引用
收藏
页码:2147 / 2173
页数:27
相关论文
共 50 条
  • [1] Model selection based on combined penalties for biomarker identification
    Vradi, Eleni
    Brannath, Werner
    Jaki, Thomas
    Vonk, Richardus
    JOURNAL OF BIOPHARMACEUTICAL STATISTICS, 2018, 28 (04) : 735 - 749
  • [2] High-dimensional linear model selection motivated by multiple testing
    Furmanczyk, Konrad
    Rejchel, Wojciech
    STATISTICS, 2020, 54 (01) : 152 - 166
  • [3] Thresholding-based iterative selection procedures for model selection and shrinkage
    She, Yiyuan
    ELECTRONIC JOURNAL OF STATISTICS, 2009, 3 : 384 - 415
  • [4] Forward stability and model path selection
    Kissel, Nicholas
    Mentch, Lucas
    STATISTICS AND COMPUTING, 2024, 34 (02)
  • [5] CONFIDENCE SETS FOR MODEL SELECTION BY F-TESTING
    Ferrari, Davide
    Yang, Yuhong
    STATISTICA SINICA, 2015, 25 (04) : 1637 - 1658
  • [6] Pitfalls of post-model-selection testing: experimental quantification
    Demetrescu, Matei
    Hassler, Uwe
    Kuzin, Vladimir
    EMPIRICAL ECONOMICS, 2011, 40 (02) : 359 - 372
  • [7] Hypothesis testing: a model selection approach
    Cubedo, M
    Oller, JM
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2002, 108 (1-2) : 3 - 21
  • [8] A Model Selection Method for Nonlinear System Identification Based fMRI Effective Connectivity Analysis
    Li, Xingfeng
    Coyle, Damien
    Maguire, Liam
    McGinnity, Thomas M.
    Benali, Habib
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2011, 30 (07) : 1365 - 1380
  • [9] Marginal Likelihood Computation for Model Selection and Hypothesis Testing: An Extensive Review
    Llorente, F.
    Martino, L.
    Delgado, D.
    Lopez-Santiago, J.
    SIAM REVIEW, 2023, 65 (01) : 3 - 58
  • [10] On Tuning Parameter Selection in Model Selection and Model Averaging: A Monte Carlo Study
    Xiao, Hui
    Sun, Yiguo
    JOURNAL OF RISK AND FINANCIAL MANAGEMENT, 2019, 12 (03)