Analysis of Testing-Based Forward Model Selection

Cited by: 7
Author
Kozbur, Damian [1]
Affiliation
[1] Univ Zurich, Dept Econ, Zurich, Switzerland
Keywords
Model selection; forward regression; sparsity; hypothesis testing; VARIABLE SELECTION; CONFIDENCE-INTERVALS; LEAST-SQUARES; REGRESSION; HETEROSKEDASTICITY; INFERENCE; LASSO; TIME
DOI
10.3982/ECTA16273
Chinese Library Classification
F [Economics]
Discipline classification code
02
Abstract
This paper analyzes a procedure called Testing-Based Forward Model Selection (TBFMS) in linear regression problems. This procedure inductively selects covariates that add predictive power into a working statistical model before estimating a final regression. The criterion for deciding which covariate to include next and when to stop including covariates is derived from a profile of traditional statistical hypothesis tests. This paper proves probabilistic bounds, which depend on the quality of the tests, for prediction error and the number of selected covariates. As an example, the bounds are then specialized to a case with heteroscedastic data, with tests constructed with the help of Huber-Eicker-White standard errors. Under the assumed regularity conditions, these tests lead to estimation convergence rates matching other common high-dimensional estimators including Lasso.
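The selection loop described in the abstract can be illustrated with a minimal sketch. It is not the paper's exact procedure: it assumes each candidate covariate is tested with a heteroskedasticity-robust (HC0, Huber-Eicker-White) t-statistic, that the covariate with the largest absolute statistic is added, and that selection stops once no statistic exceeds a fixed threshold. The names `robust_t_stat`, `forward_select`, and `threshold`, the threshold value, and the simulated data are all hypothetical choices made for illustration.

```python
import numpy as np

def robust_t_stat(X, y, col):
    """HC0 (White) t-statistic for coefficient `col` in an OLS fit of y on X."""
    XtX_inv = np.linalg.inv(X.T @ X)
    beta = XtX_inv @ X.T @ y
    resid = y - X @ beta
    # Sandwich covariance: (X'X)^-1 X' diag(e^2) X (X'X)^-1
    meat = X.T @ (X * resid[:, None] ** 2)
    cov = XtX_inv @ meat @ XtX_inv
    return beta[col] / np.sqrt(cov[col, col])

def forward_select(X, y, threshold, max_steps=None):
    """Greedy forward selection driven by robust t-tests (illustrative only).

    No intercept is fitted, so X and y are assumed to be centered.
    """
    n, p = X.shape
    max_steps = p if max_steps is None else max_steps
    selected = []
    while len(selected) < max_steps:
        best_j, best_t = None, 0.0
        for j in range(p):
            if j in selected:
                continue
            # Test candidate j in the model that also holds the selected set.
            Xs = X[:, selected + [j]]
            t = abs(robust_t_stat(Xs, y, len(selected)))
            if t > best_t:
                best_j, best_t = j, t
        # Stop when no remaining covariate passes the test.
        if best_j is None or best_t < threshold:
            break
        selected.append(best_j)
    # Final regression on the selected covariates only.
    beta = np.zeros(p)
    if selected:
        beta[selected] = np.linalg.lstsq(X[:, selected], y, rcond=None)[0]
    return selected, beta

# Simulated sparse design with heteroskedastic noise (assumed for the demo).
rng = np.random.default_rng(0)
n, p = 1000, 20
X = rng.standard_normal((n, p))
noise = (0.5 + np.abs(X[:, 0])) * rng.standard_normal(n)
y = 2.0 * X[:, 0] - 1.5 * X[:, 3] + noise
selected, beta = forward_select(X, y, threshold=4.0)
print("selected covariates:", selected)
```

The threshold plays the role of the test's critical value. The sketch fixes it by hand, whereas the abstract states that the paper's bounds depend on the quality of the tests, so the value used here should not be read as the paper's choice.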
Pages: 2147-2173
Page count: 27
Related papers
50 in total
  • [31] Sparse support vector regression based on orthogonal forward selection for the generalised kernel model
    Wang, X. X.
    Chen, S.
    Lowe, D.
    Harris, C. J.
    NEUROCOMPUTING, 2006, 70 (1-3) : 462 - 474
  • [32] Model selection, hypothesis testing, and risks of condemning analytical tools
    Steidl, Robert J.
    JOURNAL OF WILDLIFE MANAGEMENT, 2006, 70 (06) : 1497 - 1498
  • [33] Model selection techniques for sparse weight-based principal component analysis
    de Schipper, Niek C.
    Van Deun, Katrijn
    JOURNAL OF CHEMOMETRICS, 2021, 35 (02)
  • [34] Asymptotically Uniform Tests After Consistent Model Selection in the Linear Regression Model
    McCloskey, Adam
    JOURNAL OF BUSINESS & ECONOMIC STATISTICS, 2020, 38 (04) : 810 - 825
  • [35] HDSI: High dimensional selection with interactions algorithm on feature selection and testing
    Jain, Rahi
    Xu, Wei
    PLOS ONE, 2021, 16 (02)
  • [36] Variable selection in model-based clustering and discriminant analysis with a regularization approach
    Celeux, Gilles
    Maugis-Rabusseau, Cathy
    Sedki, Mohammed
    ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2019, 13 (01) : 259 - 278
  • [37] Variable selection in model-based clustering and discriminant analysis with a regularization approach
    Gilles Celeux
    Cathy Maugis-Rabusseau
    Mohammed Sedki
    Advances in Data Analysis and Classification, 2019, 13 : 259 - 278
  • [38] Compact Face Representation via Forward Model Selection
    Shao, Weiyuan
    Wang, Hong
    Zheng, Yingbin
    Ye, Hao
    BIOMETRIC RECOGNITION, 2016, 9967 : 112 - 120
  • [39] Model population analysis for variable selection
    Li, Hong-Dong
    Liang, Yi-Zeng
    Xu, Qing-Song
    Cao, Dong-Sheng
    JOURNAL OF CHEMOMETRICS, 2010, 24 (7-8) : 418 - 423
  • [40] ADAPTIVE SEMI-VARYING COEFFICIENT MODEL SELECTION
    Hu, Tao
    Xia, Yingcun
    STATISTICA SINICA, 2012, 22 (02) : 575 - 599