Post-Selection Inference for Generalized Linear Models With Many Controls

被引:92
作者
Belloni, Alexandre [1 ]
Chernozhukov, Victor [2 ]
Wei, Ying [3 ]
机构
[1] Duke Univ, Fuqua Sch Business, Durham, NC 27708 USA
[2] MIT, Dept Econ, Cambridge, MA 02139 USA
[3] Columbia Univ, Dept Biostat, New York, NY 10032 USA
关键词
Double selection; Instruments; Model selection; Neymanization; Optimality; Sparsity; Uniformly valid inference; LOGISTIC-REGRESSION; LASSO; ESTIMATORS;
D O I
10.1080/07350015.2016.1166116
中图分类号
F [经济];
学科分类号
02 ;
摘要
This article considers generalized linear models in the presence of many controls. We lay out a general methodology to estimate an effect of interest based on the construction of an instrument that immunizes against model selection mistakes and apply it to the case of logistic binary choice model. More specifically we propose new methods for estimating and constructing confidence regions for a regression parameter of primary interest alpha(0), a parameter in front of the regressor of interest, such as the treatment variable or a policy variable. These methods allow to estimate alpha(0) at the root-n rate when the total number p of other regressors, called controls, potentially exceeds the sample size n using sparsity assumptions. The sparsity assumption means that there is a subset of son controls, which suffices to accurately approximate the nuisance part of the regression function. Importantly, the estimators and these resulting confidence regions are valid uniformly over s-sparse models satisfying s(2) log(2) p = o(n) and other technical conditions. These procedures do not rely on traditional consistent model selection arguments for their validity. In fact, they are robust with respect to moderate model selection mistakes in variable selection. Under suitable conditions, the estimators are semi-parametrically efficient in the sense of attaining the semi-parametric efficiency bounds for the class of models in this article.
引用
收藏
页码:606 / 619
页数:14
相关论文
共 31 条
[1]  
[Anonymous], 2010, LASSO METHODS GAUSSI
[2]  
[Anonymous], 2014, REV ECON STUD, V81, P608
[3]  
[Anonymous], 2015, BIOMETRIKA, V102, P77
[4]  
[Anonymous], 2006, ANN STAT, V34, P2554
[5]   Self-concordant analysis for logistic regression [J].
Bach, Francis .
ELECTRONIC JOURNAL OF STATISTICS, 2010, 4 :384-414
[6]   Sparse Models and Methods for Optimal Instruments With an Application to Eminent Domain [J].
Belloni, A. ;
Chen, D. ;
Chernozhukov, V. ;
Hansen, C. .
ECONOMETRICA, 2012, 80 (06) :2369-2429
[7]  
Belloni A., 2013, ECONOMETRIC IN PRESS
[8]  
Belloni A., 2013, ADV EC ECONOMETRICS, V3
[9]  
Belloni A., 2013, ARXIV13127186V1
[10]  
Belloni A, 2015, ARXIV151207619