Inference on Treatment Effects after Selection among High-Dimensional ControlsaEuro

被引:706
|
作者
Belloni, Alexandre [1 ]
Chernozhukov, Victor [2 ]
Hansen, Christian [3 ]
机构
[1] Duke Univ, Durham, NC 27706 USA
[2] MIT, Cambridge, MA 02139 USA
[3] Univ Chicago, Chicago, IL 60637 USA
来源
REVIEW OF ECONOMIC STUDIES | 2014年 / 81卷 / 02期
基金
美国国家科学基金会;
关键词
Treatment effects; Partially linear model; High-dimensional-sparse regression; Inference under imperfect model selection; Uniformly valid inference after model selection; Average treatment effects; Lasso; Orthogonality of estimating equations with respect to nuisance parameters; EFFICIENT SEMIPARAMETRIC ESTIMATION; LEGALIZED ABORTION; MODEL-SELECTION; VARIABLE SELECTION; REGRESSION; ESTIMATORS; MOMENT; IMPACT; LASSO; CRIME;
D O I
10.1093/restud/rdt044
中图分类号
F [经济];
学科分类号
02 ;
摘要
We propose robust methods for inference about the effect of a treatment variable on a scalar outcome in the presence of very many regressors in a model with possibly non-Gaussian and heteroscedastic disturbances. We allow for the number of regressors to be larger than the sample size. To make informative inference feasible, we require the model to be approximately sparse; that is, we require that the effect of confounding factors can be controlled for up to a small approximation error by including a relatively small number of variables whose identities are unknown. The latter condition makes it possible to estimate the treatment effect by selecting approximately the right set of regressors. We develop a novel estimation and uniformly valid inference method for the treatment effect in this setting, called the "post-double-selection" method. The main attractive feature of our method is that it allows for imperfect selection of the controls and provides confidence intervals that are valid uniformly across a large class of models. In contrast, standard post-model selection estimators fail to provide uniform inference even in simple cases with a small, fixed number of controls. Thus, our method resolves the problem of uniform inference after model selection for a large, interesting class of models. We also present a generalization of our method to a fully heterogeneous model with a binary treatment variable. We illustrate the use of the developed methods with numerical simulations and an application that considers the effect of abortion on crime rates.
引用
收藏
页码:608 / 650
页数:43
相关论文
共 50 条
  • [21] An Information-Theoretic Approach to Universal Feature Selection in High-Dimensional Inference
    Huang, Shao-Lun
    Makur, Anuran
    Zheng, Lizhong
    Wornell, Gregory W.
    2017 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY (ISIT), 2017, : 1336 - 1340
  • [22] Post-selection inference for high-dimensional mediation analysis with survival outcomes
    Huang, Tzu-Jung
    Liu, Zhonghua
    Mckeague, Ian W.
    SCANDINAVIAN JOURNAL OF STATISTICS, 2025,
  • [23] MODEL-ASSISTED INFERENCE FOR COVARIATE-SPECIFIC TREATMENT EFFECTS WITH HIGH-DIMENSIONAL DATA
    Wu, Peng
    Tan, Zhiqiang
    Hu, Wenjie
    Zhou, Xiao-Hua
    STATISTICA SINICA, 2024, 34 (01) : 459 - 479
  • [24] High-dimensional model-assisted inference for treatment effects with multi-valued treatments
    Xu, Wenfu
    Tan, Zhiqiang
    JOURNAL OF ECONOMETRICS, 2024, 244 (01)
  • [25] High-Dimensional Model-Assisted Inference for Local Average Treatment Effects With Instrumental Variables
    Sun, Baoluo
    Tan, Zhiqiang
    JOURNAL OF BUSINESS & ECONOMIC STATISTICS, 2022, 40 (04) : 1732 - 1744
  • [26] Inference for treatment effect parameters in potentially misspecified high-dimensional models
    Dukes, Oliver
    Vansteelandt, Stijn
    BIOMETRIKA, 2021, 108 (02) : 321 - 334
  • [27] Covariate Selection in High-Dimensional Propensity Score Analyses of Treatment Effects in Small Samples
    Rassen, Jeremy A.
    Glynn, Robert J.
    Brookhart, M. Alan
    Schneeweiss, Sebastian
    AMERICAN JOURNAL OF EPIDEMIOLOGY, 2011, 173 (12) : 1404 - 1413
  • [28] HIGH-DIMENSIONAL VARIABLE SELECTION
    Wasserman, Larry
    Roeder, Kathryn
    ANNALS OF STATISTICS, 2009, 37 (5A): : 2178 - 2201
  • [29] Oracle inequalities, variable selection and uniform inference in high-dimensional correlated random effects panel data models
    Kock, Anders Bredahl
    JOURNAL OF ECONOMETRICS, 2016, 195 (01) : 71 - 85
  • [30] Inference and Estimation for Random Effects in High-Dimensional Linear Mixed Models
    Law, Michael
    Ritov, Ya'acov
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2023, 118 (543) : 1682 - 1691