Model selection and inference for estimation of causal parameters

被引:0
|
作者
Rothenhausler, Dominik [1 ]
机构
[1] Stanford Univ, Dept Stat, Stanford, CA 94305 USA
来源
ELECTRONIC JOURNAL OF STATISTICS | 2024年 / 18卷 / 02期
关键词
Causal inference; model selection; data fusion; efficiency; CROSS-VALIDATION;
D O I
10.1214/24-EJS2308
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In causal inference there are often multiple reasonable estimators for a given target quantity. For example, one may reasonably use inverse probability weighting, an instrumental variables approach, or construct an estimate based on proxy outcomes if the actual outcome is difficult to measure. Ideally, the practitioner decides on an estimator before looking at the data. However, this might be challenging in practice since a priori it might not be clear to a practitioner how to choose the method. If the final model is chosen after peeking at the data, naive inferential procedures may fail. This raises the need for a model selection tool, with rigorous asymptotic guarantees. Since there is usually no loss function available in causal inference, standard model selection techniques do not apply. We propose a model selection procedure that estimates the squared pound 2- deviation of a finite-dimensional estimator from its target. The procedure relies on knowing an asymptotically unbiased (potentially highly variable) estimate of the parameter of interest. The resulting estimator is discontinuous and does not have a Gaussian limit distribution. Thus, standard asymptotic expansions do not apply. We derive asymptotically valid confidence intervals for low-dimensional settings that take into account the model selection step. The performance of the approach for estimation and inference for average treatment effects is evaluated on simulated data sets in low-dimensional settings, including experimental data, instrumental variables settings and observational data with selection on observables.
引用
收藏
页码:5449 / 5483
页数:35
相关论文
共 50 条
  • [1] On model selection and model misspecification in causal inference
    Vansteelandt, Stijn
    Bekaert, Maarten
    Claeskens, Gerda
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2012, 21 (01) : 7 - 30
  • [2] A biologist's guide to model selection and causal inference
    Laubach, Zachary M.
    Murray, Eleanor J.
    Hoke, Kim L.
    Safran, Rebecca J.
    Perng, Wei
    PROCEEDINGS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2021, 288 (1943)
  • [3] An Alternative Doubly Robust Estimation in Causal Inference Model
    Wei, Shaojie
    Li, Gaorong
    Zhang, Zhongzhan
    COMMUNICATIONS IN MATHEMATICS AND STATISTICS, 2024, 12 (04) : 659 - 678
  • [4] THE ROLE OF MODEL SELECTION IN CAUSAL INFERENCE FROM NONEXPERIMENTAL DATA
    ROBINS, JM
    GREENLAND, S
    AMERICAN JOURNAL OF EPIDEMIOLOGY, 1986, 123 (03) : 392 - 402
  • [5] Applying optimal model selection in principal stratification for causal inference
    Odondi, Lang'o
    McNamee, Roseanne
    STATISTICS IN MEDICINE, 2013, 32 (11) : 1815 - 1828
  • [6] Variable selection and estimation in causal inference using Bayesian spike and slab priors
    Koch, Brandon
    Vock, David M.
    Wolfson, Julian
    Vock, Laura Boehm
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2020, 29 (09) : 2445 - 2469
  • [7] Evaluation framework to guide model selection and cohort definition in causal inference
    Shimoni, Yishia
    Ravid, Sivan
    Karavani, Ehud
    Bak, Peter
    Ng, Marie
    Alford, Sharon Hensley
    Meade, Denise
    Goldschmidt, Ya'ara
    PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2019, 28 : 202 - 203
  • [8] Inference and model selection in general causal time series with exogenous covariates
    Diop, Mamadou Lamine
    Kengne, William
    ELECTRONIC JOURNAL OF STATISTICS, 2022, 16 (01): : 116 - 157
  • [9] Multi-parameters Model Selection for Network Inference
    Tozzo, Veronica
    Barla, Annalisa
    COMPLEX NETWORKS AND THEIR APPLICATIONS VIII, VOL 1, 2020, 881 : 566 - 577
  • [10] Multiple robustness estimation in causal inference
    Wang, Lei
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2019, 48 (23) : 5701 - 5718