Model selection and inference for estimation of causal parameters

被引：0

作者：

Rothenhausler, Dominik ^{[1
]}

机构：

[1] Stanford Univ, Dept Stat, Stanford, CA 94305 USA

来源：

ELECTRONIC JOURNAL OF STATISTICS | 2024年 / 18卷 / 02期

关键词：

Causal inference; model selection; data fusion; efficiency; CROSS-VALIDATION;

D O I：

10.1214/24-EJS2308

中图分类号：

O21 [概率论与数理统计]; C8 [统计学];

学科分类号：

020208 ; 070103 ; 0714 ;

摘要：

In causal inference there are often multiple reasonable estimators for a given target quantity. For example, one may reasonably use inverse probability weighting, an instrumental variables approach, or construct an estimate based on proxy outcomes if the actual outcome is difficult to measure. Ideally, the practitioner decides on an estimator before looking at the data. However, this might be challenging in practice since a priori it might not be clear to a practitioner how to choose the method. If the final model is chosen after peeking at the data, naive inferential procedures may fail. This raises the need for a model selection tool, with rigorous asymptotic guarantees. Since there is usually no loss function available in causal inference, standard model selection techniques do not apply. We propose a model selection procedure that estimates the squared pound 2- deviation of a finite-dimensional estimator from its target. The procedure relies on knowing an asymptotically unbiased (potentially highly variable) estimate of the parameter of interest. The resulting estimator is discontinuous and does not have a Gaussian limit distribution. Thus, standard asymptotic expansions do not apply. We derive asymptotically valid confidence intervals for low-dimensional settings that take into account the model selection step. The performance of the approach for estimation and inference for average treatment effects is evaluated on simulated data sets in low-dimensional settings, including experimental data, instrumental variables settings and observational data with selection on observables.

引用

页码：5449 / 5483

页数：35

共 50 条

[1] On model selection and model misspecification in causal inference
Vansteelandt, Stijn
Bekaert, Maarten
Claeskens, Gerda
STATISTICAL METHODS IN MEDICAL RESEARCH, 2012, 21 (01) : 7 - 30
[2] A biologist's guide to model selection and causal inference
Laubach, Zachary M.
Murray, Eleanor J.
Hoke, Kim L.
Safran, Rebecca J.
Perng, Wei
PROCEEDINGS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2021, 288 (1943)
[3] An Alternative Doubly Robust Estimation in Causal Inference Model
Wei, Shaojie
Li, Gaorong
Zhang, Zhongzhan
COMMUNICATIONS IN MATHEMATICS AND STATISTICS, 2024, 12 (04) : 659 - 678
[4] THE ROLE OF MODEL SELECTION IN CAUSAL INFERENCE FROM NONEXPERIMENTAL DATA
ROBINS, JM
GREENLAND, S
AMERICAN JOURNAL OF EPIDEMIOLOGY, 1986, 123 (03) : 392 - 402
[5] Applying optimal model selection in principal stratification for causal inference
Odondi, Lang'o
McNamee, Roseanne
STATISTICS IN MEDICINE, 2013, 32 (11) : 1815 - 1828
[6] Variable selection and estimation in causal inference using Bayesian spike and slab priors
Koch, Brandon
Vock, David M.
Wolfson, Julian
Vock, Laura Boehm
STATISTICAL METHODS IN MEDICAL RESEARCH, 2020, 29 (09) : 2445 - 2469
[7] Evaluation framework to guide model selection and cohort definition in causal inference
Shimoni, Yishia
Ravid, Sivan
Karavani, Ehud
Bak, Peter
Ng, Marie
Alford, Sharon Hensley
Meade, Denise
Goldschmidt, Ya'ara
PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2019, 28 : 202 - 203
[8] Inference and model selection in general causal time series with exogenous covariates
Diop, Mamadou Lamine
Kengne, William
ELECTRONIC JOURNAL OF STATISTICS, 2022, 16 (01): : 116 - 157
[9] Multi-parameters Model Selection for Network Inference
Tozzo, Veronica
Barla, Annalisa
COMPLEX NETWORKS AND THEIR APPLICATIONS VIII, VOL 1, 2020, 881 : 566 - 577
[10] Multiple robustness estimation in causal inference
Wang, Lei
COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2019, 48 (23) : 5701 - 5718

← 1 2 3 4 5 →