Model selection and inference for estimation of causal parameters

被引:1
作者
Rothenhausler, Dominik [1 ]
机构
[1] Stanford Univ, Dept Stat, Stanford, CA 94305 USA
关键词
Causal inference; model selection; data fusion; efficiency; CROSS-VALIDATION;
D O I
10.1214/24-EJS2308
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In causal inference there are often multiple reasonable estimators for a given target quantity. For example, one may reasonably use inverse probability weighting, an instrumental variables approach, or construct an estimate based on proxy outcomes if the actual outcome is difficult to measure. Ideally, the practitioner decides on an estimator before looking at the data. However, this might be challenging in practice since a priori it might not be clear to a practitioner how to choose the method. If the final model is chosen after peeking at the data, naive inferential procedures may fail. This raises the need for a model selection tool, with rigorous asymptotic guarantees. Since there is usually no loss function available in causal inference, standard model selection techniques do not apply. We propose a model selection procedure that estimates the squared pound 2- deviation of a finite-dimensional estimator from its target. The procedure relies on knowing an asymptotically unbiased (potentially highly variable) estimate of the parameter of interest. The resulting estimator is discontinuous and does not have a Gaussian limit distribution. Thus, standard asymptotic expansions do not apply. We derive asymptotically valid confidence intervals for low-dimensional settings that take into account the model selection step. The performance of the approach for estimation and inference for average treatment effects is evaluated on simulated data sets in low-dimensional settings, including experimental data, instrumental variables settings and observational data with selection on observables.
引用
收藏
页码:5449 / 5483
页数:35
相关论文
共 50 条
[31]   Causal Inference under Interference and Model Uncertainty [J].
Zhang, Chi ;
Mohan, Karthika ;
Pearl, Judea .
CONFERENCE ON CAUSAL LEARNING AND REASONING, VOL 213, 2023, 213 :371-385
[32]   The estimation of causal effects: an introduction to methods of causal inference based on peer effects in education [J].
Legewie, Joscha .
KOLNER ZEITSCHRIFT FUR SOZIOLOGIE UND SOZIALPSYCHOLOGIE, 2012, 64 (01) :123-153
[33]   Covariate selection with group lasso and doubly robust estimation of causal effects [J].
Koch, Brandon ;
Vock, David M. ;
Wolfson, Julian .
BIOMETRICS, 2018, 74 (01) :8-17
[34]   INFLUENCE ESTIMATION ON SOCIAL MEDIA NETWORKS USING CAUSAL INFERENCE [J].
Smith, Steven T. ;
Kao, Edward K. ;
Shah, Danelle C. ;
Simek, Olga ;
Rubin, Donald B. .
2018 IEEE STATISTICAL SIGNAL PROCESSING WORKSHOP (SSP), 2018, :328-332
[35]   Doubly robust estimation in missing data and causal inference models [J].
Bang, H .
BIOMETRICS, 2005, 61 (04) :962-972
[36]   Targeted Maximum Likelihood Estimation for Causal Inference in Observational Studies [J].
Schuler, Megan S. ;
Rose, Sherri .
AMERICAN JOURNAL OF EPIDEMIOLOGY, 2017, 185 (01) :65-73
[37]   Data integration in causal inference [J].
Shi, Xu ;
Pan, Ziyang ;
Miao, Wang .
WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2023, 15 (01)
[38]   Model selection in occupancy models: Inference versus prediction [J].
Stewart, Peter S. ;
Stephens, Philip A. ;
Hill, Russell A. ;
Whittingham, Mark J. ;
Dawson, Wayne .
ECOLOGY, 2023, 104 (03)
[39]   SEMIPARAMETRIC BAYESIAN CAUSAL INFERENCE [J].
Ray, Kolyan ;
van der Vaart, Aad .
ANNALS OF STATISTICS, 2020, 48 (05) :2999-3020
[40]   Graph Disentangle Causal Model: Enhancing Causal Inference in Networked Observational Data [J].
Hu, Binbin ;
An, Zhicheng ;
Wu, Zhengwei ;
Tu, Ke ;
Liu, Ziqi ;
Zhang, Zhiqiang ;
Zhou, Jun ;
Feng, Yufei ;
Chen, Jiawei .
PROCEEDINGS OF THE EIGHTEENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, WSDM 2025, 2025, :1-9