Nonparametric inverse-probability-weighted estimators based on the highly adaptive lasso

被引:7
作者
Ertefaie, Ashkan [1 ]
Hejazi, Nima S. [2 ]
van der Laan, Mark J. [3 ,4 ]
机构
[1] Univ Rochester, Dept Biostat & Computat Biol, Rochester, NY 14627 USA
[2] Weill Cornell Med, Div Biostat, Dept Populat lIealth Sci, New York, NY USA
[3] Univ Calif Berkeley, Sch Publ Hlth, Div Biostat, Berkeley, CA 94720 USA
[4] Univ Calif Berkeley, Dept Stat, Berkeley, CA 94720 USA
关键词
adaptive estimation; causal inference; efficient influence function; semiparametric efficiency; DOUBLY ROBUST ESTIMATION; EFFICIENT ESTIMATION; MULTIPLE IMPUTATION; CAUSAL INFERENCE; REGRESSION;
D O I
10.1111/biom.13719
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Inverse-probability-weighted estimators are the oldest and potentially most commonly used class of procedures for the estimation of causal effects. By adjusting for selection biases via a weighting mechanism, these procedures estimate an effect of interest by constructing a pseudopopulation in which selection biases are eliminated. Despite their ease of use, these estimators require the correct specification of a model for the weighting mechanism, are known to be inefficient, and suffer from the curse of dimensionality. We propose a class of nonparametric inverse-probability-weighted estimators in which the weighting mechanism is estimated via undersmoothing of the highly adaptive lasso, a nonparametric regression function proven to converge at nearly n-1/3-rate to the true weighting mechanism. We demonstrate that our estimators are asymptotically linear with variance converging to the nonparametric efficiency bound. Unlike doubly robust estimators, our procedures require neither derivation of the efficient influence function nor specification of the conditional outcome model. Our theoretical developments have broad implications for the construction of efficient inverse-probability-weighted estimators in large statistical models and a variety of problem settings. We assess the practical performance of our estimators in simulation studies and demonstrate use of our proposed methodology with data from a large-scale epidemiologic study.
引用
收藏
页码:1029 / 1041
页数:13
相关论文
共 36 条
[1]  
[Anonymous], 2006, All of nonparametric statistics, DOI DOI 10.1007/0-387-30623-4
[2]   Using the Standardized Difference to Compare the Prevalence of a Binary Variable Between Two Groups in Observational Research [J].
Austin, Peter C. .
COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2009, 38 (06) :1228-1234
[3]   Doubly robust estimation in missing data and causal inference models [J].
Bang, H .
BIOMETRICS, 2005, 61 (04) :962-972
[4]   Doubly robust nonparametric inference on the average treatment effect [J].
Benkeser, D. ;
Carone, M. ;
van der Laan, M. J. ;
Gilbert, P. B. .
BIOMETRIKA, 2017, 104 (04) :863-880
[5]   The Highly Adaptive Lasso Estimator [J].
Benkeser, David ;
van der Laan, Mark .
PROCEEDINGS OF 3RD IEEE/ACM INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS, (DSAA 2016), 2016, :689-696
[6]  
Bibaut A. F., 2019, ARXIV
[7]  
Cai W.P., 2019, arXiv
[8]   Improving efficiency and robustness of the doubly robust estimator for a population mean with incomplete data [J].
Cao, Weihua ;
Tsiatis, Anastasios A. ;
Davidian, Marie .
BIOMETRIKA, 2009, 96 (03) :723-734
[9]   A comparison of multiple imputation and doubly robust estimation for analyses with missing data [J].
Carpenter, James R. ;
Kenward, Michael G. ;
Vansteelandt, Stijn .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 2006, 169 :571-584
[10]   MACHINE LEARNING IN ECONOMETRICS Double/Debiased/Neyman Machine Learning of Treatment Effects [J].
Chernozhukov, Victor ;
Chetverikov, Denis ;
Demirer, Mert ;
Duflo, Esther ;
Hansen, Christian ;
Newey, Whitney .
AMERICAN ECONOMIC REVIEW, 2017, 107 (05) :261-265