DOUBLY DEBIASED LASSO: HIGH-DIMENSIONAL INFERENCE UNDER HIDDEN CONFOUNDING

被引:9
作者
Guo, Zijian [1 ]
Cevid, Domagoj [2 ]
Buhlmann, Peter [2 ]
机构
[1] Rutgers State Univ, Dept Stat, Piscataway, NJ 08854 USA
[2] Swiss Fed Inst Technol, Seminar Stat, Zurich, Switzerland
基金
欧洲研究理事会;
关键词
Causal inference; structural equation model; dense confounding; linear model; spectral deconfounding; COVARIANCE-MATRIX ESTIMATION; CONFIDENCE-INTERVALS; INVALID INSTRUMENTS; GENE-EXPRESSION; SELECTION; VARIABLES; REGIONS; MODELS; NUMBER; RATES;
D O I
10.1214/21-AOS2152
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Inferring causal relationships or related associations from observational data can be invalidated by the existence of hidden confounding. We focus on a high-dimensional linear regression setting, where the measured covariates are affected by hidden confounding and propose the doubly debiased lasso estimator for individual components of the regression coefficient vector. Our advocated method simultaneously corrects both the bias due to estimation of high-dimensional parameters as well as the bias caused by the hidden confounding. We establish its asymptotic normality and also prove that it is efficient in the Gauss-Markov sense. The validity of our methodology relies on a dense confounding assumption, that is, that every confounding variable affects many covariates. The finite sample performance is illustrated with an extensive simulation study and a genomic application. The method is implemented by the DDL package available from CRAN.
引用
收藏
页码:1320 / 1347
页数:28
相关论文
共 68 条
[1]   Inferential theory for factor models of large dimensions. [J].
Bai, J .
ECONOMETRICA, 2003, 71 (01) :135-171
[2]   Determining the number of factors in approximate factor models [J].
Bai, JS ;
Ng, S .
ECONOMETRICA, 2002, 70 (01) :191-221
[3]   PROGRAM EVALUATION AND CAUSAL INFERENCE WITH HIGH-DIMENSIONAL DATA [J].
Belloni, A. ;
Chernozhukov, V. ;
Fernandez-Val, I. ;
Hansen, C. .
ECONOMETRICA, 2017, 85 (01) :233-298
[4]   Inference on Treatment Effects after Selection among High-Dimensional ControlsaEuro [J].
Belloni, Alexandre ;
Chernozhukov, Victor ;
Hansen, Christian .
REVIEW OF ECONOMIC STUDIES, 2014, 81 (02) :608-650
[5]   SIMULTANEOUS ANALYSIS OF LASSO AND DANTZIG SELECTOR [J].
Bickel, Peter J. ;
Ritov, Ya'acov ;
Tsybakov, Alexandre B. .
ANNALS OF STATISTICS, 2009, 37 (04) :1705-1732
[6]   Sample size importantly limits the usefulness of instrumental variable methods, depending on instrument strength and level of confounding [J].
Boef, Anna G. C. ;
Dekkers, Olaf M. ;
Vandenbroucke, Jan P. ;
le Cessie, Saskia .
JOURNAL OF CLINICAL EPIDEMIOLOGY, 2014, 67 (11) :1258-1264
[7]  
Bühlmann P, 2011, SPRINGER SER STAT, P1, DOI 10.1007/978-3-642-20192-9
[8]   A review of instrumental variable estimators for Mendelian randomization [J].
Burgess, Stephen ;
Small, Dylan S. ;
Thompson, Simon G. .
STATISTICAL METHODS IN MEDICAL RESEARCH, 2017, 26 (05) :2333-2355
[9]   CONFIDENCE INTERVALS FOR HIGH-DIMENSIONAL LINEAR REGRESSION: MINIMAX RATES AND ADAPTIVITY [J].
Cai, T. Tony ;
Guo, Zijian .
ANNALS OF STATISTICS, 2017, 45 (02) :615-646
[10]   A Constrained l1 Minimization Approach to Sparse Precision Matrix Estimation [J].
Cai, Tony ;
Liu, Weidong ;
Luo, Xi .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2011, 106 (494) :594-607