Estimating Treatment Effect in the Wild via Differentiated Confounder Balancing

被引:54
作者
Kuang, Kun [1 ,4 ]
Cui, Peng [1 ]
Li, Bo [2 ]
Jiang, Meng [3 ]
Yang, Shiqiang [1 ]
机构
[1] Tsinghua Univ, Dept Comp Sci, Beijing, Peoples R China
[2] Tsinghua Univ, Sch Econ & Management, Beijing, Peoples R China
[3] Univ Notre Dame, Dept Comp Sci & Engn, Notre Dame, IN 46556 USA
[4] Tsinghua Natl Lab Informat Sci & Technol TNList, Beijing, Peoples R China
来源
KDD'17: PROCEEDINGS OF THE 23RD ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING | 2017年
基金
国家教育部科学基金资助; 中国国家自然科学基金;
关键词
Treatment Effect Estimation; Confounding Bias; Differentiated Confounder Balancing; PROPENSITY SCORE ESTIMATION;
D O I
10.1145/3097983.3098032
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The key challenge on estimating treatment effect in the wild observational studies is to handle confounding bias induced by imbalance of the confounder distributions between treated and control units. Traditional methods remove confounding bias by re-weighting units with supposedly accurate propensity score estimation under the unconfoundedness assumption. Controlling high-dimensional variables may make the unconfoundedness assumption more plausible, but poses new challenge on accurate propensity score estimation. One s-trand of recent literature seeks to directly optimize weights to balance confounder distributions, bypassing propensity score estimation. But existing balancing methods fail to do selection and differentiation among the pool of a large number of potential confounders, leading to possible underperformance in many high dimensional settings. In this paper, we propose a data-driven Differentiated Confounder Balancing (DCB) algorithm to jointly select confounders, differentiate weights of confounders and balance confounder distributions for treatment effect estimation in the wild high dimensional settings. The synergistic learning algorithm we proposed is more capable of reducing the confounding bias in many observational studies. To validate the effectiveness of our DCB algorithm, we conduct extensive experiments on both synthetic and real datasets. The experimental results clearly demonstrate that our DCB algorithm outperforms the state-of-the-art methods on treatment effect estimation.
引用
收藏
页码:265 / 274
页数:10
相关论文
共 26 条
[1]  
[Anonymous], 2017, P 31 AAAI C ART INT
[2]  
[Anonymous], 2010, P 16 ACM SIGKDD INT
[3]  
Athey S., 2016, EFFICIENT INFERENCE
[4]  
Athey Susan., 2015, Stat, V1050, P5
[5]   A Tutorial and Case Study in Propensity Score Analysis: An Application to Estimating the Effect of In-Hospital Smoking Cessation Counseling on Mortality [J].
Austin, Peter C. .
MULTIVARIATE BEHAVIORAL RESEARCH, 2011, 46 (01) :119-151
[6]   Doubly robust estimation in missing data and causal inference models [J].
Bang, H .
BIOMETRICS, 2005, 61 (04) :962-972
[7]  
Bottou L, 2013, J MACH LEARN RES, V14, P3207
[8]  
Chan K. C. G., 2015, J ROYAL STAT SOC B
[9]  
Chernozhukov, 2016, DOUBLE DEBIASED MACH
[10]   GENETIC MATCHING FOR ESTIMATING CAUSAL EFFECTS: A GENERAL MULTIVARIATE MATCHING METHOD FOR ACHIEVING BALANCE IN OBSERVATIONAL STUDIES [J].
Diamond, Alexis ;
Sekhon, Jasjeet S. .
REVIEW OF ECONOMICS AND STATISTICS, 2013, 95 (03) :932-945