ESTIMATING TREATMENT EFFECT HETEROGENEITY IN RANDOMIZED PROGRAM EVALUATION

被引:315
作者
Imai, Kosuke [1 ]
Ratkovic, Marc [1 ]
机构
[1] Princeton Univ, Dept Polit, Princeton, NJ 08544 USA
关键词
Causal inference; individualized treatment rules; LASSO; moderation; variable selection; SUPPORT VECTOR MACHINES; VARIABLE SELECTION; SUBGROUP ANALYSIS; PROPENSITY SCORE; CLINICAL-TRIALS; TREATMENT RULES; VOTER TURNOUT; REGRESSION; CLASSIFICATION; REGULARIZATION;
D O I
10.1214/12-AOAS593
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
When evaluating the efficacy of social programs and medical treatments using randomized experiments, the estimated overall average causal effect alone is often of limited value and the researchers must investigate when the treatments do and do not work. Indeed, the estimation of treatment effect heterogeneity plays an essential role in (1) selecting the most effective treatment from a large number of available treatments, (2) ascertaining subpopulations for which a treatment is effective or harmful, (3) designing individualized optimal treatment regimes, (4) testing for the existence or lack of heterogeneous treatment effects, and (5) generalizing causal effect estimates obtained from an experimental sample to a target population. In this paper, we formulate the estimation of heterogeneous treatment effects as a variable selection problem. We propose a method that adapts the Support Vector Machine classifier by placing separate sparsity constraints over the pre-treatment parameters and causal heterogeneity parameters of interest. The proposed method is motivated by and applied to two well-known randomized evaluation studies in the social sciences. Our method selects the most effective voter mobilization strategies from a large number of alternative strategies, and it also identifies the characteristics of workers who greatly benefit from (or are negatively affected by) a job training program. In our simulation studies, we find that the proposed method often outperforms some commonly used alternatives.
引用
收藏
页码:443 / 470
页数:28
相关论文
共 59 条
[1]  
[Anonymous], 1990, Statistical Science, DOI DOI 10.1214/SS/1177012031
[2]  
[Anonymous], 1984, Wadsworth Statistics/Probability Series
[3]  
[Anonymous], 2006, Journal of the Royal Statistical Society, Series B
[4]  
Bradley P. S., 1998, Machine Learning. Proceedings of the Fifteenth International Conference (ICML'98), P82
[5]   BART: BAYESIAN ADDITIVE REGRESSION TREES [J].
Chipman, Hugh A. ;
George, Edward I. ;
McCulloch, Robert E. .
ANNALS OF APPLIED STATISTICS, 2010, 4 (01) :266-298
[6]   Generalizing Evidence From Randomized Clinical Trials to Target Populations [J].
Cole, Stephen R. ;
Stuart, Elizabeth A. .
AMERICAN JOURNAL OF EPIDEMIOLOGY, 2010, 172 (01) :107-115
[7]   Nonparametric tests for treatment effect heterogeneity [J].
Crump, Richard K. ;
Hotz, V. Joseph ;
Imbens, Guido W. ;
Mitnik, Oscar A. .
REVIEW OF ECONOMICS AND STATISTICS, 2008, 90 (03) :389-405
[8]   TREATMENT EFFECT HETEROGENEITY IN PAIRED DATA [J].
DAVISON, AC .
BIOMETRIKA, 1992, 79 (03) :463-474
[9]   Causal effects in, nonexperimental studies: Reevaluating the evaluation of training programs [J].
Dehejia, RH ;
Wahba, S .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1999, 94 (448) :1053-1062
[10]   Least angle regression - Rejoinder [J].
Efron, B ;
Hastie, T ;
Johnstone, I ;
Tibshirani, R .
ANNALS OF STATISTICS, 2004, 32 (02) :494-499