To use or not to use propensity score matching?

被引:88
作者
Wang, Jixian [1 ]
机构
[1] Celgene Int Sarl, Route Perreux 1, Boudry, Basel, Switzerland
关键词
causal inference; dose-exposure-response relationship; health technology assessment; modeling and simulation; MULTIVARIATE; PERFORMANCE; BALANCE; ESTIMATORS;
D O I
10.1002/pst.2051
中图分类号
R9 [药学];
学科分类号
1007 ;
摘要
Propensity score matching (PSM) has been widely used to reduce confounding biases in observational studies. Its properties for statistical inference have also been investigated and well documented. However, some recent publications showed concern of using PSM, especially on increasing postmatching covariate imbalance, leading to discussion on whether PSM should be used or not. We review empirical and theoretical evidence for and against its use in practice and revisit the property of equal percent bias reduction and adapt it to more practical situations, showing that PSM has some additional desirable properties. With a small simulation, we explore the impact of caliper width on biases due to mismatching in matched samples and due to the difference between matched and target populations and show some issue of PSM may be due to inadequate caliper selection. In summary, we argue that the right question should be when and how to use PSM rather than to use or not to use it and give suggestions accordingly.
引用
收藏
页码:15 / 24
页数:10
相关论文
共 44 条
[1]   Large sample properties of matching estimators for average treatment effects [J].
Abadie, A ;
Imbens, GW .
ECONOMETRICA, 2006, 74 (01) :235-267
[2]   Matching on the Estimated Propensity Score [J].
Abadie, Alberto ;
Imbens, Guido W. .
ECONOMETRICA, 2016, 84 (02) :781-807
[3]   The use of bootstrapping when using propensity-score matching without replacement: a simulation study [J].
Austin, Peter C. ;
Small, Dylan S. .
STATISTICS IN MEDICINE, 2014, 33 (24) :4306-4319
[4]   The use of propensity score methods with survival or time-to-event outcomes: reporting measures of effect similar to those used in randomized experiments [J].
Austin, Peter C. .
STATISTICS IN MEDICINE, 2014, 33 (07) :1242-1258
[5]   A comparison of 12 algorithms for matching on the propensity score [J].
Austin, Peter C. .
STATISTICS IN MEDICINE, 2014, 33 (06) :1057-1069
[6]   The performance of different propensity score methods for estimating marginal hazard ratios [J].
Austin, Peter C. .
STATISTICS IN MEDICINE, 2013, 32 (16) :2837-2849
[7]   Comparing paired vs non-paired statistical methods of analyses when making inferences about absolute risk reductions in propensity-score matched samples [J].
Austin, Peter C. .
STATISTICS IN MEDICINE, 2011, 30 (11) :1292-1301
[8]   Optimal caliper widths for propensity-score matching when estimating differences in means and differences in proportions in observational studies [J].
Austin, Peter C. .
PHARMACEUTICAL STATISTICS, 2011, 10 (02) :150-161
[9]   The performance of different propensity-score methods for estimating differences in proportions (risk differences or absolute risk reductions) in observational studies [J].
Austin, Peter C. .
STATISTICS IN MEDICINE, 2010, 29 (20) :2137-2148
[10]   Some Methods of Propensity-Score Matching had Superior Performance to Others: Results of an Empirical Investigation and Monte Carlo simulations [J].
Austin, Peter C. .
BIOMETRICAL JOURNAL, 2009, 51 (01) :171-184