Calibration of Heterogeneous Treatment Effects in Randomized Experiments

被引：1

作者：

Leng, Yan ^{[1
]}

Dimmery, Drew ^{[2
]}

机构：

[1] Univ Texas Austin, McCombs Sch Business, Austin, TX 78705 USA

[2] Univ Vienna, Res Network Data Sci, A-1090 Vienna, Austria

来源：

INFORMATION SYSTEMS RESEARCH | 2024年 / 35卷 / 04期

基金：

美国国家科学基金会;

关键词：

causal inference; heterogeneous treatment effects; randomized experiments; calibration; machine learning; REGRESSION; FRAMEWORK;

D O I：

10.1287/isre.2021.0343

中图分类号：

G25 [图书馆学、图书馆事业]; G35 [情报学、情报工作];

学科分类号：

1205 ; 120501 ;

摘要：

Machine learning is commonly used to estimate the heterogeneous treatment effects (HTEs) in randomized experiments. Using large-scale randomized experiments on the Facebook and Criteo platforms, we observe substantial discrepancies between machine learning-based treatment effect estimates and difference-in-means estimates directly from the randomized experiment. This paper provides a two-step framework for practitioners and researchers to diagnose and rectify this discrepancy. We first introduce a diagnostic tool to assess whether bias exists in the model-based estimates from machine learning. If bias exists, we then offer a model-agnostic method to calibrate any HTE estimates to known, unbiased, subgroup difference-in-means estimates, ensuring that the sign and magnitude of the subgroup estimates approximate the model-free benchmarks. This calibration method requires no additional data and can be scaled for large data sets. To highlight potential sources of bias, we theoretically show that this bias can result from regularization and further use synthetic simulation to show biases result from misspecification and high-dimensional features. We demonstrate the efficacy of our calibration method using extensive synthetic simulations and two real-world randomized experiments. We further demonstrate the practical value of this calibration in three typical policy-making settings: a prescriptive, budget-constrained optimization framework; a setting seeking to maximize multiple performance indicators; and a multitreatment uplift modeling setting.

引用

页码：1721 / 1742

页数：22

共 63 条

[1]

Aronow P., 2021, arXiv

[2]

Athey S, 2015, STAT, V1050, P1

[3] Recursive partitioning for heterogeneous causal effects [J].

Athey, Susan ;

Imbens, Guido .

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2016, 113 (27) :7353-7360

[4] Random forests [J].

Breiman, L .

MACHINE LEARNING, 2001, 45 (01) :5-32

[5]

Casella G., 2002, Statistical inference, V2

[6] Using users: When does external knowledge enhance corporate product innovation? [J].

Chatterji, Aaron K. ;

Fabrizio, Kira R. .

STRATEGIC MANAGEMENT JOURNAL, 2014, 35 (10) :1427-1445

[7]

Chernozhukov V, 2023, NBER Working Paper No. 24678

[8] The Sorted Effects Method: Discovering Heterogeneous Effects Beyond Their Averages [J].

Chernozhukov, Victor ;

Fernandez-Val, Ivan ;

Luo, Ye .

ECONOMETRICA, 2018, 86 (06) :1911-1938

[9] Data-Driven Metric Development for Online Controlled Experiments: Seven Lessons Learned [J].

Deng, Alex ;

Shi, Xiaolin .

KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :77-86

[10]

Diemert E, 2018, P ADKDD TARGETAD ADK, P1

← 1 2 3 4 5 6 7 →