Multiple hypothesis testing in experimental economics

被引:207
作者
List, John A. [1 ]
Shaikh, Azeem M. [1 ]
Xu, Yang [1 ]
机构
[1] Univ Chicago, Dept Econ, 5757 S Univ Ave, Chicago, IL 60637 USA
基金
美国国家科学基金会;
关键词
Experiments; Multiple hypothesis testing; Multiple treatments; Multiple outcomes; Multiple subgroups; Randomized controlled trial; Bootstrap; Balance; PERRY PRESCHOOL; BEHAVIORALIST;
D O I
10.1007/s10683-018-09597-5
中图分类号
F [经济];
学科分类号
02 ;
摘要
The analysis of data from experiments in economics routinely involves testing multiple null hypotheses simultaneously. These different null hypotheses arise naturally in this setting for at least three different reasons: when there are multiple outcomes of interest and it is desired to determine on which of these outcomes a treatment has an effect; when the effect of a treatment may be heterogeneous in that it varies across subgroups defined by observed characteristics and it is desired to determine for which of these subgroups a treatment has an effect; and finally when there are multiple treatments of interest and it is desired to determine which treatments have an effect relative to either the control or relative to each of the other treatments. In this paper, we provide a bootstrap-based procedure for testing these null hypotheses simultaneously using experimental data in which simple random sampling is used to assign treatment status to units. Using the general results in Romano and Wolf (Ann Stat 38:598-633, 2010), we show under weak assumptions that our procedure (1) asymptotically controls the familywise error rate-the probability of one or more false rejections-and (2) is asymptotically balanced in that the marginal probability of rejecting any true null hypothesis is approximately equal in large samples. Importantly, by incorporating information about dependence ignored in classical multiple testing procedures, such as the Bonferroni and Holm corrections, our procedure has much greater ability to detect truly false null hypotheses. In the presence of multiple treatments, we additionally show how to exploit logical restrictions across null hypotheses to further improve power. We illustrate our methodology by revisiting the study by Karlan and List (Am Econ Rev 97(5):1774-1793, 2007) of why people give to charitable causes.
引用
收藏
页码:773 / 793
页数:21
相关论文
共 36 条
[1]   Multiple Inference and Gender Differences in the Effects of Early Intervention: A Reevaluation of the Abecedarian, Perry Preschool, and Early Training Projects [J].
Anderson, Michael L. .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2008, 103 (484) :1481-1495
[2]   THE SEARCH FOR ASTERISKS: COMPROMISED STATISTICAL TESTS AND FLAWED THEORIES [J].
Bettis, Richard A. .
STRATEGIC MANAGEMENT JOURNAL, 2012, 33 (01) :108-113
[3]   Treatment effect bounds: An application to Swan-Ganz catheterization [J].
Bhattacharya, Jay ;
Shaikh, Azeem M. ;
Vytlacil, Edward .
JOURNAL OF ECONOMETRICS, 2012, 168 (02) :223-243
[4]  
Bonferroni C.E., 1935, Il calcolo delle assicurazioni su gruppi di teste
[5]  
Bugni F., 2015, TECHNICAL REPORT
[6]   Evaluating replicability of laboratory experiments in economics [J].
Camerer, Colin F. ;
Dreber, Anna ;
Forsell, Eskil ;
Ho, Teck-Hua ;
Huber, Juergen ;
Johannesson, Magnus ;
Kirchler, Michael ;
Almenberg, Johan ;
Altmejd, Adam ;
Chan, Taizan ;
Heikensten, Emma ;
Holzmeister, Felix ;
Imai, Taisuke ;
Isaksson, Siri ;
Nave, Gideon ;
Pfeiffer, Thomas ;
Razen, Michael ;
Wu, Hang .
SCIENCE, 2016, 351 (6280) :1433-1436
[7]   Testing for heterogeneous treatment effects in experimental data: false discovery risks and correction procedures [J].
Fink, Guenther ;
McConnell, Margaret ;
Vollmer, Sebastian .
JOURNAL OF DEVELOPMENT EFFECTIVENESS, 2014, 6 (01) :44-57
[8]  
Flory J. A., 2015, GENDER AGE COM UNPUB
[9]   Do Competitive Workplaces Deter Female Workers? A Large-Scale Natural Field Experiment on Job Entry Decisions [J].
Flory, Jeffrey A. ;
Leibbrandt, Andreas ;
List, John A. .
REVIEW OF ECONOMIC STUDIES, 2015, 82 (01) :122-155
[10]   Performance in competitive environments: Gender differences [J].
Gneezy, U ;
Niederle, M ;
Rustichini, A .
QUARTERLY JOURNAL OF ECONOMICS, 2003, 118 (03) :1049-1074