Controlling Decision Errors With Minimal Costs: The Sequential Probability Ratio t Test

被引:29
作者
Schnuerch, Martin [1 ]
Erdfelder, Edgar [1 ]
机构
[1] Univ Mannheim, Sch Social Sci, Dept Psychol, D-68131 Mannheim, Germany
关键词
hypothesis testing; efficiency; statistical error probabilities; sequential analysis; sequential probability ratio test; STATISTICAL POWER; EFFECT SIZE; PSYCHOLOGICAL SCIENCE; RANDOMIZED-TRIALS; HYPOTHESIS TESTS; SAMPLE-SIZE; REPLICATION; REPLICABILITY; LIKELIHOOD; RECOMMENDATIONS;
D O I
10.1037/met0000234
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
For several years, the public debate in psychological science has been dominated by what is referred to as the reproducibility crisis. This crisis has, inter alia, drawn attention to the need for proper control of statistical decision errors in testing psychological hypotheses. However, conventional methods of error probability control often require fairly large samples. Sequential statistical tests provide an attractive alternative: They can be applied repeatedly during the sampling process and terminate whenever there is sufficient evidence in the data for one of the hypotheses of interest. Thus, sequential tests may substantially reduce the required sample size without compromising predefined error probabilities. Herein, we discuss the most efficient sequential design, the sequential probability ratio test (SPRT), and show how it is easily implemented for a 2-sample t test using standard statistical software. We demonstrate, by means of simulations, that the SPRT not only reliably controls error probabilities but also typically requires substantially smaller samples than standard t tests and other common sequential designs. Moreover, we investigate the robustness of the SPRT against violations of its assumptions. Finally, we illustrate the sequential t test by applying it to an empirical example and provide recommendations on how psychologists can employ it in their own research to benefit from its desirable properties. Translational Abstract Fostered by a series of unsuccessful attempts to replicate seemingly well-established empirical results, the reproducibility crisis has dominated the public debate in psychological science for several years. Apart from increasing awareness for the consequences of questionable research practices, the crisis has drawn attention to the shortcomings of currently dominating statistical procedures. Critically, conventional methods that allow for control of both Type I and Type II statistical error probabilities-alpha and beta, respectively- often require sample sizes much larger than typically employed. Therefore, we promote an alternative that requires substantially smaller sample sizes on average while still controlling error probabilities: sequential analysis. Unlike conventional tests, sequential tests are designed to be applied repeatedly during the sampling process and terminate as soon as there is sufficient evidence for one of the hypotheses of interest. Herein, we discuss the most efficient sequential design, the sequential probability ratio test (SPRT), and show how it is easily implemented for the common t test to compare means of 2 independent groups. We demonstrate by means of simulations that the SPRT reliably controls error probabilities and requires smaller samples than standard t tests or other common sequential designs. Moreover, we investigate the robustness of the SPRT against violations of its assumptions. Finally, we illustrate the sequential t test by applying it to an empirical example and provide concrete recommendations on how psychologists can employ it in their own research to benefit from its desirable properties.
引用
收藏
页码:206 / 226
页数:21
相关论文
共 85 条
[1]  
Anderson K., 2014, gsDesign: Group sequential design R package and its GUI
[2]   A MODIFICATION OF THE SEQUENTIAL PROBABILITY RATIO TEST TO REDUCE THE SAMPLE-SIZE [J].
ANDERSON, TW .
ANNALS OF MATHEMATICAL STATISTICS, 1960, 31 (01) :165-197
[3]  
[Anonymous], 1993, A handbook for data analysis in the behavioral sciences: Methodological issues, DOI DOI 10.1093/ACPROF:OSO/9780195153729.003.0013
[4]  
[Anonymous], 1986, The logic of scientific discovery
[5]  
[Anonymous], 1947, SEQUENTIAL ANAL
[6]  
[Anonymous], 2015, SCIENCE, DOI [DOI 10.1126/science.aac4716, DOI 10.1126/SCIENCE.AAC4716]
[7]   FIXED-SAMPLE-SIZE ANALYSIS OF SEQUENTIAL OBSERVATIONS [J].
ANSCOMBE, FJ .
BIOMETRICS, 1954, 10 (01) :89-100
[8]  
ARMITAGE P, 1957, BIOMETRIKA, V44, P9, DOI 10.2307/2333237
[9]  
Armitage P., 1947, J R STAT SOC B, V9, P250, DOI DOI 10.2307/2984117
[10]   Recommendations for Increasing Replicability in Psychology [J].
Asendorpf, Jens B. ;
Conner, Mark ;
De Fruyt, Filip ;
De Houwer, Jan ;
Denissen, Jaap J. A. ;
Fiedler, Klaus ;
Fiedler, Susann ;
Funder, David C. ;
Kliegl, Reinhold ;
Nosek, Brian A. ;
Perugini, Marco ;
Roberts, Brent W. ;
Schmitt, Manfred ;
vanAken, Marcel A. G. ;
Weber, Hannelore ;
Wicherts, Jelte M. .
EUROPEAN JOURNAL OF PERSONALITY, 2013, 27 (02) :108-119