The Problem with "Magnitude-based Inference"

被引:90
作者
Sainani, Kristin L. [1 ]
机构
[1] Stanford Univ, Div Epidemiol, Dept Hlth Res & Policy, Stanford, CA 94305 USA
关键词
TYPE I ERROR; TYPE II ERROR; HYPOTHESIS TESTING; CONFIDENCE INTERVALS; STATISTICS; ERROR RATES; STATISTICS;
D O I
10.1249/MSS.0000000000001645
中图分类号
G8 [体育];
学科分类号
04 ; 0403 ;
摘要
Purpose A statistical method called magnitude-based inference (MBI) has gained a following in the sports science literature, despite concerns voiced by statisticians. Its proponents have claimed that MBI exhibits superior type I and type II error rates compared with standard null hypothesis testing for most cases. I have performed a reanalysis to evaluate this claim. Methods Using simulation code provided by MBI's proponents, I estimated type I and type II error rates for clinical and nonclinical MBI for a range of effect sizes, sample sizes, and smallest important effects. I plotted these results in a way that makes transparent the empirical behavior of MBI. I also reran the simulations after correcting mistakes in the definitions of type I and type II error provided by MBI's proponents. Finally, I confirmed the findings mathematically; and I provide general equations for calculating MBI's error rates without the need for simulation. Results Contrary to what MBI's proponents have claimed, MBI does not exhibit superior type I and type II error rates to standard null hypothesis testing. As expected, there is a tradeoff between type I and type II error. At precisely the small-to-moderate sample sizes that MBI's proponents deem optimal, MBI reduces the type II error rate at the cost of greatly inflating the type I error rateto two to six times that of standard hypothesis testing. Conclusions Magnitude-based inference exhibits worrisome empirical behavior. In contrast to standard null hypothesis testing, which has predictable type I error rates, the type I error rates for MBI vary widely depending on the sample size and choice of smallest important effect, and are often unacceptably high. Magnitude-based inference should not be used.
引用
收藏
页码:2166 / 2176
页数:11
相关论文
共 19 条
[1]   Inference About Magnitudes of Effects [J].
Barker, Richard J. ;
Schofield, Matthew R. .
INTERNATIONAL JOURNAL OF SPORTS PHYSIOLOGY AND PERFORMANCE, 2008, 3 (04) :547-557
[2]   The Numbers Will Love You Back in Return-I Promise [J].
Buchheit, Martin .
INTERNATIONAL JOURNAL OF SPORTS PHYSIOLOGY AND PERFORMANCE, 2016, 11 (04) :551-554
[3]   Beta-Alanine Supplementation Improves Throwing Velocities in Repeated Sprint Ability and 200-m Swimming Performance in Young Water Polo Players [J].
Claus, Gabriel Machado ;
Redkva, Paulo Eduardo ;
Pinheiro Brisola, Gabriel Mota ;
Malta, Elvis Sousa ;
Bonetti de Poli, Rodrigo de Araujo ;
Miyagi, Willian Eiji ;
Zagatto, Alessandro Moura .
PEDIATRIC EXERCISE SCIENCE, 2017, 29 (02) :203-212
[4]  
Cohen J., 1988, STAT POWER ANAL BEHA, DOI [10.4324/9780203771587, DOI 10.4324/9780203771587]
[5]   Explorations in statistics: confidence intervals [J].
Curran-Everett, Douglas .
ADVANCES IN PHYSIOLOGY EDUCATION, 2009, 33 (02) :87-90
[6]   Bayesian statistics in medical research: an intuitive alternative to conventional data analysis [J].
Gurrin, LC ;
Kurinczuk, JJ ;
Burton, PR .
JOURNAL OF EVALUATION IN CLINICAL PRACTICE, 2000, 6 (02) :193-204
[7]  
Hopkins W.G., 2006, Sportscience, V10, P63
[8]  
Hopkins WG, 2018, SAMPLE SIZE ESTIMATI
[9]  
Hopkins WG., 2007, Sportscience, V11, P16
[10]   Error Rates, Decisive Outcomes and Publication Bias with Several Inferential Methods [J].
Hopkins, Will G. ;
Batterham, Alan M. .
SPORTS MEDICINE, 2016, 46 (10) :1563-1573