Improving Inferences About Null Effects With Bayes Factors and Equivalence Tests

被引:215
作者
Lakens, Daniel [1 ]
McLatchie, Neil [2 ]
Isager, Peder M. [1 ]
Scheel, Anne M. [1 ]
Dienes, Zoltan [3 ]
机构
[1] Eindhoven Univ Technol, Dept Human Technol Interact, Eindhoven, Netherlands
[2] Univ Lancaster, Dept Psychol, Lancaster, England
[3] Sussex Univ, Sch Psychol, Brighton, E Sussex, England
来源
JOURNALS OF GERONTOLOGY SERIES B-PSYCHOLOGICAL SCIENCES AND SOCIAL SCIENCES | 2020年 / 75卷 / 01期
关键词
Bayesian statistics; Falsification; Frequentist statistics; Hypothesis testing; TOST; T TESTS; PAIN; HEALTH;
D O I
10.1093/geronb/gby065
中图分类号
R592 [老年病学]; C [社会科学总论];
学科分类号
03 ; 0303 ; 100203 ;
摘要
Researchers often conclude an effect is absent when a null-hypothesis significance test yields a nonsignificant p value. However, it is neither logically nor statistically correct to conclude an effect is absent when a hypothesis test is not significant. We present two methods to evaluate the presence or absence of effects: Equivalence testing (based on frequentist statistics) and Bayes factors (based on Bayesian statistics). In four examples from the gerontology literature, we illustrate different ways to specify alternative models that can be used to reject the presence of a meaningful or predicted effect in hypothesis tests. We provide detailed explanations of how to calculate, report, and interpret Bayes factors and equivalence tests. We also discuss how to design informative studies that can provide support for a null model or for the absence of a meaningful effect. The conceptual differences between Bayes factors and equivalence tests are discussed, and we also note when and why they might lead to similar or different inferences in practice. It is important that researchers are able to falsify predictions or can quantify the support for predicted null effects. Bayes factors and equivalence tests provide useful statistical tools to improve inferences about null effects.
引用
收藏
页码:45 / 57
页数:13
相关论文
共 44 条
[1]   When power analyses based on pilot data are biased: Inaccurate effect size estimators and follow-up bias [J].
Albers, Casper ;
Lakens, Daniel .
JOURNAL OF EXPERIMENTAL SOCIAL PSYCHOLOGY, 2018, 74 :187-195
[2]   STATISTICS NOTES - ABSENCE OF EVIDENCE IS NOT EVIDENCE OF ABSENCE [J].
ALTMAN, DG ;
BLAND, JM .
BRITISH MEDICAL JOURNAL, 1995, 311 (7003) :485-485
[3]  
[Anonymous], J GERONTOLOGY B
[4]  
[Anonymous], JAM VERS 0 8
[5]  
[Anonymous], 2017, BAYES FACTORS CONT 1
[6]  
[Anonymous], 2018, ARXIV170808278
[7]   When to use the Bonferroni correction [J].
Armstrong, Richard A. .
OPHTHALMIC AND PHYSIOLOGICAL OPTICS, 2014, 34 (05) :502-508
[8]   Long-term Associations Between Physical Frailty and Performance in Specific Cognitive Domains [J].
Bunce, David ;
Batterham, Philip J. ;
Mackinnon, Andrew J. .
JOURNALS OF GERONTOLOGY SERIES B-PSYCHOLOGICAL SCIENCES AND SOCIAL SCIENCES, 2019, 74 (06) :919-926
[9]   Instead of "playing the game" it is time to change the rules: Registered Reports at AIMS Neuroscience and beyond [J].
Chambers, Christopher D. ;
Feredoes, Eva ;
Muthukumaraswamy, Suresh D. ;
Etchells, Peter J. .
AIMS NEUROSCIENCE, 2014, 1 (01) :4-17
[10]  
de Groot A., 1969, Methodology