Data Aggregation Can Lead to Biased Inferences in Bayesian Linear Mixed Models and Bayesian Analysis of Variance

被引:5
作者
Schad, Daniel J. [1 ,4 ]
Nicenboim, Bruno [2 ]
Vasishth, Shravan [3 ]
机构
[1] HMU Hlth & Med Univ, Inst Mind Brain & Behav, Potsdam, Germany
[2] Tilburg Univ, Dept Cognit Sci & Artificial Intelligence, Tilburg, Netherlands
[3] Univ Potsdam, Dept Linguist, Potsdam, Germany
[4] HMU Hlth & Med Univ, Inst Mind Brain & Behav, Olymp Weg 1, D-14471 Potsdam, Germany
关键词
Bayes factors; Bayesian model comparison; simulation-based calibration; sphericity assumption; items; FIXED-EFFECT FALLACY; R PACKAGE; SPECIAL-ISSUE; LANGUAGE; ERROR; TESTS;
D O I
10.1037/met0000621
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
Bayesian linear mixed-effects models (LMMs) and Bayesian analysis of variance (ANOVA) are increasingly being used in the cognitive sciences to perform null hypothesis tests, where a null hypothesis that an effect is zero is compared with an alternative hypothesis that the effect exists and is different from zero. While software tools for Bayes factor null hypothesis tests are easily accessible, how to specify the data and the model correctly is often not clear. In Bayesian approaches, many authors use data aggregation at the by-subject level and estimate Bayes factors on aggregated data. Here, we use simulation-based calibration for model inference applied to several example experimental designs to demonstrate that, as with frequentist analysis, such null hypothesis tests on aggregated data can be problematic in Bayesian analysis. Specifically, when random slope variances differ (i.e., violated sphericity assumption), Bayes factors are too conservative for contrasts where the variance is small and they are too liberal for contrasts where the variance is large. Running Bayesian ANOVA on aggregated data can-if the sphericity assumption is violated-likewise lead to biased Bayes factor results. Moreover, Bayes factors for by-subject aggregated data are biased (too liberal) when random item slope variance is present but ignored in the analysis. These problems can be circumvented or reduced by running Bayesian LMMs on nonaggregated data such as on individual trials, and by explicitly modeling the full random effects structure. Reproducible code is available from https://osf.io/mjf47/.
引用
收藏
页数:37
相关论文
共 61 条
[1]  
[Anonymous], 2022, JASP (Version 0.16.1)
[2]   Random effects structure for confirmatory hypothesis testing: Keep it maximal [J].
Barr, Dale J. ;
Levy, Roger ;
Scheepers, Christoph ;
Tily, Harry J. .
JOURNAL OF MEMORY AND LANGUAGE, 2013, 68 (03) :255-278
[3]  
Bates D, 2018, Arxiv, DOI [arXiv:1506.04967, 10.48550/arxiv.1506.04967, DOI 10.48550/ARXIV.1506.04967]
[4]  
Bates D, 2015, J STAT SOFTW, V67, P1, DOI [10.1007/s13201-024-02166-7, 10.3390/agronomy15020428]
[5]   EFFICIENT ESTIMATION OF FREE-ENERGY DIFFERENCES FROM MONTE-CARLO DATA [J].
BENNETT, CH .
JOURNAL OF COMPUTATIONAL PHYSICS, 1976, 22 (02) :245-268
[6]  
Betancourt M, 2018, Arxiv, DOI [arXiv:1803.08393, 10.48550/ARXIV.1803.08393]
[7]   SOME THEOREMS ON QUADRATIC FORMS APPLIED IN THE STUDY OF ANALYSIS OF VARIANCE PROBLEMS .1. EFFECT OF INEQUALITY OF VARIANCE IN THE ONE-WAY CLASSIFICATION [J].
BOX, GEP .
ANNALS OF MATHEMATICAL STATISTICS, 1954, 25 (02) :290-302
[8]  
Bürkner PC, 2018, R J, V10, P395
[9]   brms: An R Package for Bayesian Multilevel Models Using Stan [J].
Buerkner, Paul-Christian .
JOURNAL OF STATISTICAL SOFTWARE, 2017, 80 (01) :1-28
[10]   Stan: A Probabilistic Programming Language [J].
Carpenter, Bob ;
Gelman, Andrew ;
Hoffman, Matthew D. ;
Lee, Daniel ;
Goodrich, Ben ;
Betancourt, Michael ;
Brubaker, Marcus A. ;
Guo, Jiqiang ;
Li, Peter ;
Riddell, Allen .
JOURNAL OF STATISTICAL SOFTWARE, 2017, 76 (01) :1-29