Linear Mixed-Effects Models and the Analysis of Nonindependent Data: A Unified Framework to Analyze Categorical and Continuous Independent Variables that Vary Within-Subjects and/or Within-Items

被引:377
作者
Brauer, Markus [1 ]
Curtin, John J. [1 ]
机构
[1] Univ Wisconsin, Dept Psychol, 1202 West Johnson St, Madison, WI 53706 USA
关键词
The analysis of nonindependent data; within-subjects designs; linear mixed-effects models; fixed and random effects; convergence problems; PARTICIPANTS RESPOND; FALLACY; SAMPLES; POWER;
D O I
10.1037/met0000159
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
In this article we address a number of important issues that arise in the analysis of nonindependent data. Such data are common in studies in which predictors vary within "units" (e.g., within-subjects, within-classrooms). Most researchers analyze categorical within-unit predictors with repeated-measures ANOVAs, but continuous within-unit predictors with linear mixed-effects models (LMEMs). We show that both types of predictor variables can be analyzed within the LMEM framework. We discuss designs with multiple sources of nonindependence, for example, studies in which the same subjects rate the same set of items or in which students nested in classrooms provide multiple answers. We provide clear guidelines about the types of random effects that should be included in the analysis of such designs. We also present a number of corrective steps that researchers can take when convergence fails in LMEM models with too many parameters. We end with a brief discussion on the trade-off between power and generalizability in designs with "within-unit" predictors. Translational Abstract Researchers and practitioners sometimes want to analyze data that are "nonindependent." Data are said to be nonindependent when the study is designed such that certain data points can be expected to be on average more similar to each other than other data points. This is usually the case when each subject provides multiple data points (so-called within-subject designs), when subjects belonging to higher-order units influence each other (e.g., students clustered in classrooms, employees clustered in teams), or when subjects react to or evaluate the same set of items (e.g., pictures, words, sentences, products, art works, target individuals). In the present article, we propose that all types of nonindependent data can be analyzed with the same statistical technique called "linear mixed-effects models." Compared to standard statistical tests belonging to the family of "General Linear Models" (e.g., ANOVA, regression), linear mixed-effects models have a "complex error term," i.e., the data analyst has to explicitly include all possible reasons for why the predictions of the statistical model may be wrong (these possible reasons are called "random effects"). It is not always obvious how to identify all possible sources of error. In this article, we provide clear guidelines on the type of random effects that researchers and practitioners should include when estimating linear mixed-effects models. Failure to include the appropriate random effects leads to an unacceptable false positive rate (or "type I error rate"), i.e., a high proportion of statistically significant results for effects that do not exist in reality.
引用
收藏
页码:389 / 411
页数:23
相关论文
共 45 条
  • [1] Estimating the reproducibility of psychological science
    Aarts, Alexander A.
    Anderson, Joanna E.
    Anderson, Christopher J.
    Attridge, Peter R.
    Attwood, Angela
    Axt, Jordan
    Babel, Molly
    Bahnik, Stepan
    Baranski, Erica
    Barnett-Cowan, Michael
    Bartmess, Elizabeth
    Beer, Jennifer
    Bell, Raoul
    Bentley, Heather
    Beyan, Leah
    Binion, Grace
    Borsboom, Denny
    Bosch, Annick
    Bosco, Frank A.
    Bowman, Sara D.
    Brandt, Mark J.
    Braswell, Erin
    Brohmer, Hilmar
    Brown, Benjamin T.
    Brown, Kristina
    Bruening, Jovita
    Calhoun-Sauls, Ann
    Callahan, Shannon P.
    Chagnon, Elizabeth
    Chandler, Jesse
    Chartier, Christopher R.
    Cheung, Felix
    Christopherson, Cody D.
    Cillessen, Linda
    Clay, Russ
    Cleary, Hayley
    Cloud, Mark D.
    Cohn, Michael
    Cohoon, Johanna
    Columbus, Simon
    Cordes, Andreas
    Costantini, Giulio
    Alvarez, Leslie D. Cramblet
    Cremata, Ed
    Crusius, Jan
    DeCoster, Jamie
    DeGaetano, Michelle A.
    Della Penna, Nicolas
    den Bezemer, Bobby
    Deserno, Marie K.
    [J]. SCIENCE, 2015, 349 (6251)
  • [2] [Anonymous], 2012, SAGE
  • [3] [Anonymous], 2015, PRINCIPLES PRACTICE
  • [4] The cave of shadows: Addressing the human factor with generalized additive mixed models
    Baayen, Harald
    Vasishth, Shravan
    Kliegl, Reinhold
    Bates, Douglas
    [J]. JOURNAL OF MEMORY AND LANGUAGE, 2017, 94 : 206 - 234
  • [5] Mixed-effects modeling with crossed random effects for subjects and items
    Baayen, R. H.
    Davidson, D. J.
    Bates, D. M.
    [J]. JOURNAL OF MEMORY AND LANGUAGE, 2008, 59 (04) : 390 - 412
  • [6] If It's Difficult to Pronounce, It Might Not Be Risky: The Effect of Fluency on Judgment of Risk Does Not Generalize to New Stimuli
    Bahnik, Stepan
    Vranka, Marek A.
    [J]. PSYCHOLOGICAL SCIENCE, 2017, 28 (04) : 427 - 436
  • [7] Random effects structure for testing interactions in linear mixed-effects models
    Barr, Dale J.
    [J]. FRONTIERS IN PSYCHOLOGY, 2013, 4
  • [8] Random effects structure for confirmatory hypothesis testing: Keep it maximal
    Barr, Dale J.
    Levy, Roger
    Scheepers, Christoph
    Tily, Harry J.
    [J]. JOURNAL OF MEMORY AND LANGUAGE, 2013, 68 (03) : 255 - 278
  • [9] Bates D., 2015, Parsimonious mixed models. arXiv preprint arXiv:1506.04967, DOI DOI 10.18637/JSS.V067.I01
  • [10] Fitting Linear Mixed-Effects Models Using lme4
    Bates, Douglas
    Maechler, Martin
    Bolker, Benjamin M.
    Walker, Steven C.
    [J]. JOURNAL OF STATISTICAL SOFTWARE, 2015, 67 (01): : 1 - 48