Categorical data analysis: Away from ANOVAs (transformation or not) and towards logit mixed models

被引:2509
作者
Jaeger, T. Florian [1 ]
机构
[1] Univ Rochester, Rochester, NY 14627 USA
关键词
Arcsine-square-root transformation; Logistic regression; Mixed logit models; Categorical data analysis;
D O I
10.1016/j.jml.2007.11.007
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
This paper identifies several serious problems with the widespread use of ANOVAs for the analysis of categorical outcome variables such as forced-choice variables, question-answer accuracy, choice in production (e.g. in syntactic priming research), et cetera. I show that even after applying the arcsine-square-root transformation to proportional data, ANOVA can yield spurious results. I discuss conceptual issues underlying these problems and alternatives provided by modern statistics. Specifically, I introduce ordinary logit models (i.e. logistic regression), which are well-suited to analyze categorical data and offer many advantages over ANOVA. Unfortunately, ordinary logit models do not include random effect modeling. To address this issue, I describe mixed logit models (Generalized Linear Mixed Models for binomially distributed outcomes, Breslow and Clayton [Breslow, N. E. & Clayton, D. G. (1993). Approximate inference in generalized linear mixed models. Journal of the American Statistical Society 88(421), 9-25]), which combine the advantages of ordinary logit models with the ability to account for random subject and item effects in one step of analysis. Throughout the paper, I use a psycholinguistic data set to compare the different statistical methods. (C) 2007 Elsevier Inc. All rights reserved.
引用
收藏
页码:434 / 446
页数:13
相关论文
共 38 条
[1]  
[Anonymous], LINEAR MIXED M UNPUB
[2]  
[Anonymous], 2011, Categorical data analysis
[3]  
[Anonymous], 1971, Statistical Principles in Experimental Design
[4]  
[Anonymous], 1989, Analysis of binary data
[5]  
ARNON I, 2008, RETHINKING CHI UNPUB
[6]  
ARNON I, 2006, 9 ANN CUNY C HUM SEN
[7]   Mixed-effects modeling with crossed random effects for subjects and items [J].
Baayen, R. H. ;
Davidson, D. J. ;
Bates, D. M. .
JOURNAL OF MEMORY AND LANGUAGE, 2008, 59 (04) :390-412
[8]  
BARTLETT M. S., 1937, J. Roy. Statist. Soc., (Suppl.), V4, P137
[9]  
Bates D, 2007, IME4 LINEAR MIXED EF
[10]   Linear mixed models and penalized least squares [J].
Bates, DM ;
DebRoy, S .
JOURNAL OF MULTIVARIATE ANALYSIS, 2004, 91 (01) :1-17