Analyzing ordinal data with metric models: What could possibly go wrong?

被引:422
作者
Liddell, Torrin M. [1 ]
Kruschke, John K. [1 ]
机构
[1] Indiana Univ, Dept Psychol & Brain Sci, 1101 E 10th St, Bloomington, IN 47405 USA
关键词
Ordinal data; Likert; Ordered-probit; Bayesian analysis; LIKERT SCALE DATA; R PACKAGE; VIOLATIONS; VARIANCE; LEVEL;
D O I
10.1016/j.jesp.2018.08.009
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
We surveyed all articles in the Journal of Personality and Social Psychology (JPSP), Psychological Science (PS), and the Journal of Experimental Psychology: General (JEP:G) that mentioned the term "Likert," and found that 100% of the articles that analyzed ordinal data did so using a metric model. We present novel evidence that analyzing ordinal data as if they were metric can systematically lead to errors. We demonstrate false alarms (i.e., detecting an effect where none exists, Type I errors) and failures to detect effects (i.e., loss of power, Type II errors). We demonstrate systematic inversions of effects, for which treating ordinal data as metric indicates the opposite ordering of means than the true ordering of means. We show the same problems - false alarms, misses, and inversions - for interactions in factorial designs and for trend analyses in regression. We demonstrate that averaging across multiple ordinal measurements does not solve or even ameliorate these problems. A central contribution is a graphical explanation of how and when the misrepresentations occur. Moreover, we point out that there is no sure-fire way to detect these problems by treating the ordinal values as metric, and instead we advocate use of ordered-probit models (or similar) because they will better describe the data. Finally, although frequentist approaches to some ordered-probit models are available, we use Bayesian methods because of their flexibility in specifying models and their richness and accuracy in providing parameter estimates. An R script is provided for running an analysis that compares ordered-probit and metric models.
引用
收藏
页码:328 / 348
页数:21
相关论文
共 41 条
[1]  
[Anonymous], 1997, TECHNICAL REPORT
[2]   THE SENSITIVITY OF CONFIRMATORY MAXIMUM-LIKELIHOOD FACTOR-ANALYSIS TO VIOLATIONS OF MEASUREMENT SCALE AND DISTRIBUTIONAL ASSUMPTIONS [J].
BABAKUS, E ;
FERGUSON, CE ;
JORESKOG, KG .
JOURNAL OF MARKETING RESEARCH, 1987, 24 (02) :222-228
[3]   A GRAPHICAL EXPOSITION OF THE ORDERED PROBIT [J].
BECKER, WE ;
KENNEDY, PE .
ECONOMETRIC THEORY, 1992, 8 (01) :127-131
[4]  
Biirkner P.-C., 2018, ORDINAL REGRESSION M
[5]   THE EFFECTS OF VIOLATIONS OF ASSUMPTIONS UNDERLYING THE T-TEST [J].
BONEAU, CA .
PSYCHOLOGICAL BULLETIN, 1960, 57 (01) :49-64
[6]   brms: An R Package for Bayesian Multilevel Models Using Stan [J].
Buerkner, Paul-Christian .
JOURNAL OF STATISTICAL SOFTWARE, 2017, 80 (01) :1-28
[7]  
Carifio J., 2007, J Soc Sci, V3, P106, DOI [10.3844/jssp.2007.106.116, DOI 10.3844/JSSP.2007.106.116]
[8]   Resolving the 50-year debate around using and misusing Likert scales [J].
Carifio, James ;
Perla, Rocco .
MEDICAL EDUCATION, 2008, 42 (12) :1150-1152
[9]   Stan: A Probabilistic Programming Language [J].
Carpenter, Bob ;
Gelman, Andrew ;
Hoffman, Matthew D. ;
Lee, Daniel ;
Goodrich, Ben ;
Betancourt, Michael ;
Brubaker, Marcus A. ;
Guo, Jiqiang ;
Li, Peter ;
Riddell, Allen .
JOURNAL OF STATISTICAL SOFTWARE, 2017, 76 (01) :1-29
[10]  
Clason D.L., 1994, Journal of Agricultural Education, V35, P31, DOI [10.5032/jae.1994.04031, DOI 10.5032/JAE.1994.04031]