Reliability From α to ω: A Tutorial

被引:301
作者
Revelle, William [1 ]
Condon, David M. [2 ,3 ]
机构
[1] Northwestern Univ, Dept Psychol, Swift Hall,2029 Sheridan Rd, Evanston, IL 60208 USA
[2] Northwestern Univ, Dept Med Social Sci, Evanston, IL 60208 USA
[3] Univ Oregon, Dept Psychol, Eugene, OR 97403 USA
基金
美国国家科学基金会;
关键词
reliability; generalizability; classical test theory; R packages; ITEM RESPONSE THEORY; COEFFICIENT ALPHA; INDIVIDUAL-DIFFERENCES; AFFECTIVE SYNCHRONY; INTERNAL STRUCTURE; WEIGHTED KAPPA; TRUE SCORES; PERSONALITY; LIFE; GENERALIZABILITY;
D O I
10.1037/pas0000754
中图分类号
B849 [应用心理学];
学科分类号
040203 ;
摘要
Reliability is a fundamental problem for measurement in all of science. Although defined in multiple ways, and estimated in even more ways, the basic concepts seem straightforward and need to be understood by practitioners as well as methodologists. Reliability theory is not just for the psychometrician estimating latent variables, it is for everyone who wants to make inferences from measures of individuals or of groups. For the case of a single test administration, we consider multiple measures of reliability, ranging from the worst (beta) to average (alpha, lambda(3)) to best ( lambda(4)) split half reliabilities, and consider why model-based estimates (omega(h), omega(t)) should be reported. We also address the utility of test-retest and alternate form reliabilities. The advantages of immediate versus delayed retests to decompose observed score variance into specific, state, and trait scores are discussed. But reliability is not just for test scores, it is also important when evaluating the use of ratings. Estimates that may be applied to continuous data include a set of intraclass correlations while discrete categorical data needs to take advantage of the family of kappa statistics. Examples of these various reliability estimates are given using state and trait measures of anxiety given with different delays and under different conditions. An online supplemental materials is provided with more detail and elaboration. The online supplemental materials is also used to demonstrate applications of open source software to examples of real data, and comparisons are made between the many types of reliability.
引用
收藏
页码:1395 / 1411
页数:17
相关论文
共 127 条
[1]   PSYCHOLOGY PSYCHOLOGISTS AND PSYCHOLOGICAL TESTING [J].
ANASTASI, A .
AMERICAN PSYCHOLOGIST, 1967, 22 (04) :297-&
[2]   IMPULSIVITY AND TIME OF DAY - IS RATE OF CHANGE IN AROUSAL A FUNCTION OF IMPULSIVITY [J].
ANDERSON, KJ ;
REVELLE, W .
JOURNAL OF PERSONALITY AND SOCIAL PSYCHOLOGY, 1994, 67 (02) :334-344
[3]   THE INTERACTIVE EFFECTS OF CAFFEINE, IMPULSIVITY AND TASK DEMANDS ON A VISUAL-SEARCH TASK [J].
ANDERSON, KJ ;
REVELLE, W .
PERSONALITY AND INDIVIDUAL DIFFERENCES, 1983, 4 (02) :127-134
[4]  
[Anonymous], 2019, R LANGUAGE ENV STAT
[5]  
[Anonymous], 2018, WILEY HDB PSYCHOMETR
[6]  
[Anonymous], PSYCHOMETRIKA, DOI DOI 10.1007/BF02288892
[7]  
[Anonymous], 1992, EMOTION REV PERSONAL
[8]  
[Anonymous], 1990, The Biopsychology of Mood and Arousal
[9]  
[Anonymous], 2016, RSTUDIO INT DEV ENV